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PREFACE 


Digital Signal Processing (DSP) involves the representation, transmission, and manipulation of 
signals using numerical techniques and digital processors. It has been an exciting and growing 
technology during the past few years. Its applications have also been expanded vigorously to 
encompass not only the traditional radar signal processing but also today’s digital audio processing 
(for consumer laser disc players). 


In designing a DSP application, the designer faces the following immediate technical challenges: 


e Selecting digital signal processors powerful enough to perform the task, 

¢ Obtaining technical support as well as development tools from the semiconductor vendor, 
e Creating DSP algorithms to execute the application, and 

¢ Implementing these algorithms on the processors. 


These difficult but not impossible tasks challenge design engineers. Each of these problems must 
be resolved before a successful DSP product can be introduced. The purpose of this applications 
book is to serve as a reference guide for engineers who seek solutions to the above design problems 
in DSP applications. Readers will benefit from the DSP devices/tools introduced and the fundamental 
DSP operations and application examples presented in the book for either instant solutions or ideas 
for solutions to the above challenges. 


This book consists of three major parts. The first part briefly introduces the device architectures, 
characteristics, support, and development tools for the first two generations of the Texas Instruments 
TMS320 digital signal processors. Readers who are not familiar with these processors should begin 
their reading with this part of the book. The second part of the book covers some of the common 
DSP routines, such as Finite Impulse Response (FIR) and Infinite Impulse Response (IIR) filters 
and Fast Fourier Transforms (FFT), implemented using the TMS320 devices. Hardware interfacing 
and multiprocessing with these devices are also included. The last part of the book is applications 
specific. Some typical DSP applications are selected and thoroughly discussed. These applications 
are divided into two categories: telecommunications, and computers and peripherals (including 
speech coding/recognition, image/graphics, and digital control). 


The materials included in this book are primarily application reports, which have been generated 
by the digital signal processing engineering staff of the Texas Instruments Semiconductor Group. 
Some published articles and a technical report from MIT Lincoln Laboratory have also been reprinted 
herein to supplement the application reports in order to provide completeness of the subject matter. 
The application reports contain more complete theory and implementations (consisting of algorithms, 
TMS320 code, and/or schematics) than the reprinted articles. 


Readers who desire to obtain more information regarding the device characteristics and programming 
of the TMS320 digital signal processors should refer to other publications, such as the appropriate 
device User’s Guide or Data Sheet. References are usually included at the end of each report. 
A comprehensive list of literature published since 1982 on the TMS320 digital signal processors 
is provided in the appendix for interested readers to gain further knowledge on various aspects 
of the TI processors. 


The editor would like to credit the authors who contributed the application reports and articles 
included in the book. A special thanks and note of appreciation go to Maridene Lemmon for a 
thorough review of the entire manuscript. Many comments from members of the TI Semiconductor 
Digital Signal Processing staff, especially Maridene Lemmon and Mike Hames, have also greatly 
improved the structure of the book. 


Kun-Shan Lin, Ph.D. 
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A brief overview of digital signal processing and its major application areas is presented 
in this section. A guide to how the book is organized and can be used is also provided. 


Overview of Digital Signal Processing 


In the last decade, Digital Signal Processing (DSP) has made tremendous progress in 
both the theoretical and practical aspects of the field.1-5 While more DSP algorithms 

are being discovered, better tools are also being developed to implement these algorithms. 

One of the most important breakthroughs in electronic technology is the high-speed 

digital signal processors. These single-chip processors are now commercially available 

in Very Large-Scale Integrated (VLSI) circuits from semiconductor vendors. Digital 

signal processors are essentially high-speed microprocessors/microcomputers, designed 

specifically to perform computation-intensive digital signal processing algorithms. By 

taking advantage of the advanced architecture, parallel processing, and dedicated DSP 

instruction sets, these devices can execute millions of DSP operations per second. This 
capability allows complicated DSP algorithms to be implemented in a tiny silicon chip, 
which previously required the use of a minicomputer and an array processor. 


With this VLSI advancement, innovative engineers in industry are discovering more 
and more applications where digital signal processors can provide a better solution than 
their analog counterparts for reasons of reliability, reproducibility, compactness, and 
efficiency. These digital signal processors are also highly programmable, which makes 
them very attractive for (1) system upgrades, in the case of advancements in DSP 
algorithms, and (2) multitasking where different tasks can be performed with the same 
device by simply changing its program. Because of these and many other advantages, 
digital signal processors are becoming more prevalent in areas of general-purpose digital 
signal processing, telecommunications, voice/speech, graphics/imaging, control, 
instrumentation, and the military. Table 1 lists some applications in these areas. 


Table 1. Typical Applications of the TMS320 Family 


GENERAL-PURPOSE DSP GRAPHICSAMAGING INSTRUMENTATION 


Digital Filtering 3-D Rotation Spectrum Analysis 
Convolution Robot Vision Function Generation 
Correlation Image Transmission/ Pattern Matching 
Hilbert Transforms Compression Seismic Processing 
Fast Fourier Transforms Pattern Recognition Transient Analysis 
Adaptive Filtering Image Enhancement Digital Filtering 
Windowing Homomorphic Processing Phase-Locked Loops 
Waveform Generation Workstations 

Animatior/Digital Map 


VOICE/SPEECH CONTROL MILITARY 


Voice Mail Disk Control Secure Communications 
Speech Vocoding Servo Control Radar Processing 

Speech Recognition Robot Control Sonar Processing 
Speaker Verification Laser Printer Control Image Processing 

Speech Enhancement Engine Control Navigation 

Speech Synthesis Motor Control Missile Guidance 

Text to Speech Radio Frequency Modems 


TELECOMMUNICATIONS AUTOMOTIVE 


Echo Cancellation FAX Engine Control 
ADPCM Transcoders Cellular Telephones ’ Vibration Analysis 
Digital PBXs Speaker Phones Antiskid Brakes 

Line Repeaters Digital Speech Adaptive Ride Control 
Channel Multiplexing Interpolation (DS!) Global Positioning 
1200 to 19200-bps Modems X.25 Packet Switching Navigation 

Adaptive Equalizers Video Conferencing Voice Commands 
DTMF Encoding/Decoding Spread Spectrum Digital Radio 

Data Encryption Communications Celiutar Telephones 


Radar Detectors Robotics Hearing Aids 
Power Tools Numeric Control Patient Monitoring 
Digital Audio/TV Security Access Ultrasound Equipment 
Music Synthesizer Power Line Monitors Diagnostic Tools 
Educational Toys Prosthetics 

Fetal Monitors 


Organization of the Book 


This book is organized into three major parts. Part I contains a brief introduction to 
digital signal processing and a cursory review of the first two generations of the TMS320 
digital signal processors, their characteristics, and the support available from Texas 
Instruments and third parties. 


Part II consists of a collection of common DSP routines and interfaces for implementing 


DSP algorithms using the TMS320 processors. These DSP routines are: Finite Impulse 
Response (FIR)Anfinite Impulse Response (IIR) filters, Fast Fourier Transforms (FFT), 
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Pulse Code Modulation (PCM) companding, floating-point arithmetic, precision digital 
sine-wave generation, and matrix operations. Interfacing the TMS320 to external memory 
devices and microprocessors, and using the TMS320 in a multiprocessing environment 
are also included. These DSP routines and interfaces serve as the basis for developing 
DSP applications. 


Part III is applications specific. Some typical DSP applications are encompassed and 
catagorized into two areas: telecommunications and computers and peripherals. In the 
telecommunications area, six applications are selected: telecommunications interfacing 
using the TMS32010, a single-chip TMS32020 echo canceller, implementation of the 
Data Encryption Standard (DES) algorithm using the TMS32010, 32-kbit/s Adaptive 
Differential Pulse Code Modulation (ADPCM) transcoders using the TMS32010, 
TMS32010 16-kbit/s subband coders, and a Dual-Tone MultiFrequency (DTMF) 
encoder/decoder using the TMS32010. For computers/peripherals, which has become 
a fast expanding area for numerous DSP applications, reprints of a series of articles 
on speech vocoding at 2.4-kbit/s and 9.6-kbit/s and speech recognition with the 
TMS32010 are included. An application report on the TMS32020 for image/graphics 
applications is also presented. A detailed report on the design of a digital control system 
with the TMS32010 digital signal processor concludes Part III. 


Readers are encouraged to adapt these DSP routines, algorithms, and implementations 
to their applications using the TMS320. It is important to bear in mind that the materials 
given in each report and article only serve as examples. Further optimization of either 
the code or the circuit to meet specific performance/cost goals is possible. These goals 
are usually application-dependent. Readers need to make the appropriate tradeoffs to 
fit their specific design criteria. 


In summary, even with 2 book of this size, it is impossible to cover all'the DSP routines 
and applications. Because of the TMS320 family’s excellent computation power, high 
programmability, and complete development tools, more applications will be created 
in the future. These applications will extend beyond the two areas selected for this volume. 
The TMS320 family of processors will continue to provide cost-/performance-effective 
solutions in the dynamic field of digital signal processing. 


References 


The following references in digital signal processing may be helpful to the reader of 
this book: 


1. A. Oppenheim and R. Schafer, Digital Signal Processing, Prentice-Hall (1975). 


2. L. Rabiner and B. Gold, Theory and Application of Digital Signal Processing, Prentice- 
Hall (1975). 


3. A. Oppenheim (editor), Applications of Digital Signal Processing, Prentice-Hall, 
(1978). 


4. Issues of IEEE Transactions on Acoustics, Speech, and Signal Processing. 


5. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal 
Processing. 
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This section provides a brief description of the TMS320 digital signal processing family. 
The family’s device architectures, characteristics, and features suitable for digital signal 
processing applications are discussed. To succeed in designing a DSP application, 
comprehensive development and support tools are required. Texas Instruments provides 
a whole family of tools to help DSP engineers. These tools are summarized in this section. 
Extensive publications produced by Texas Instruments include device specifications, 
device user’s guides, and development tools reference guides. A comprehensive list 
of these publications is also included. Please see page 19 for RTC addresses and 
phone numbers. 


TMS320 Digital Signal Processing Family 


The Texas Instruments TMS320 product line contains a family of digital signal processors, 
designed to support a wide range of high-speed or numeric-intensive DSP applications. 
These 16/32-bit single-chip microprocessors/microcomputers combine the flexibility 
of a high-speed controller with the numerical capability of an array processor, thereby 
offering an inexpensive alternative to a multichip bit-slice processor or an expensive 
commercial array processor. 


The TMS320 family contains the first MOS microprocessor capable of executing five 
million instructions per second. This high throughput is the result of the comprehensive, 
efficient, and easily programmed instruction set and of the highly pipelined architecture. 
Special instructions, such as multiply/accumulate with fast data move, have been 
incorporated to speed the execution of DSP algorithms. A comprehensive set of general- 
purpose microprocessor instructions is also included. For example, the branch instructions 
encompass all the various conditions of the accumulator. Three different addressing 
modes are provided: direct, indirect, and immediate. A full set of Boolean instructions 
is included for testing bits. Bit extractions and interrupt capabilities are also part of 
the features of the TMS320 processors. | 


Architecturally, the TMS320 utilizes a modified Harvard architecture for speed and 
flexibility. In a strict Harvard architecture, the program and data memories lie in two 
separate spaces, permitting a full overlap of the instruction fetch and execution. The 
TMS320 family’s modification of the Harvard architecture allows transfer between 
program and data spaces, thereby increasing the flexibility of the device. This architectural 
modification eliminates the need for a separate coefficient ROM and also maximizes 
processing power by maintaining two separate bus structures (program and data) for 
full-speed execution. 


The Texas Instruments TMS320 family consists of two generations of digital signal 
processors. The first generation contains the TMS32010 and its offspring: TMS320C10, 
TMS32011, and TMS32010-25. The TMS32020 and TMS320C25 are the second- 
generation processors. The features described thus far are common among the processors 
in the family. Some specific features are added in each processor to provide different 
cost/performance tradeoffs. These features will be covered in the remaining part of this 
section. The flexibility of the TMS320 family gives a DSP designer alternatives in 
selecting a device in the family that can best serve his applications. 


The two generations of digital signal processors can be plotted on a hypothetical 
performance and time scale, as shown in Figure 1. Several trends in digital signal 
processors can be observed from the figure. Offspring of the first two generations of 


PERFORMANCE 


processors are becoming available to provide better DSP system integration and 
cost/performance tradeoffs. Specifically, peripheral circuits are being integrated into 
the DSF device to reduce chip counts, board space, power consumption, and system 
cost. Because of the low-power and high-speed advantages of CMOS circuits, DSP 
devices are also being introduced in CMOS in addition to their NMOS counterparts. 
In the future, newer generations of DSP processors will also be needed to meet higher 
performance requirements for some applications and further expand the DSP horizon. 
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Figure 1. The Generations of Digital Signal Processors 
TMS32010 Digital Signal Processor 


The TMS32010! is the first-generation digital signal processor. Its hardware implements 
many functions that other processors typically perform in software. For example, this 
device contains a hardware multiplier that performs a 16 x 16-bit multiplication with . 
a 32-bit result in a single 200-ns cycle. A hardware barrel shifter is used to shift data 
on its way into the ALU. Extra hardware has been included so that auxiliary registers, 
which provide indirect data RAM addresses, can be configured in an auto- 
increment/decrement mode for single-cycle manipulation of data tables. This hardware- 
intensive approach gives the design engineer the type of power previously unavailable 
on a single chip. 


The TMS32010 has both microprocessor and microcomputer modes selectable via the 
device MC/MP pin. When used in the microcomputer mode, the TMS32010 is equipped 
with a 1536-word ROM, which is mask-programmed at the factory with a customer’s 
program. It can also execute from an additional 2560 words of off-chip program memory 
at full speed. This memory expansion capability is especially useful for applications 
sharing the same subroutines. In this case, the common subroutines can be stored on- 
chip while the application-specific code is stored off-chip. When used in the 
microprocessor mode, the TMS32010 can execute full-speed 4096-word off-chip 
instructions. Figure 2 shows the functional block diagram of the TMS32010 processor. 
The device is fabricated in a 2.4 NMOS technology and has a chip area of 51K square 
mil. It is produced in a 40-pin dual-in-line package and a 44-pin plastic-leadless-chip- 
carrier (PLCC). dissipating 950 mW (typically). The maximum clock frequency is 20.5 
MHz for an instruction rate of five million instructions per second. 
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Some of the key features of the TMS32010 are: 


200-ns instruction cycle 

1.5K words (3K bytes) of program ROM 

144 words (288 bytes) of data RAM 

16 x 16-bit parallel multiplier 

External memory expansion to 4K words (8K bytes) at full speed 
Interrupt with context save ; 

Barrel shifter 

On-chip clock 

Single 5-volt supply, NMOS technology, 40-pin DIP and 44-pin PLCC. 


x1 
CLKOUT X2/CLKIN 
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T = T register 
SHIFTER (0, 1, 4) 
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DATA BUS 


Figure 2. Functional Block Diagram of the TMS32010 


TMS320C10 Digital Signal Processor 


The TMS320C 102 is essentially a CMOS replica of the TMS32010. The device is both 
plug-in and object-code compatible to its NMOS counterpart, the TMS32010. The 
TMS320C10 also has an instruction cycle time of 200 ns. The device is fabricated in 
a 24 CMOS technology with power consumption of 100 mW (typically) at 20-MHz 
operation. Because of the low-power consumption, the CMOS TMS32010 is very useful 
for power-sensitive applications, such as digital telephony and portable consumer 
products. 


TMS32011 Digital Signal Processor 


The TMS320113 is a dedicated microcomputer with 1.5K words of on-chip program 
ROM (and no external memory expansion) intended for high-volume applications. The 
TMS32011 is essentially a TMS32010 with the address bus stripped off and two serial 
ports integrated on-chip. In addition, PCM companding (y/A-law to and from linear 
PCM) functions and a timer have been implemented in hardware to reduce the program 
memory size and increase the CPU utilization for applications using codecs. The device 
has all the TMS32010 hardware features, such as a 200-ns instruction cycle time, 16 
x 16-bit multiplier, and 32-bit ALU/accumulator. The instruction sets are fully 
compatible, enabling existing TMS32010 development tools to be used in TMS32011 
applications. 


The following key features distinguish the TMS32011 microcomputer: 


Dual-channel serial port for full-duplex serial communication 

Direct interface to combo-codec and PCM highway systems 
Serial-port timer 

Internal framing-pulse generation 

On-chip companding hardware for p-law and A-law PCM conversions 
Object-code compatible with the TMS32010 instruction set 
Compatible with TMS32010 development support tools 

Peripheral mode to TMS32010 for application development 

1.5K words (3K bytes) of program ROM 

144 words (288 bytes) of data RAM. 


TMS32010-25 Digite! Signal Processor 


The TMS32010-25 is a 160-ns instruction cycle time version of the TMS32010. This 
microprocessor is intended for higher performance applications using off-chip memory, 
which require 25 percent greater processor throughput (6.25 million instructions per 
second) than the TMS32010. Existing TMS32010 designs can simply speed up the system 
clock to 25 MHz to take advantage of the increased processor throughput without any 
software redesign. The device is capable of accessing 4K words of external program 
memory. Other key features of the TMS32010-25 are the same as the ones in the 20-MHz 
version of the TMS32010. 


TMS32020 Digital Signal Processor 
The TMS320204 digital signal processor is the second-generation member of the 
TMS320 family of VLSI processors. The TMS32020 architecture is based upon that 


of the TMS32010, the first member of the TMS320 family. The TMS32020 greatly 
enhances the memory spaces of the processor, providing 544 words of on-chip data 


2. The TMS320 Family 


2. The TMS320 Family 


and program memory. Increased throughput is accomplished by means of single-cycle 
multiply/accumulate instructions with a data move option, five auxiliary registers with 
a dedicated arithmetic unit, and faster /O necessary for data-intensive applications. 
The TMS32020 has a comprehensive instruction set of 109 instructions to increase 
software throughput and ease of development. The processor contains special repeat 
instructions for streamlining program space and execution time. 


The architectural design of the TMS32020 emphasizes overall system speed, 
communication, and flexibility in processor configuration. Control signals and software 
instructions provide block memory transfer, communication to slower off-chip devices, 
multiprocessing implementations, and floating-point support. Peripheral functions, such 
as the hardware timer and serial port, have been integrated on-chip to reduce overall 
system cost. The combination of increased memory (both on-chip and off-chip), expanded 
instruction set (for example, single-cycle multiply/accumulate), and additional hardware 
features give the TMS32020 two to three times the performance over the TMS32010 
in DSP applications. 


Figure 3 shows the functional block diagram of the TMS32020. The TMS32010 source 
code is upward-compatible with the TMS32020 source code, and can be assembled using 
the TMS32020 Macro Assembler. The TMS32020 is fabricated in a /4p NMOS 
technology and has a chip area of 119K square mil. It is produced in a 68-pin grid array 
package and has a typical power consumption of 1.2 W. The maximum clock frequency 
is 20.5 MHz for an instruction rate of five million instructions per second. 


Some of the key features of the TMS32020 are: 


e 544 words of on-chip data RAM (256 words configurable as either data or 
program memory) 

128K words of memory space (64K words program and 64K words data) 
Single-cycle multiply/accumulate instructions 

Repeat instructions 

200-ns instruction cycle 

Serial port for multiprocessing or codec interface 

Sixteen input and sixteen output channels 

16-bit parallel interface 

Directly accessible external data memory space 

Global data memory interface 

Block moves for data/program memory 

TMS32010 software-upward compatibility 

Instruction set support for floating-point operations 

On-chip clock 

Single 5-volt supply, NMOS technology, 68-pin grid array package. 
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Figure 3. Functional Block Diagram of the TMS32020 
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TMS320C25 Digital Signal Processor 


The TMS320C25 is a pin-compatible CMOS version of the TMS32020 with additional 
features to further enhance the processor speed, system integration, and ease of application 
development. The TMS320C25’s faster instruction cycle time of 100 ns, additional 
instructions, and additional on-chip hardware such as an 8-deep hardware stack and 
8 auxiliary registers result in two to three times the throughput of its predecessor, the 
TMS32020. The inclusion of a large on-chip masked ROM (4K words) makes the 
TMS320C25 ideal for single-chip DSP applications, thus reducing the power, cost, and 
board space of many applications. All 109 TMS32020 instructions are implemented 
on the device with object-code compatibility. Two new instructions (MPYA and ZALR) 
on the device allow an LMS adaptive filter tap and update to be performed in 4 machine 
cycles. A 256-tap adaptive filter can thus be sampled at 8 kHz to be executed on a single 
chip (with no external memory) in real time. The device is fabricated in 1.84 CMOS 
technology and is packaged in a 68-pin PLCC. Figure 4 shows the functional block 
diagram of the TMS320C25. 


The following outlines the major enhancements of the TMS320C25 over the TMS32020: 


© Two versions: 100-ns instruction cycle time 
125-ns instruction cycle time 
4K x 16-bit on-chip masked ROM 
Object-code compatible with TMS32020 
MAC/MACD operation with external program memory 
Double-buffered static serial port 
T1/G.711 transmission interface 
Unsigned multiply and carry bit for complex arithmetic 
Bit-reversed addressing for FFTs 
Powerdown mode 
Eight auxiliary registers 
Eight-deep hardware stack 
MPYA and rounding for adaptive filtering 
Concurrent DMA via redefined HOLD mode 
1.84 CMOS, 68-pin PLCC. 
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The TMS320 family’s unique versatility, computational power, and high I/O throughput 
give the DSP designer a new powerful solution to a variety of complicated applications. 
Some of these applications are discussed in Part III of this book. 


Development and Support Tools 


In developing an application, problems are usually encountered, and questions must 
be answered before funding to start the project, or to continue it, is granted. Oftentimes 
the tools and vendor support provided to the designer are the difference between the 
success and failure of the project. This is especially true when using state-of-the-art 
electronic devices, such as digital signal processing ICs. 


The TMS320 family of digital signal processors has a wide range of development tools 
available (see Figure 5). These tools range from very inexpensive evaluation modules" 
for application evaluation and benchmarking purposes, assembler/linkers, and software 
simulators to full-capability hardware emulators. For a complete listing of the support 
available from TI, DSP text/reference books, design services and support offered by 
TI Regional Technology Centers and third parties, please refer to the Development 
Support Reference Guide.5 A brief summary of them is provided in the succeeding 
subsections. 


2. The TMS320 Family 


Figure 5. TMS320 Product Development Tools 
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Software Tools 


Assembler/linkers and software simulators are available on PC and VAX for users to 
develop and debug TMS320 DSP algorithms. Their features are described as follows: 


ASSEMBLER/LINKER4-9: The TMS320 Macro Assembler translates TMS320 
assembly language source code into executable object code. The assembler allows the 
programmer to work with mnemonics rather than hexadecimal machine instructions and 
to reference memory locations with symbolic addresses. The macro assembler supports 
macro calls and definitions along with conditional assembly. The TMS320 Linker permits 
a program to be designed and implemented in separate modules that will later be linked 
together to form the complete program. The linker resolves external definitions and 
references for relocatable code, creating an object file that can be executed by the TMS320 
Simulator, TMS320 Emulator, or the TMS320 processor. The TMS320 Macro 
Assembler/Linker is currently available for the VAX/VMS, TI PC/MS-DOS, and IBM 
PC/PC-DOS operating systems for both generations of DSP processors. 


SIMULATOR®,9: The TMS320 Simulator is a software program that simulates 
operations of the TMS320 digital signal processor to allow program verification. The 
debug mode enables the user to monitor the state of the simulated TMS320 while the 
program is executing. The simulator uses the TMS320 object code produced by the 
TMS320 Macro Assembler/Linker. During program execution, the internal registers 
and memory of the simulated TMS320 are modified as each instruction is interpreted 
by the host computer. Once program execution is suspended, the internal registers and 
both program and data memories can be inspected and/or modified. The TMS320 
Simulator is currently available for the VAX/VMS, TI PC/MS-DOS, and IBM PC/PC- 
DOS operating systems for both generations of DSP devices. 


Hardware Tools 


Powerful TMS320 evaluation and emulation tools provide in-circuit emulation and 
hardware program debugging (such as hardware breakpoint/trace) for developing and 
testing DSP algorithms in a real product environment. The following paragraphs provide 
a brief description of these tools. 


EVALUATION MODULE (EVM)!1: The EVM allows the designer to determine at 
minimal cost if the TMS320 meets the timing requirements of an application. It is a 
standalone single-board module that contains all of the tools necessary to evaluate the 
device as well as provide in-circuit emulation. The powerful firmware package on the 
EVM contains a debug monitor, editor, assembler, reverse assembler, EPROM 
programmer, software communication to two EIA ports, and an audio cassette interface. 
Communication to a host computer and several peripherals is provided on the TMS320 
EVM. Dual EIA ports allow the EVM to be connected to a terminal and either a host 
computer or a line printer. The EVM accepts either source or object code downloaded 
from the host computer. The resident assembler converts incoming source text into 
executable code in just one pass by automatically resolving labels after the first assembly 
pass is completed. When a session is finished, code is saved via the host computer, 
audio cassette recorder, or EPROM programmer. The EVM is currently available for 
the evaluation of the TMS32010. 


EMULATOR (XDS)!2: The TMS320 XDS Emulator is a powerful, sophisticated 


development tool providing full-speed in-circuit emulation with real-time hardware 
breakpoint/trace and program execution capability from target memory. The XDS allows 
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integration of the hardware and software modules in the debug mode. By setting 
breakpoints based on internal conditions or external events, execution of the program 
can be suspended and control given to the debug mode. In the debug mode, all registers 
and memory locations can be inspected and modified. Single-step execution is available. 
Full-trace capabilities at full speed and a reverse assembler that translates machine code 
back into assembly instructions also increase debugging productivity. The XDS system 
is designed to interface with either a terminal or a host computer. The object code 
generated by the assembler/linker can be downloaded to the XDS and then controlled 
through a terminal. The XDS is available for both the TMS32010 and TMS32020. 


ANALOG INTERFACE BOARD (AIB)13: The AJB is an analog-to-digital and digital- 

to-analog conversion board that can be used in conjunction with the EVM or XDS. 
It can also be used in an educational environment to aid in familiarizing the student 
with digital signal processing techniques. Analog-to-digital and digital-to-analog 
converters with 12-bit resolution and anti-aliasing and smoothing filters with cutoff 
frequency programmable from 4.7 kHz to 20 kHz are included on-board. The AJB is 
designed to adapt to either the TMS32010 or the TMS32020. 


A summary of these tools and their TI part numbers is shown in Tables 2 and 3 for 
the TMS32010 and TMS32020, respectively. 


Table 2. TMS32010 Family Hardware and Soztware Support Tools 


HOST COMPUTER OPERATING SYSTEM PART NUMBER 


MACRO ASSEMBLERS/LINKERS 
DEC VAX VMS TMDS32402 10-08 


TI/IBM PC MS/PC-DOS TMDS32408 10-02 


SIMULATORS 


DEC VAX VMS TMDS324021 1-08 
TI/IBM PC MS/PC-DOS TMDS324081 1-02 


HARDWARE TOOL PART NUMBER 


Evaluation Module (EVM) TMDS3268811-03 
Analog Interface Board (AIB) TMDS3268812-06 
Emulator (XDS/22) __TMDS3262281 


Table 3. TMS32020 Family Hardware and Software Support Tools 


HOST COMPUTER OPERATING SYSTEM PART NUMBER 


MACRO ASSEMBLERS/LINKERS 


DEC VAX VMS TMDS3241210-08 
TI/IBM PC MS/PC-DOS TMDS3241810-02 


SIMULATORS 


DEC VAX VMS TMDS3241211-08 
TI/IBM PC MS/PC-DOS TMDS3241811-02 


HARDWARE TOOL PART NUMBER 


Analog Interface Board (AIB) Adaptor RTC/ADP320A-06 
Emulator (XDS/11) TMDS3261122 
Emulator (XDS/22) TMDS3262291 


Digital Filter Design Package 


In addition to the above design tools, Texas Instruments has also made available a Digital 
Filter Design Package (DFDP)>, developed by Atlanta Signal Processors Inc. (ASPI). 
The package runs on both TI and IBM PCs. The DFDP allows the user to design a 
digital filter (lowpass, highpass, bandpass, and bandstop types) using a menu-driven 
approach. A filter can be designed by simply specifying its characteristics. After the 
filter design, the DFDP can automatically generate the TMS320 code, which later can 
be integrated into a designer’s application. The DFDP can be ordered through TI or 
ASPI using TI part number DFDP-IBMO01 for the IBM PC version and DFDP-TIOO1 
for the TI PC. 


RTC and Third-Party Support 


The TI Regional Technology Centers (RTC) are staffed with qualified engineers to provide 
technical support to the TMS320 customers. TMS320 hands-on workshops are also 
offered by professional instructors in the RTC to give engineers a quick start designing 
with the TMS320 digital signal processors. Hands-on exercises are included in the 
workshop to familiarize students with various hardware and software design tools. Many 
design examples are given throughout the course, which allows students to learn and 
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practice essential TMS320/DSP design skills. For further information, please contact 
the nearest RTC through the following phone numbers: 


GERMANY HOLLAND 
8050 Freising - Tel. : 08161/80-4002 1100 AZ Amsterdam-Zuidoost - Tel. : (020) 5602911 
3000 Hannover 51 - Tel. : 0511/64 80 21 
4300 Essen 1 - Tel. : 0201/24 25-0 ITALY | _ 
7302 Ostfildern 2/Nellingen - Stuttgart 02015 Cittaducale (Rieti) - Tel. : (0746) 6941 

Tel. : 0711/34 03-0 SWEDEN 

100 54 Stockholm - Tel. : 

PRANGE | ockholm - Tel. : (08) 235480 
78141 Vélizy-Villacoublay Cedex - Tél. (1) 39 469712 UNITED KINGDOM 
06270 Villeneuve-Loubet - Tel. : 93 20 O1 01 Bedford, MK41 7PA - Tel. : (0234) 67466 


TI also provides a team of third parties5 with DSP expertise to support customer design. 
An extensive list of their names, phone numbers, and areas of support is given as follows: 


COMPANY PHONE NUMBER TMS320 SUPPORT 
Alembic Computer (213) 306-2865 IBM PC Speech Development 
System Inc. 

Allen Ashley (818) 793-5748 MS-DOS/PC-DOS, CP/M-80 
Assemblers 
American (714) 731-1661 EZ-PRO In-Circuit Emulator 
Automation 
Atlanta Signal (404) 892-7265 Digital Filter Design Package, 
Processors Inc. Algorithm Development 
(ASP) Package 
Bedford Research (617) 275-7246 PDP and VAX Interactive 
Signal Processing Software 
Burr Brown (602) 746-1111 VMEBUS Board 
Computalker (213) 828-6546 IBM PC, CP/M, Apple II 
Assemblers, Simulators 
Digital Signal (613) 825-5476 PDP-11, VAX, TI/IBM PC 
Processing Software Inc. 
Digital Sound Corp. (805) 569-0700 Audio Data Conversion 
System for Speech 
Development 
DSP Technology (214) 247-8831 Four-Channel DTMF 
Detectors 
Gas Light Software (713) 729-1257 PC Digital Filter Software 
Hewlett-Packard Contact the Local HP Logic Development 
HP Sales Office System Assembler/Linker 
Kontron Electronics (800) 227-8834 Logic Analyzer, Disassembler 
Lawrence Livermore 
Lab. Livermore, CA Signal Processing Software 


COMPANY 


Microstuf, Inc. 


Pacific Microcircuits 


Pratika SRL 


PH Associates 


Signal Technology, Inc. 


Signix Corporation 


SKY Computers, Inc. 


Televic 


Texas Instruments Inc. 


Thorn EMI Elec. 
TIAC Corp. 


Voice Control Systems 
Inc. 


Votan 


Whitman Engineering 


PHONE NUMBER 


(404) 952-0267 


(604) 536-1886 


Torino, Italy 


(703) 281-5762 


(800) 235-5787 


(617) 358-5955 


(617) 454-6200 


Belgium 


(512) 250-7474 


Australia 


(604) 461-0120 


(214) 248-8244 


(415) 490-7600 


(305) 628-4516 


TMS320 SUPPOKf 


TI/IBM PC Data 
Communications Software 
(CROSSTALK XVI) 


TMS32010 Peripheral 
Interface Chip 


Olivetti PC Assembler, Linker 
Loader 


IBM PC, CP/M, PDP-11, 
UNIX, TEKTRONIX 
Assemblers 


Speech/DSP Software (ILS) 


IBM PC Digital Filter 
Design/Evaluation 


PDP-11 Q-Bus/IBM PC-Bus 
Real-Time Processor Boards, 
Assembler, C Compiler 


SPL Compiler for VAX and 
PDP-11 


TI/IBM PC Speech Cards 
for Speech Vocoding 

and Recognition 

S/W Support for Eclipse 
DSP Hardware for IBM PC 
Speech Recognition 

IBM PC Speech -Cards 
Speech Chip Sets 


DSP Development System 


DSP Text/Reference Books 


A series of DSP books have been or are being written by experts in the field to help 
both university students and practicing engineers. These books feature DSP theory, 
algorithms, applications, and TMS320 implementations, and are available from 
publishers, John Wiley & Sons and Prentice-Hall. The first book in the series, DFT/FFT 
and Convolution Algorithms, is now available from John Wiley & Sons. Since these 
books cover both the theoretical and practical aspects of DSP, they can be used by 
universities as textbooks or DSP engineers as reference guides. The titles, authors, 
publishers, and availability of these books are provided as follows: 
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TITLE AUTHOR 


DFT/FFT and C. S. Burrus and 
Convolution Algorithms TT. W. Parks 
Digital Filter Design | T. W. Parks and 
C. S. Burrus 
Adaptive Filter Design R. Johnson, Jr. 
et al 
Digital Control -W. Kohn 


Practical Approaches to P. Papamichalis 
Speech Coding 


For ordering information from publishers, please write or call: 


John Wiley & Sons, Inc. 
605 Third Avenue 

New York, NY 10158 
1-800-526-5368 


TMS320 Documentation Support 


PUBLISHER 


John Wiley 
& Sons 


John Wiley 
& Sons 


John Wiley 
& Sons 


John Wiley 
& Sons 


Prentice- 
Hall 


Prentice-Hall,Inc. 
Route 9W 

Englewood Cliffs, NJ 07632 
201-767-9520 


AVAILABILITY. 


now 


1986 


1986 


1986 


1986 


Extensive publications are produced by Texas Instruments to support the TMS320 digital 
signal processing family. A list of these publications is given below. 


1. TMS32010 User’s Guide (SPRUOOIB). 


2. TMS320C10 Data Sheet (SPRS006). 


3. TMS32011 User’s Guide (SPRUO10). 
4. TMS32020 User’s Guide (SPRUOO4A). 


5. TMS320C25 User’s Guide (SPRUO12). 


6. TMS320 Development Support Reference Guide (SPRU007A). 


7. TMS32010 Assembly Language Programmer’s Guide (SPRU002B). 


8. Link Editor User’s Guide (SPDU037C). 


9. TMS32010 Crossware Installation Guide (SPDU049). 


10. TMS320 Family Simulator User’s Guide (SPRU009). 


11. TMS32010 Evaluation Module User’s Guide (SPRUOOSA). 


12. XDS/22 TMS32010 Emulator User’s Guide (SPDUO15). 


13. TMS32010 Analog Interface Board User’s Guide (SPRU006). 
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Some common digital signal processing routines and interface circuits are frequently 
used in DSP applications. For example, the same structure of a digital filter used for 
audio signal processing may also be used for a modem in data communications. A Fast 
Fourier Transform (FFT) routine can be used for analyzing signals both in 
instrumentations and in speech coding. Another example is the interface of a digital 
signal processor to external memory devices in a standalone system or in a multiprocessing 
environment. A collection of application reports containing these routines and interfaces 
is included in Part II. Theory, block diagrams, algorithms, and TMS320 implementations 
are provided in these reports. The DSP Software Library includes code for the major 
DSP routines and applications. 


Part II begins with a series of reports containing DSP routines. The first report discusses 
the implementation of Finite Impulse Response (FIR)/Infinite Impulse Response (IIR) 
filters using the TMS32010 and TMS32020. Filters designed with digital processors, 
such as the TMS320, are superior over their analog counterparts for better specifications, 
stability, performance, and reproducability. The report describes a variety of methods 
for implementing FIR/IIR filters using the TMS320. The TMS320 algorithm execution 
time and data memory requirements are considered. Tradeoffs between several different 
filter structures are also discussed. This application report complements the Digital Filter 
Design Package (DFDP) discussed in Section 2. 


Fast Fourier Transforms (FFT), containing a class of computationally efficient algorithms 
implementing the Discrete Fourier Transforms (DFT), are widely used in DSP 
applications. In the report on FFT, the development of the FFT from the continuous 
Fourier Transform and DFT is first presented. Issues regarding the implementation of 
the FFT with the TMS32020 processor are then discussed, such as scaling, special FFT 
structures, and system memory and I/O considerations. The report also includes the 
TMS32020 code for 256-point and 1024-point FFT algorithms. The next report discusses 
companding routines. Companding is required for applications that use codec devices, 
such as in public and private telephone networks. With the speed and the versatility 
of the TMS320, companding can be performed in either software or hardware. The 
report describes both the A-law and y-law software companding methods. Programs 
are also provided to show how the software companding can be performed using the 
computational power of the TMS32010 and the TMS32020. An example of the hardware 
companding is presented in Section 14 of Part III. 


Although The TMS32010 and TMS32020 are fixed-point 16/32-bit digital signal 
processors, they can also perform floating-point computations at a speed comparable 
to dedicated floating-point processors. The next two reports present algorithms and code 
implementing floating-point addition, subtraction, multiplication, and division with the 
TMS320. The support of floating-point operations by the TI processors has made possible 
some applications, such as the implementation of the CCITT Adaptive Differential Pulse 
Code Modulation (ADPCM) algorithm and image/graphics operations. 


Sine-wave or waveform generations are used in instrumentations and communications. 
Both speed and accuracy are major concerns for these applications. The report in Section 
8 presents two methods of sine-wave generation. The first method is a fast direct table 
lookup scheme suitable for applications where speed is critical. The second approach, 
an enhancement of the first, includes linear interpolation to provide higher accurate 
waveforms. The last report in the DSP routines portion of Part II is on matrix 
multiplication with the TMS32010 and TMS32020. Matrix multiplication is useful in 
applications, such as graphics, numerical analysis, or high-speed control. Because of 
the high speed of the multiply/accumulate operations and fast data I/O, both processors 
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can multiply in microseconds large matrices with their sizes only limited by the internal 
data memory. Programs are included in the report to illustrate matrix multiplication 
on both processors. 


The second half of Part II encompasses various hardware interface techniques useful 
for integrating systems using the TMS320 digital signal processors. The first two reports 
describe interface circuits for the TMS32010 to asynchronous inputs and to external 
memory devices, such as external ROM or RAM. A description of a hardware peripheral 
interface device produced by Pacific Microcircuits is also included, which eases the 
TMS32010 interface to both external memory and codec devices. The report in Section 
12 suggests hardware design techniques for interfacing memory devices and peripherals 
to the TMS32020. Examples of PROM, EPROM, static RAM, and dynamic RAM 
circuits built around the TMS32020 are demonstrated, with the timing requirements 
given for the processor and external devices. Interfaces to a combo-codec and a host 
computer through UART are also presented. The last report of Part II shows a scheme 
where the TMS32020 can be used as a numeric coprocessor to a host processor for 
numeric-intensive applications, such as image/graphics processing. The host processor 
selected for the example is the MC68000. The interface and communication techniques 
presented in the report are generic and directly applicable to other host processors. 


Digital Signal Processing Routines 


3. Implementation of FIR/IIR Filters with the 
TMS32010/TMS32020 


Al Lovrich and Ray Simar, Jr. 
Digital Signal Processing - Semiconductor Group 
Texas Instruments 
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INTRODUCTION 


In many signal processing applications, it is advantageous 
to use digital filters in place of analog filters. Digital filters 
can meet tight specifications on magnitude and phase 
characteristics and eliminate voltage drift, temperature drift, 
and noise problems associated with analog filter components. 

This application report describes a variety of methods 
for implementing Finite Impulse Response (FIR) and Infinite 
Impulse Response (IIR) digital filters with the TMS320 
family of digital signal processors. Emphasis is on 
minimizing both the execution time and the number of data 
memory locations required. Tradeoffs between several 
different structures of the two classes of digital filters are 
also discussed. 

In this report, TMS320 source code examples are 
included for the implementation of two FIR filters and three 
IIR filters based on the techniques presented. Plots of 
magnitude response, log-magnitude response, unit-sample 
response, and other pertinent data accompany each of the 
filter implementations. Important performance considerations 
in digital filter design are also included. The methods 
presented for implementing the different types of filters can 
be readily extended to any desired order of filters. 

Readers are assumed to have some familiarity with the 
basic concepts of digital signal processing theory.! The 
notation used in this report is consistent with that used in 
reference [1]. 


FILTERING WITH THE TMS320 FAMILY 


Almost every field of science and engineering, such 
as acoustics, physics, telecommunications, data 
communications, control systems, and radar, deal with 
signals. In many applications, it is desirable that the 
frequency spectrum of a signal be modified, reshaped, or 
manipulated according to a desired specification. The process 
may include attenuating a range of frequency components 
and rejecting or isolating one specific frequency component. 

Any system or network that exhibits such frequency- 
selective characteristics is called a filter. Several types of 
filters can be identified: lowpass filter (LPF) that passes only 
‘‘low’’ frequencies, highpass filter (HPF) that passes “‘high’”’ 
frequencies, bandpass filter (BPF) that passes a ‘‘band’’ of 
frequencies, and band-reject filter that rejects certain 
frequencies. Filters are used in a variety of applications, such 
as removing noise from a signal, removing signal distortion 
due to the transmission channel, separating two or more 
distinct signals that were mixed in order to maximize 
communication channel utilization, demodulating signals, and 
converting discrete-time signals into continuous-time signals. 
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Advantages of Digital Filtering 

The term ‘‘digital filter’? refers to the computational 
process or algorithm by which a digital signal or sequence 
of numbers (acting as input) is transformed into a second 
sequence of numbers termed the output digital signal. Digital 
filters involve signals in the digital domain (discrete-time 
signals), whereas analog filters relate signals in the analog 
domain (continuous-time signals). Digital filters are used 
extensively in applications, such as digital image processing, 
pattern recognition, and spectrum analysis. A band-limited 
continuous-time signal can be converted to a discrete-time 
signal by means of sampling. After processing, the discrete- 
time signal can be converted back to a continuous-time 
signal.Some of the advantages of using digital filters over 
their analog counterparts are: 

1. High reliability 

2. High accuracy 

3. No effect of component drift on system 
performance 

4. Component tolerances not critical. 

Another important advantage of digital filters when 
implemented with a programmable processor such as the 
TMS320 is the ease of changing filter parameters to modify 
the filter characteristics. This feature allows the design 
engineer to effectively and easily upgrade or update the 
characteristics of the designed filter due to changes in the 
application environment. 


Design of Digital Filters 
The design of digital filters involves execution of the 
following steps: 
1. Approximation 
2. Realization 
3. Study of arithmetic errors 
4. Implementation. 

Approximation is the process of generating a transfer 
function that satisfies a set of desired specifications, which 
may involve the time-domain response, frequency-domain 
response, or some combination of both responses of the filter. 

Realization consists of the conversion of the desired 
transfer function into filter networks. Realization can be 
accomplished by using several network structures,2,3 as 
listed below. Some of these structures are covered in detail 
in this report. 

Direct 

Direct canonic (direct-form II) 
Cascade 

Parallel 

Wave4 

Ladder. 

Approximation and realization assume an infinite- 
precision device for implementation. However, 
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implementation is concerned with the actual hardware circuit 
or software coding of the filter using a programmable 
processor. Since practical devices are of finite precision, it 
is necessary to study the effects of arithmetic errors on the 
filter response. 


TMS320 Digital Signal Processors 

Digital Signal Processing (DSP) is concerned with the 
representation of signals (and the information they contain) 
by sequences of numbers and with the transformation or 
processing of such signal representations by numeric- 
computational procedures. In the past, digital filters were 
implemented in software using mini- or main-frame 
computers for non-realtime operation or on specialized 
dedicated digital hardware for realtime processing of signals. 

The recent advances in VLSI technology have resulted 
in the integration of these digital signal processing systems 
into small integrated circuits (ICs), such as the TMS320 
family of digital signal processors from Texas Instruments. 
The TMS320 implementation of digital filters allows the filter 
to operate on realtime signals. This method combines the 
_ ease and flexibility of the software implemention of filters 
with reliable digital hardware. To further ease the design 
task, it is now possible for engineers to design and test filters 
using any one of the commercially available filter design 
packages, some of which create TMS320 code and decrease 
the design time. 

The Texas Instruments TMS320 digital signal 
processing family contains two generations of digital signal 
processors. The TMS32010, the first-generation digital signal 
processor, implements in hardware many functions that 
other processors typically perform in software. Some of the 
key features of the TMS32010 are: 

e 200-ns instruction cycle 
e 1.5K words (3K bytes) program ROM 
e 144 words (288 bytes) data RAM 
e External memory expansion to 4K words 
(8K bytes) at full speed 
16 x 16-bit parallel multiplier 
Interrupt with context save 
Two parallel shifters 
On-chip clock 
Single 5-volt supply, NMOS technology, 
40-pin DIP. 

The TMS32020 is the second-generation processor® in 
the TMS320 DSP family. To maintain device compatibility, 
the TMS32020 architecture is based upon that of the 
TMS32010, the first member of the family, with emphasis 
on overall speed, communication, and flexibility in processor 
configuration. Some of the key features of the TMS32020 
are: 


e 544 words of on-chip data RAM, 256 
words of which may be programmed as 
either data or program memory 

¢ 128K words of data/program space 

e Single-cycle multiply/accumulate 
‘instructions 


TMS32010 software upward compatibility 
200-ns instruction cycle 
Sixteen input and sixteen output channels 
16-bit parallel] interface 
Directly accessible external data memory 
space 
¢ Global data memory interface for 
multiprocessing 
e Instruction set support for floating-point 
operations 
¢ Block moves for data/program memory 
e Serial port for multiprocessing or codec 
interface 
¢ On-chip clock 
e Single 5-volt supply, NMOS technology, 
68-pin grid array package. 

Because of their computational power, high I/O 
throughput, and realtime programming, the TMS320 
processors have been widely adapted in telecommunication, 
data communication, and computer applications. In addition 
to the above features, the TMS320 has efficient DSP-oriented 
instructions and complete hardware/software development 
tools, thus making the TMS320 highly suitable for DSP 
applications. 


DIGITAL FILTER IMPLEMENTATION ON THE 
TMS320 


For a large variety of applications, digital filters are 
usually based on the following relationship between the filter 
input sequence x(n) and the filter output sequence y(n): 


N M 
yin) = yy agy(n-k) + yr by x(n-k) (1) 
k=0 k=0 


Equation (1) is referred to as a linear constant- 
coefficient difference equation. Two classes of filters can be 
represented by linear constant-coefficient difference 
equations: 

1. Finite Impulse Response (FIR) filters, and 
2. Infinite Impulse Response (IIR) filters. 

The following sections describe the implementation of 

these classes of filters on the TMS32010 and TMS32020. 


FIR Filters 


For FIR filters, all of the a, in (1) are zero. Therefore, 
(1) reduces to 


M 
y(n)= yr bk x(n-b) (2) 
k=0 


where (M + 1) is the length of the filter. 

As a result, the output of the FIR filter is simply a finite- 
length weighted sum of the present and previous inputs to 
the filter. If the unit-sample response of the filter is denoted 
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as h(n), then from (2), it is seen that h(n) = b(n). Therefore, 
(2) is sometimes written as 


M 
y(n) = > h(k)x(n —k) 
k=0 


(3) 


From (3), it can be seen that an FIR filter has, as the 
name implies, a finite-length response to a unit sample. 
Denoting the z transforms of x(n), y(n), and h(n) as X(z), 
Y(z), and H(z), respectively, then 


X(z) 


M 
> h(k)z —k 
k=0 


M 
= yp bol (4) 


k=0 


H(z) 


Equations (3) and (4) may also be represented by the 
network structure shown in Figure 1. This structure is 
referred to as a direct-form realization of an FIR filter, 
because the filter coefficients can be identified directly from 
the difference equation (3). The branches labeled with z— 
in Figure 1 correspond to the delays in (3) and the 
multiplications by z—! in (4). Equation (3) may be 
implemented in a straightforward and efficient manner on 
a TMS320 processor. 


TMS32010 Implementation of FIR Filters 

Figure 2 gives an example of a length-5 direct-form 
FIR filter, and Figure 3 shows a portion of the TMS32010 
code for implementing this filter. , 

The notation developed in this section will be used 
throughout this application report. XN corresponds to 
x(n), XNM1 corresponds to x(n-— 1), etc. 


In the above implementation, the following three basic 
and important concepts for the implementation of FIR filters 
on the TMS320 should be understood: 

1. The relationship between the unit-sample 
response of an FIR filter and the filter 
structure, 

The power of the LTD and MPY 
instruction pair for this implementation, and 

3. The ordering of the input samples in the 

data memory of the TMS320, which is 
critical for realtime signal processing. 

The input sequence x(n) is stored as shown in Figure 
4. In general, each of the multiplies and shifts of x(n) in (3) 
is implemented with an instruction pair of the form 


LTD XNMI1 
MPY Hl 


The instruction LTD XNM1 loads the T register with 
the contents of address XNM1, adds the result of the previous 
multiply to the accumulator, and shifts the data at address 
XNM1 to the next higher address in data memory. Using 
the storage scheme in Figure 4, this corresponds to shifting 
the data at address XNM1 to address XNM2. The instruction 
MPY Hi multiplies the contents of the T register with the 
contents of address H1. The shifting is the reason for the 
storage scheme used in Figure 4. This scheme, critical for 
realtime digital signal processing, makes certain that the input 
sequence x(n) is in the correct location for the next pass 
through the filter. 

By comparing (3) with the code in Figure 3, the reason 
for the ordering of the data and the importance of the shift 
implemented by the LTD instruction can be seen. To better 


x(n) 
h(2) h(M — 1) 
yin) 
Figure1. Direct-Form FIR Filter 
z 1 y 1 2 1 Zz” 1 
x(n) 
h(4) 
yin) 


Figure 2. Length-5 Direct-Form FIR Filter 
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* THIS SECTION OF CODE IMPLEMENTS THE FOLLOWING EQUATION: = 


* x(n-4)h(4) + x(n-3)h(3) 
* 


NXTPT 
* 


* 


+ x(n-2)h(2) + x(n-1)h(1) + x(n)h(0) = y(n) * 


IN XN,PA2 * GET THE NEW INPUT VALUE XN FROM PORT PAO * 

ZAC * ZERO THE ACCUMULATOR * 

LT XNM4 * x(n-4)h(4) * 

MPY H4 

LTD XNM3 * x(n-4)h(4) + x(n-3)h(3) * 

MPY H3 

LTD XNM2 * SIMILAR TO THE PREVIOUS STEPS * 

MPY H2 

LTD XNM1 

MPY Hl 

LTD XN 

MPY HO 

APAC * ADD THE RESULT OF THE LAST MULTIPLY TO * 
* THE ACCUMULATOR * 

SACH YN,1 * STORE THE RESULT IN YN * 

OUT YN,PA2 * OUTPUT THE RESPONSE TO PORT PAl * 

B NXTPT * GO GET THE NEXT POINT * 


Figure 3. TMS32010 Code for Implementing a Length-5 FIR Filter 


LOW DATA 
ADDRESS 


oars | xin 2) | INPUT SAMPLES 


HIGH DATA 
ADDRESS 


Figure 4. TMS32010 Input Sample Storage for a 


Length-N FIR Filter 


understand the algorithm, the relationship between the input 
and output of the filter must be considered. Evaluating (3) 
for a particular value of n, for example, no, yields 
N-1 
y(nop) = = hk) x(ng —k) (5) 
k=0 
If the next sample of the filter response y(ng + 1) is 


needed, it is seen from (3) that 


N-1 
ying+1) = yy Atk) x(ng +1 ~k) (6) 


k=0 
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Equations (5) and (6) show that the samples of x(n) 
associated with particular values of h(k) in (5) have been 
shifted to the left (i.e., to a higher data address) by one in 
(6). This shifting of the input data, illustrated in Figure 5, 
corresponds to the shifting of the flipped input sequence in 
relation to the unit-sample response. 

Depending on the system constraints, the designer may 
choose to reduce program memory size by taking advantage 
of indirect addressing capability provided by the TMS32010. 
Using either of the auxiliary registers along with the 
autoincrement or autodecrement feature, the FIR filter 
program can be rewritten in looped form as shown in Figure 
6. 

The input sequence x(n) is stored as shown in Figure 
4, and the impulse response h(n) is stored as shown in Figure 
7. In the looped version, the indirect addressing mode is used 
with the autodecrement feature and BANZ instruction to 
contro] the looping and address generation for data access. 
While the looped code requires less program memory than 
the straightline version, the straightline version runs more 
quickly than the looped code because of the overhead 
associated with loop control. This design tradeoff should be 
carefully considered by the design engineer. 


Relationship between h(n) and 
x(n) for y(ng) 


It is also possible to use the LTD/MPYK instruction 
pair to implement each filter tap in straightline code. The 
MPYK instruction is used to multiply the contents of the T 
register by a signed 13-bit constant stored in the MPYK 
instruction word. For many applications, a 13-bit coefficient 
can adequately implement the filter without significant 
changes to the filter response. An advantage of using this 
approach is that the coefficients are stored in program 
memory and there is no need to transfer them to data 
memory. This reduces the amount of data memory locations 
required per filter tap from two to one. 

The length-80 FIR filter program in Appendix A 
implements a linear-phase FIR filter in straightline code. The 
unit-sample response of the filter is symmetric in order to 
achieve linear phase. Because of the symmetry, it is necessary 
to store only 40 (rather than 80) of the samples of the impulse 
response. This symmetry can often be used to a designer’s 
advantage since it significantly reduces the amount of storage 
space required to implement the filter. 

In summary, by taking advantage of the TMS32010 
features, a designer can implement a direct-form FIR filter, 
optimized for execution time, data memory, or program 
memory. 


Relationship between h(n) and 
x(n) for y(ng + 1) 


LOW DATA ADDRESS 


HIGH DATA ADDRESS 


Figure 5. Relationship Between the Contents of Data Registers 
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* THIS SECTION OF CODE IMPLEMENTS THE EQUTION: 


* x(n-(N-1))h(N-1) + x(n-(N-2))h(N-2) + ... + x(n)h(0) 


* 


y(n) * 


* AUXILIARY REGISTER POINTER SET TO ARO * 


PULL IN NEW INPUT FROM PORT PAO * 


x (n-(N-1))h(N-1) +x (n-(N-2) )h(N-2)+...+x(n)h(0)=y(n)* 
IF ARO DOES NOT EQUAL ZERO, * 
THEN DECREMENT ARO AND BRANCH TO LOOP * 


ADD THE P REGISTER TO THE ACCUMULATOR * 


OUTPUT THE RESPONSE TO PORT PAI] * 


* 
LARP ARO 
* 
NXTPT IN XN,PA2 * 
* 
LARK ARO,XNMNM] * ARO POINTS TO X(n-(N-1)) * 
LARK AR1,HNM1 * AR1 POINTS TO H(N-1) * 
* 
ZAC * ZERO THE ACCUMULATOR * 
* 
LT *~,AR1 * x(n-(N-1))h(N-1) * 
MPY *-,ARO 
* 
LOOP LTD *,AR1 * 
MPY *-,ARO 
* 
BANZ LOOP * 
* * 
* 
APAC * 
* 
SACH YN,1 * STORE THE RESULT IN YN * 
* 
OUT YN,PA2 * 
*& 
B NXTPT * 


GO GET THE NEXT INPUT POINT * 


Figure 6. TMS32010 Code for Implementing a Looped FIR Filter 
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ADDRESS 


UNIT-SAMPLE 
: RESPONSE 
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Figure 7. TMS32010 Unit-Sample Response 
Storage for a Looped FIR Filter 


TMS32020 Implementation of FIR Filters 

In many DSP applications, realtime processing of 
signals is very critical. Important choices must be made in 
selecting a DSP device capable of realtime filtering, For 
example, in a speech application, a sampling rate of 8 kHz 
is common, which corresponds to an interval of 125 ps 
between consecutive samples. This interval is the maximum 


allowable time for realtime operation, corresponding to 625 
cycles on the TMS32010. In order to perform the required 
signal processing tasks in that interval, it is essential to reduce 
filter execution time. This can be accomplished by a single- 
cycle multiply/accumulate instruction. The TMS32020, the 
second-generation DSP device, is a processor with such a 
capability. A single-cycle multiply/accumulate with data- 
move instruction and larger on-chip RAM make it possible 
to implement each filter tap in approximately 200 ns. 
The TMS32020 provides a total of 544 16-bit words 
of on-chip RAM, divided into three separate blocks of BO, 
B1, and B2. Of the 544 words, 288 words (blocks B1 and 
B2) are always data memory, and 256 words (block BO) are 
programmable as either data or program memory. The 
CNFD (configure block BO as data memory) and CNFP 
(configure block BO as program memory) instructions allow 
dynamic configuration of the memory maps_ through 
software, as illustrated in Figure 8. After execution of the 
CNFP instruction, block BO is mapped into program 
memory, beginning with address 65280. To take advantage 
of the MACD (multiply and accumulate with data move) 
instruction, block BO must be configured as program memory 
using the CNFP instruction. MACD only works with on- 
chip RAM. The use of the MACD instruction helps to speed 


34 3. Implementation of FIR/IIR Filters with the TMS32010/TMS32020 


PROGRAM 


65,535(>FFFF) 


DATA 


0(>0000) 0(>0000) 
INTERRUPTS ON-CHIP 
AND RESERVED MEMORY-MAPPED 
aiisootk (EXTERNAL) ee REGISTERS 
32(>0020) 6(>0006) PAGE 0 
RESERVED 
95(>005F) 
127(>007F) BhOCt Es 
PAGES 1-3 
511(>01FF) 
EXTERNAL 512(>0200) ON-CHIP 
BLOCK BO PAGES 4-5 
767(>O2FF) 
BLOCK BY PAGES 6-7 
1023(>03FF) 
1024(>0400) 


65,535(>FFFF) 


(a) ADDRESS MAPS AFTER A CNFD INSTRUCTION 


PROGRAM 
0(>0000) 
INTERRUPTS 
AND RESERVED 
(EXTERNAL) 
31(>001F) 
32(>0020) 


95(>005F) 
96(> 0060) 


DATA 


0(>0000) 


ON-CHIP 
MEMORY-MAPPED 
REGISTERS 


5(>0005) 
6(>0006) PAGE 0 


RESERVED 


ON-CHIP 
BLOCK B2 


127(>007F) 
EXTERNAL 128(>0080) } esenve 
RESERVED PAGES 1-3 
511(>01FF) 
512(>0200) 


767(>02FF) 


PAGES 4-5 


DOES NOT 
EXIST 


768(>0300) 

PAGES 6-7 
65,279(>FEFF) 1023( > O03FF) 
65,280(>FFOO) 1024(>0400) 


ON-CHIP 
BLOCK BO 


65,535(>FFFF) 


65,535( > FFFF) 


PAGES 8-511 


ON-CHIP 
BLOCK B1 
EXTERNAL } 


(b) ADDRESS MAPS AFTER A CNFP INSTRUCTION 


Figure 8. TMS32020 Memory Maps 


the filter execution and allows the size of the FIR filter to 
expand to 256 taps.® 

The TMS32020 implementation of (3) is made even 
more efficient with a repeat instruction, RPTK. It forms a 


useful instruction pair with MACD, such as 
RPTK NM1 
MACD (PMA),(DMA) 
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The RPTK NM1 instruction loads an immediate 8-bit 
value N-1 into the repeat counter. This causes the next 
instruction to be executed N times (N = the length of the 
filter). The instruction MACD (PMA),(DMA) performs the 
following functions: 

1. Loads the program counter with PMA, 
2. Multiplies the value in data memory 
location DMA (on-chip, block B1) by the 
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value in program memory location PMA 
(on-chip, block BO), 

3. Adds the previous product to the 
accumulator, 

4. Copies the data memory value (block BO) 
to the next higher on-chip RAM location. 
The data move is the mechanism by which 
the z~! delay can be implemented, and 

5. Increments the program counter with each 
multiply/accumulate to point to the next 
sample of the unit-sample response. 

In other words, the MACD instruction combines the 
LTD/MPY instruction pair into one. With the proper storage 
of the input samples and the filter unit-sample response, one 
can take advantage of the power of the MACD instruction. 
Figure 9 is a data storage scheme that provides the correct 
sequence of inputs for the next pass through the filter. 

In the TMS32020 code example of Figure 10, data 
memory values are accessed indirectly through auxiliary 
register 1 (AR1) when the MACD instruction is 
implemented. For low-order filters (second-order), using the 
MACD instruction in conjunction with the RPTK instruction 
is less effective due to the overhead associated with the 
MACD instruction in setting up the repeat construct. To take 
advantage of the MACD instruction, the filter order must 
be greater than three. For lower-order filters, it is 
recommended to use the LTD/MPY instruction pair in place 
of RPT/MACD. 

Writing looped code for the TMS32020 implementation 
of an FIR filter gives no further advantage. Since the MACD 
instruction already uses less program memory, looped code 
in this case does not reduce program memory size. 
Implementing FIR filters of length-3 or higher requires the 
same amount of program memory (excluding coefficient 


BLOCK BO 


LOW ADDRESS 


>FFOO 

(65280) 
e 
° 
e 


HIGH ADDRESS 


UNIT-SAMPLE RESPONSE 
STORAGE FOR 
LENGTH-N FIR FILTER 


storage). For example, an FJR filter of length-256 takes the 
same amount of program memory space as a FIR filter of 
length-4. 

Since the ‘TMS32020 instruction set is upward- 
compatible with the TMS32010 instruction set, it is possible 
to use the LTD/MPYK instruction pair to implement the 
filter. With the TMS32020, the designer can use either 
RPTK/MACD or LTD/MPY(K) where appropriate. 
Depending on the application and the data memory 
constraints, the use of the LTD/MPYK instruction pair 
results in less data memory usage at the cost of increasing 
the program memory storage. 

The FIR filter program of Appendix A is an 
implementation of the same length-80 FIR filter used in the 
TMS32010 example. In this implementation, it can be seen 
that the TMS32020 uses less program memory than the 
TMS32010 with the tradeoff of using more data memory 
words. The increase in data memory size is indirectly related 
to the MACD instruction; i.e., in order to take full advantage 
of the instruction, it is necessary to keep the multiplier 
pipeline as busy as possible. Therefore, the filter will execute 
faster when all 80 coefficients are provided in block BO. 

The TMS32020 provides a solution for the faster 
execution of FIR filters. The combination of the 
RPTK/MACD instructions provides for a minimum program 
memory and high-speed execution of an FIR filter. If data 
memory is a concern, the designer can use the LTD/MPYK 
instruction pair at the cost of increasing program memory 
and using 13-bit filter coefficients. 


IIR Filters 

The concepts introduced for the implementation of FIR 
filters can be extended to the implementation of IIR filters. 
However, for an IIR filter, at least one of the a, in (1) is 


BLOCK B1 


LOW ADDRESS 


e 
e 
e 
>3FF 
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Figure 9. TMS32020 Memory Storage Scheme 
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* THIS SECTION OF CODE IMPLEMENTS THE EQUATION: 


* x(n-(N-1))h(N-1) + x(n-(N-2))h(N-2) + 2... + x(n)h(0) = y(n) 
* 
CNFP * USE BLOCK BO AS PROGRAM AREA 
* 
NXTPT IN XN, PAO * BRING IN THE NEW SAMPLE XN 
* 
LRLK AR1,>3FF * POINT TO THE BOTTOM OF BLOCK Bl 
LARP AR1 
Py 
MPYK 0 * SET P REGISTER TO ZERO 
ZAC * CLEAR THE ACCUMULATOR 
* 
RPTK NM1 * REPEAT N-l TIMES 
MACD >FFO00,*- * MULTIPLY/ACCUMULATE 
* 
APAC | 
SACH YN, 1 
* 
OUT YN,PA1 * QUTPUT THE FILTER RESPONSE y(n) 
* 
B NXTPNT * GET THE NEXT POINT 


Figure 10. TMS32020 Code for Implementing a Length-5 FIR Filter 


nonzero. It has been shown! that the z transform of the unit- 
sample response of an IIR filter corresponding to (1) is 


M 
> byz—k 
k=0 
H(z) = a 2 are (7) 
X(z) N 
i- ¥ ayz—k 
k=1 


where H(z), Y(z), and X(z) are the z transforms of h(n), y(n), 
and x(n), respectively. Three different network structures 
often used to implement (7) are the direct form, the cascade 
form, and the parallel form. Implementation of these 
structures is discussed in the following sections. 


Direct-Form IIR Filter 

Equations (1) and (7) may also be represented by the 
network structure shown in Figure 11. For convenience, it 
is assumed chat M = N. This network structure is referred 
to as the direct-form I realization of an Nth-order difference 
equation. As was the case for the direct-form FIR filter, the 
structure in Figure 11 is called direct-form since the 
coefficients of the network can be obtained directly from the 
difference equation describing the network. Again, the 
branches associated with the z~ ! correspond to the delays 
in (1) and the multiplications in (7). 

The following difference equation: 


N M 
y(n) = y 4k y(n—k) + > by x(n —-k) (8) 
k=1 k=0 


shows that the output of the filter is a weighted sum of past 
values of the input to the filter and of the output of the filter. 
Using techniques similar to those for an FIR filter, this 
realization can be implemented in a straightforward and 
efficient way on the TMS32010 and TMS32020. 

A network flowgraph equivalent to that in Figure 11 
is shown in Figure 12. This system is referred to as the direct- 
form II structure. Since the direct-form II has the minimum 
number of delays (branches labeled z~1), it requires the 
minimum number of storage registers for computation. This 
structure is advantageous for minimizing the amount of data 
memory used in the implementation of IIR filters. 

In Figures 13 through 17, a second-order direct-form 
II IIR filter is used as an example for the TMS320 
implementation of the IIR filter. The network structure is 
shown in Figure 13. 

The difference equation for this network is 


d(n) 
y(n) 


x(n) + ay d(n—1) + a2 d(n—2) (9) 
bo d(n) + by d(n—1) + bg d(n—2) 


In this case, d(n), shown in (9) and Figure 13, 
corresponds to the network value at the different delay nodes. 
The zero-delay register corresponds to d(n); d(n — 1) is the 
register for the delay of one; and d(n — 2) is the register for 
the delay of two. A portion of the TMS32010 code necessary 
to implement (9) is shown in Figure 14. Initially all d(n — i) 
for i=0,1,2 are set to zero, 

The delay-node values of the filter are stored in 
data memory as shown in Figure 15. At each major step of 
the algorithm, a multiply is done, and the result from the 
previous multiply is added to the accumulator. Also, the past 
delay-node values are shifted to the next higher location in 
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Figure 12. Direct-Form II IIR Filter 
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Figure 13. Second-Order Direct-Form I IIR Filter 
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Figure 11. Direct-Form I IIR Filter 
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data memory, thus placing them in the correct position for 
the next pass through the filter. All of these operations are 
carried out with instruction pairs, such as 


LTD DNM1 
MPY Bl 


where DNMI corresponds to d(n — 1) and B1 corresponds 
to by as in (9). 

When the last multiplication is performed and the result 
is added to the accumulator, the accumulator contains the 
result of (9), which is y(n). From (9) and Figure 13, it is 
evident that the delay-node value d(n) depends on several 
of the previous delay-node values. This feedback is illustrated 
by the instruction 


SACH DN,]1 
and the use of the statements 


LTD DNMI1 


LTD DN 


The ordering of the delay-node values, shown in Figure 
15, allows for a simple program structure with minimal 
computations and minimal data locations. It also 
accommodates the shifting of the delay-node values in a 
straightforward way. The feedback of DN makes apparent 
the underlying structure of the direct-form II filter and (10). 
This form of the algorithm is flexible and can be extended 
to higher-order direct-form filters in a straightforward way. 
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* THIS SECTION OF CODE IMPLEMENTS THE EQUATIONS: 


* d(n) = x(n) + d(n-l)a + d(n-2)a 
* - ] 2 


* 
* 
* 
* 


y(n) = d(n)b + d(n-1)b + d(n-2)b 
0 1 2 


IN XN,PAO * NEW INPUT VALUE XN * 


+ 


LAC XN,15 * LOAD ACCUMULATOR WITH XN * 


LT DNM1 
MPY Al 


LTA DNM2 
MPY A2 


APAC 


ZAC 
MPY B2 


LTD DNM1 
MPY Bl 


LTD DN 
MPY BO 


APAC 


SACH DN,1l * a(n) = x(n) + d(n-l)a + d(n-2)a * 
* 1 


2x 


SACH YN,1l * y(n) = d(n)b + d(n-1)b + d(n-2)b * 
ba 0 1 yas 


OUT YN,PA1 * YN IS THE OUTPUT OF THE FILTER * 


Figure 14. TMS32010 Code for Implementing a Second-Order Direct-Form II IIR Filter 
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Figure 15. Delay-Node Value Storage for a 
Second-Order Direct-Form IIR Filter 


- Figure 16 shows the necessary ordering of the delay-node 
values for a general direct-form II structure for the case 
M= N. Filter order is determined by M or N, whichever 
is greater. 
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Figure 16. Delay-Node Value Storage for a 
Direct-Form II IIR Filter 
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Figure 17 shows a portion of the TMS32020 code for 
implementing the same second-order direct-form II IIR filter 
using the MACD instruction. As discussed in the section on 
FIR filters, using the RPTK/MACD instruction pair is most 
effective when the filter order is three or higher. The use 
of the MACD instruction allows the designer to save one 
word of program memory over the LTD/MPY 
implementation. The TMS32020 code in Figure 17 is 
provided only as an example. For a biquad implementation 
(second-order direct-form II IIR filter), the TMS32010 code 
and TMS32020 code for the filter implementation are 
identical. Note that due to larger on-chip RAM of the 
TMS32020, higher-order IIR filters or sections of IIR filters 
can be implemented. For the rest of the IIR filter structures, 
the same discussion applies to both processors. 

An example of a TMS32010/TMS32020 program 
implementing a fourth-order direct-form II structure can be 
found in Appendix C. 


Cascade-Form IIR Filter 

In this section, the realization and implementation of 
cascade-form IIR filters are discussed. The implementation 
of a cascade-form IIR filter is an extension of the results of 
the implementation of the direct-form IIR filter. 

The z transform of the unit-sample response of an IIR 
filter 


byz —k 


(10) 
ayz —k 


SECTION OF CODE IMPLEMENTS A SECOND-ORDER DIRECT-FORM II IIR FILTER 


* NEW INPUT VALUE XN 


* CLEAR P REGISTER 


* USE BLOCK BO AS PROGRAM AREA 


* CLEAR P REGISTER 


* SAVE FILTERED OUTPUT 


* YN IS THE OUTPUT OF THE FILTER 


* THIS 
* d(n) = x(n) + d(n-l)a + d(n-2)a 
* 1 2 
* y(n) = d(n)b + d(n-1)b + d(n-2)b 
* 0 l 2 
* 
NEXT IN XN, PA2 
* 
LAC XN 
MPYK 0 
* 
LARP AR1 
LRLK AR1,>03FF 
CNFP 
* 
* da(n) = x(n) + d(n-l)a + d(n-2)a 
* 1 2 
* 
RPTK 1 * REPEAT 2 TIMES 
MACD »>FFOO,*+ 
* 
APAC 
SACH DN,1l * d(n) 
* 
* y(n) = d(n)b + d(n-1)b + d(n-10)b 
* 0 1 2 
* 
ZAC 
MPYK 0 
* 
MPY >FFO2 
* 
RPTK 1 
MACD >FFO3,*- 
* 
APAC 
SACH YN,l 
* 
OUT YN, PA2 
B NEXT 


Figure 17. TMS32020 Code for Implementing a Second-Order Direct-Form DR Filter with MACD 
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may also be written in the equivalent form 


N/2 Bok + Bixz~} + Boz? 
Hz) = [JT —————————_- (11) 
k=1 1 —oy,z- 1 — a2, -2 


where the filter is realized as a series of biquads. Therefore, 
this realization is referred to as the cascade form. Figure 18 
shows a fourth-order IIR filter implemented in cascade 
structure, where the subsections are implemented as direct- 
form II sections. Each subsection corresponds to one of the 
terms in the product in (11). Note that any single cascade 
section is identical to the second-order direct-form II IR filter 
described previously. 

The difference equation for cascade section i can be 
written as 


di(n) = yj-4(n) + oyj d(n—1) + a2 di(n-2) (12) 


yi(n) = Boj di(n) + By di(n—1) + 82; di(n—2) 


where 

i = 1,2,...,N/2. 

yi-1(n) = input to section i. 

d;(n) = value at a particular delay 
node in section 1. 

yi(n) = output of section i. 

yo(n) = x(n) = sample input to the filter. 

YN/2 = y(n) = output of the filter. 


For the IR filter consisting of the two cascaded sections 
shown in Figure 18, there are two sets of equations describing 
the relationship between the input and output of the filter. 
The delay-node values for each section are stored as shown 
in Figure 19. The same indexing scheme used previously 


Bo1 
: Pte 


SECTION 1 


LOW DATA 
ADDRESS 

SECTION 1 
safe 
HIGH DATA 
ADDRESS 


Figure 19. Delay-Node Storage for Cascaded 
IIR Filter Subsections 


is used here (i.e., from the higher address in data memory 
to the lower address in data memory). In this case, the 
algorithm can be structured so that the 32-bit accumulator 
of the TMS320 acts as a storage register and carries the 
output of one of the second-order subsections to the input 
of the next second-order subsection. This avoids unnecessary 
truncation of the intermediate filter values into 16-bit words, 
and therefore provides better accuracy in the final output. 

The implementation of the cascaded fourth-order IIR 
filter can be summarized as follows: 

1. Load the new input value x(n). 

2. Operate on the first section as outlined in 
Figure 12. 

3. Leave the output of the first section in the 
accumulator (i.e., the SACH YN can be 
omitted for the first-section implementation 
since the accumulator links the output of 
one section to the input of the following 
section). 

4. Operate on the second section in the same 
way as the first section, remembering that 


Bo2 
Pt : 


SECTION 2 


Figure 18. Fourth-Order Cascaded IIR Filter 
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the accumulator already contains the output 
of the previous section. 

5. The output of the second section is the 
filter output y(n). 

The above procedures can be applied to the IIR filter 
implementation of higher orders. It can be shown? that with 
proper ordering of the second-order cascades, the resulting 
filter has better immunity to quantization noise than the 
direct-form implementation, as will be discussed later. 

An example of a TMS32010/TMS32020 program that 
implements a fourth-order IIR cascaded structure is contained 
in Appendix C. 


Parallel-Form IIR Filter 
The third form of an IIR filter is referred to as the 
parallel form. In this case, H(z) is written as 


(13) 
M-N N/2 


Yok + Vikz ~ ! 
H(z) = Xs Cyz-k + 


k=1 l-oyz~!—an,z-2 
If M <N, then the term (Cxz~k) = 0. The network 

form is shown in Figure 20, where it is assumed that M = 

N = 4. The multiplication of the input by C (a constant) 


is trivial. However, for one of the parallel branches of this 
structure, the difference equation is 


dj(n) = x(n) + o4j di(n—1) + a2; dj(n—2) (14) 


pi(n) = Yoi dj(n) + Yj di(n—1) 


where i = 1,2,...,N/ 2, and pj(n) = the present output of 
a parallel branch. 

The similarity to the second-order direct-form II 
network and the single parallel section is apparent. However, 
in this case, the outputs of all sections are summed to give 
the output y(n), i.e., 


N/2 
y(n) = Cx(n) + YY pi) (15) 


i=1 


if M = N. For the parallel implementation, the delay-node 
values are also structured in data memory, as shown in Figure 
21, thus allowing for an implementation similar to that used 
previously. After the output of each section stored in the 
32-bit accumulator is determined, these outputs are summed 
to yield the filter output y(n). An example of a 
TMS32010/TMS32020 program to implement a parallel 
structure can be found in Appendix C. 


PERFORMANCE CONSIDERATIONS IN 
DIGITAL FILTER DESIGN 


In the previous sections, different realizations of the 
FIR and IIR digital filters were discussed. This section is 
mainly concerned with the effects of finite wordlength on 
filter performance. 

Some features of FIR and IIR filters, which distinguish 
them from each other and need special considerations when 
they are implemented, include phase characteristics, stability, 
and coefficient quantization effects. 


Figure 20. Parallel-Form IIR Filter 
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Figure 21. Delay-Node Value Storage 
for a Parallel IIR Filter 


Given a set of frequency-response characteristics, 
typically a higher-order FIR filter is required to match these 
characteristics to a corresponding IIR filter. However, this 
does not imply that IIR filters should be used in all cases. 
In some applications, it is important that the filter have linear 
phase, and only FIR filters can be designed to have linear 
phase. 

Another important consideration is the stability of the 
filter. Since the unit-sample response of an FIR filter is of 
finite length, FIR filters are inherently stable (i.e., a bounded 
input always produces a bounded output). This can be seen 
from (5) where the output of an FIR filter is a weighted finite 
sum of previous inputs. On the other hand, IIR filters may 
or may not be stable, depending on the locations of the poles 
of the filter. 

Digital filters are designed with the assumption that the 
filter will be implemented on an infinite precision device. 
However, since all processors are of finite precision, it is 
necessary to approximate the ‘‘ideal’’ filter coefficients. This 
approximation introduces coefficient quantization error. The 
net result due to imprecise coefficient representations is a 
deviation of the resultant filter frequency response from the 
ideal one. For narrowband IIR filters with poles close to the 
unit circle, longer wordlengths may be required. The worst 
effect of coefficient quantization is instability resulting from 
poles being moved outside the unit circle. 

The effect of coefficient quantization is highly 
dependent on the structure of the filter and the wordlength 
of the implementation hardware. Since the poles and zeroes 
for a filter implemented with finite wordlength arithmetic 
are not necessarily the same as the poles and zeroes of a filter 
implemented on an infinite precision device, the difference 
may affect the performance of the filter. 

In the IIR filter, the cascade and parallel forms 
implement each pair of complex-conjugate poles separately. 
As a result, the coefficient quantization effect for each pair 
of complex-conjugate poles is independent of the other pairs 
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of complex-conjugate poles. This is generally not true for 
direct-form filters. Therefore, the cascade and parallel forms 
of IIR filters are more commonly used than the direct form. 

Another problem in implementing a digital filter is the 
quantization error due to the finite wordlength effect in the 
hardware. Sources of error arising from the use of finite 
wordlength include the following: 

1. I/O signal quantization 

2. Filter coefficient quantization 

3. | Uncorrelated roundoff (or truncation) noise 
4. Correlated roundoff (or truncation) noise 
5. | Dynamic range constraints. 

These problems are addressed in the following 
paragraphs in more detail. 

Representing instantaneous values of a continuous-time 
signal in digital form introduces errors that are associated 
with I/O quantization. Input signals are subjected to A/D 
quantization noise while output signals are subjected to D/A 
quantization noise. Although output D/A noise is less 
detrimental, input A/D quantization noise is the more 
dominant factor in most systems. This is due to the fact that 
input noise ‘‘circulates’’ within IIR filters and can be 
‘‘regenerative’’ while output noise normally just 
“‘propagates’’ off-stage. 

The filter coefficients in all of the routines described 
in this report are initially stored in program memory, and 
then moved to data memory. These coefficients are 
represented in Q15 format; i.e., the binary point (represented 
in two’s-complement form) is assumed to follow the most- 
significant bit. This gives a coefficient range of 0.999969 
to -1.0 with increments of 0.000031. The input is also in 
Q15 format so that when two Q15 numbers are multiplied, 
the result is a number in Q30 format. When the Q30 number 
resides in the 32-bit accumulator of the TMS320, the binary 
point follows the second most-significant bit. Since the output 
of the filter is assumed to be in Q15 format, the Q30 number 
must be adjusted by left-shifting by one while maintaining 
the most-significant 16 bits of the result. This is accomplished 
with the step SACH YN, 1, which shifts the Q30 number to 
the left by one and stores the upper sixteen bits of the 
accumulator following the shift. The result YN is in Q1I5 
format. Note that it is important to keep intermediate values 
in the accumulator as long as possible to maintain the 32-bit 
accuracy. 

Uncorrelated roundoff (or truncation) noise may occur 
in multiplications. Even though the input to the digital filter 
is represented with finite wordlength, the result of processing 
leads to values requiring additional bits for their 
representation. For example, a b-bit data sample, multiplied 
by a b-bit coefficient, results in a product that is 2b bits long. 
In a recursive filter realization, 2b bits are required after the 
first iteration, 3b bits after the second iteration, and so on. 
The fact that multiplication results have to be truncated means 
that every ‘‘multiplier’’ in a digital structure can be regarded 
as a noise source. The combined effects of various noise 
sources degrade system performance. 
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Truncation or rounding off the products formed within 
the digital filter is referred to as correlated roundoff noise. 
The result of correlated roundoff (or truncation) noise, 
including overflow oscillations, is that filters suffer from 
“‘limit-cycle effect’? (small-amplitude oscillations). For 
systems with adequate coefficient wordlength and dynamic 
range, this problem is usually negligible. Overflows are 
generated by additions resulting in undesirable large- 
amplitude oscillations. Both limit cycles and overflow 
oscillations force the digital filter into nonlinear operations. 
Although limit cycles are difficult to eliminate, saturation 
arithmetic can be used to reduce overflow oscillations. The 
overflow mode of operation on the TMS320 family is 
accomplished with the SOVM (set overflow mode) 
instruction, which sets the accumulator to the largest 
representable 32-bit positive (> 7FFFFFFF hex) or negative 
(> 80000000 hex) number according to the direction of 
overflow. 

Dynamic range constraints, such as scaling of 
parameters, can be used to prevent overflows and underflows 
of the finite wordlength registers. The dynamic range is the 
ratio between the largest and smallest signals that can be 
represented in a filter. For an FIR filter, an overflow of the 
output results in an error in the output sample. If the input 
sample has a maximum magnitude of unity, then the 
worstcase output is 


yn)= DY hh) =s (16) 


To guarantee y(n) to be a fraction, either the filter gain 
or the input x(n) has to be scaled down by a factor ‘‘s’’. 
Reducing the filter gain implies scaling down the filter 
coefficients so that the 16-bit coefficient is no longer used 
effectively. An implication of this scaling is a degradation 
of the filter frequency response due to higher quantization 
errors. As an alternative, the input signal may be scaled, 
resulting in a reduction in signal-to-noise ratio (SNR). In 
practice, the second approach is preferred since the scaling 
factor is normally less than two and does not change the SNR 
drastically. The required scaling on a TMS32020 is achieved 
by using the SPM (set P register output shift mode) 
instruction to invoke a right-shift by six bits to implement 
up to 128 multiply/accumulates without overflow occurring. 

For an IIR filter, an overflow can cause an oscillation 
with full-scale amplitude, thus rendering the filter useless. 
In general, if the input signal x(n) is sinusoidal, the reciprocal 
of the gain ‘‘s’’ of the IIR filter is used to prevent output 
overflows. 

For the TMS320 implementation with its double- 
precision accumulator and P register, scaling down the input 
sequence by the scaling factor ‘‘s’’ while maintaining a 16-bit 
accuracy for the coefficients can accomplish the task. For 
this reason, use of the MPYK instruction for IIR filter 
implementation is not recommended. Scaling the input signal 
by a factor ‘‘s’’ results in a degradation in the overall system 
SNR. Therefore, for IIR filters, it is important to keep the 


coefficient quantization errors as small as possible since less 
accurate coefficients may cause an unstable filter if the poles 
are moved outside the unit circle. The LAC (load 
accumulator with shift) instruction on the TMS320 processors 
easily accomplishes input signal scaling. 

In the previous paragraphs, finite wordlength problems 
associated with digital filter implementation on 
programmable devices were discussed. The 16-bit 
coefficients and the 32-bit accumulator of the TMS320 
processor help minimize the quantization effects. Special 
instructions also help overcome problems in the accumulator. 
These features, in addition to a powerful instruction set, make 
the TMS32010 and TMS32020 ideal programmable 
processors for filtering applications. 


SOURCE CODE USING THE TMS320 


Examples of TMS320 source code for the 
implementation of two FIR filters and three IIR filters, based 
on the techniques described in this application report, are 
contained in the appendixes. Plots of the magnitude response, 
log-magnitude response, unit-sample response, and other 
pertinent data precede the filter programs. 

Five filter types are presented in the three appendixes 
as follows: 

Appendix A  Length-80 bandpass FIR filter 
(TMS32010 and TMS32020) 


Appendix B_ Length-60 FIR differentiator 
(TMS32010/TMS32020) 
Appendix C  Fourth-order lowpass IIR filters: 


direct-form, cascade, and parallel 
types (TMS32010/TMS32020) 

The purpose of the source code is to further illustrate 
the use of the TMS320 devices for filtering applications and 
to allow implemention and analysis of these filters. The code 
is based on the programming techniques discussed earlier 
in this report. 

TMS32020 source code is listed in the appendix for 
a length-80 FIR filter. The TMS32020 source code for the 
rest of the filter programs is identical to the TMS32010 code, 
as explained earlier. TMS32010 and TMS32020 instructions 
are compatible only at the mnemonic level. TMS32010 
source programs should be reassembled using a TMS32020 
assembler before execution. For more detail about code 
migration, refer to the TMS32020 User’s Guide appendix, 
‘*TMS32010/TMS32020 System Migration,’’ for detailed 
information .6 

These filters were designed using the Digital Filter 
Design Package (DFDP) developed by Atlanta Signal 
Processors Incorporated (ASPI).7 This package runs on 
either a Texas Instruments Professional Computer or an IBM 
Personal Computer and can generate TMS320 code for the 
filter designed. DFDP was used to design the FIR filters with 
the Remez exchange algorithm developed by Parks and 
McClellan, and to design the IIR filters by bilinear 
transformation of an elliptic analog prototype. All plots 
supplied with the filter programs were produced by DFDP. 

Filter design packages, such as DFDP, make the design 
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and implementation of digital filters straightforward. They 
allow the DSP engineer to quickly examine a variety of filters 
and understand the tradeoffs involved in. varying the 
characteristics of the filters. Several digital filter design 
packages and other useful software support from third parties 
are described in the TMS32010 Development Support 
Reference Guide.® | 

All of the TMS320 source code examples have several 
features in common that depend on the implementation and 
application. These features include the moving of filter 
coefficients from storage in program memory to data 
memory, their representation in Q15 format, and the 
instructions that control the analog interface used for testing. 

The hardware configuration that was used to test these 
filters included a Texas Instruments analog interface board 
(AIB) to provide an analog-to-digital and digital-to-analog 
interface. The sampling rate was 10 kHz in all cases. The 
filters were driven by a white-noise source, and the frequency 
response was estimated by a spectrum analyzer. Each filter 
routine contains several lines of code to initialize the analog 
interface board. The AIB signals the TMS320 that another 
input sample is available by pulling the BIO pin low. The 
TMS320 polls this pin using the BIOZ instruction. The AIB 
houses a TMS32010 device. In order to use the TMS32020 
with the AIB (PN: RTC/EVM320C-06), a specially designed 
adaptor (PN: RTC/ADP320A-06) must be inserted to convert 
TMS32020 signals to TMS32010 signals. All of these 
implementation- and application-dependent sections of code 
are labeled. 

Appendix A provides programs for the implementation 
of a length-80 linear-phase bandpass FIR filter on the 
TMS32010 and the TMS32020. The filter has been designed 
using the Parks-McClellan algorithm. Pertinent data for this 
filter is as follows: 


Passband 1.375 - 3.625 kHz 
Stopbands 0.0 - 10 kHz 
40 - 5.0 kHz 
Attenuation in stopbands -68.4 dB 
Transition regions 1.0 - 1.375 kHz 


3.625 - 4.0 kHz 


The figures preceding the program show the magnitude 
response using a linear scale, the log-magnitude response, 
and the unit-sample response. Both the magnitude response 
and the log-magnitude response illustrate the equiripple 
response expected from using the Parks-McClellan 
algorithm. The unit-sample response possesses the symmetry 
that is characteristic of linear-phase FIR filters. 

A length-60 FIR differentiator, shown in Appendix B, 
is also designed using the Parks-McClellan algorithm. 
Characteristics for the FIR differentiator are listed below. 


Lower band edge 0.0 kHz 
Upper band edge 5.0 kHz 
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Desired slope 
Maximum deviation 


0.4800 
0.3172 percent 


The log-magnitude resonse is illustrated as well as the 
unit-sample response, which is antisymmetric for an FIR 
differentiator. Because the code is written in looped form, 
there is a dramatic reduction in the amount of program space 
necessary to implement this filter. 

The three filters in Appendix C are fourth-order 
lowpass IIR filters, designed using the bilinear-transform 
technique. The first filter is based on a direct-form II 
structure, the second filter is based on a cascade structure 
with two second-order direct-form II subsections, and the 
third filter is based on a parallel structure. These three IIR 
filters are identical in terms of their frequency response and 
have the following characteristics: 


Passband 0.0 - 2.5 kHz 
Transition region 2.5 - 2.75 kHz 
Stopband 2.75 - 5.0 kHz 


Attenuation in stopband -25.17 dB 

The figures that show the magnitude response, log- 
magnitude response, phase response, group delay, and the 
unit-sample response for the three IIR filters are treated as 
a group and precede the three programs for filter 
implementation. 

Table 1 is a summary of information about the five 
digital filters that are implemented in the appendixes. 

An examination of the length-80 FIR filter 
implementation reveals the advantages of using a TMS32020 
over the TMS32010. The program memory size is reduced 
by a factor of 15 (11 words vs. 163 words) while execution 
speed is improved by a factor of 1.8. Since the other filter 
types do not take advantage of the RPTK/MACD instruction 
pair, the performance results are the same. For example, a 
fourth-order cascade-form IIR filter executes at 5.4 us using 
only 27 program memory words. 

When implementing linear-phase FIR filters, the 
designer must choose the right device for the application. 
If fast execution time and less program memory are essential, 
then the TMS32020 is the right choice. 

The IIR filters are direct transformations of analog 
filters, exhibiting the same amplitude and phase 
characteristics as their analog counterparts. IIR filters tend 
to be more efficient than FIR filters with respect to 
transitionband sharpness and filter orders required. Although 
they require less code for implementation than the FIR filters 
(TMS32010 straightline code), they show great nonlinearity 
in phase, which limits their use in some applications. 

By far the most commonly used IIR structure is the 
cascade-form realization. It has been shown that proper 
ordering of the poles and zeroes results in less .ensitivity 
to quantization noise. The Digital Filter Design Package 
designs IIR filters in cascade form only. 

By using a TMS32020 for both FIR and IIR filter 
implementations, it is possible to design a higher-order filter 
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Table 1. Summary Table of Filter Programs 


CYCLES 


Straight Line: 
TMS32010 163 32.6 
TMS32020 90 18 

(with RPTK) 


CYCLES 


Looped: 
TMS32010/20 


243 48.6 


STRUCTURE CYCLES 


Direct-Form II: 
T™TMS32010/20 F 24 


LENGTH-80 LINEAR-PHASE BANDPASS FIR (STRAIGHT-LINE CODE) 


EXECUTION TIME 
(MICROSECONDS) (WORDS) (WORDS) 


LENGTH-60 FIR DIFFERENTIATOR (LOOPED CODE) 


EXECUTION TIME 
(MICROSECONDS) 


FOURTH-ORDER LOWPASS IIR FILTERS 


EXECUTION TIME 
(MICROSECONDS) (WORDS) (WORDS) 


Cascade: 
™4S32010/20 2 


4.8 24 16 
5.4 27 18 
5.6 28 18 


7 
Parallel: 
™MS3210/20 28 


PROGRAM MEMORY 


DATA MEMORY 


163 120 
11 161 


DATA MEMORY 
(WORDS) 


PROGRAM MEMORY 
(WORDS) 


11 120 


PROGRAM MEMORY DATA MEMORY 


NOTE: The above performance figures are only given as a reference. They should not be taken as 


benchmarks since programs can always be 


than with the TMS32010. The TMS32020 is also ideal for 
higher-order FIR filters that require single-cycle 
multiply/accumulate operations. 


SUMMARY 


A brief review of FIR and IIR digital filters has been 
given to assist in understanding the fundamentals of digital 
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improved for better speed and memory efficiency. 


filter structure and their implementations using a digital signal 
processor. Many design examples have also been included 
to show the tradeoffs between FIR and IIR structures. 

This application report has also described methods for 
implementing FIR and IIR filters with the TMS32010 and 
TMS32020. The design engineer can now choose between 
the two devices, depending on the application. 
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APPENDIX A 
LENGTH-80 LINEAR-PHASE PASSBAND FIR FILTER 
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! 
ARAN 1 H 
Nit 1 APY 
-8@. ae Z BP eB dks. ce wi ded ae! eine othe ape atoms be Goh! acta be RED. H 
\ . 1 { | { 
: t \ 
! 1 
-186.08 : 
@. aaa 1-@006 2-aga6 3-B980 4.8008 5.aeaa 


FREQUENCY IN KILOHERTZ 


UNIT SAMPLE RESPONSE 


Ce 


rs 


, 


a. aaa 1-580 3.16900 4-7400 6.3206 7-9080 
TIME IN MILISECONDS 


3. Implementation of FIR/IIR Filters with the TMS32010/TMS3202C 


os 


OZOZESWL/OLOZESWLL OU) IM SIT UI/AlA JO uoneuowsjduy *¢ 


FIRBPASS 


0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 
0050 
0051 
0052 


0053 
0054 


0055 
0056 


0000 
0000 
0001 


0020 


0020 
0021 
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002D 
OO2E 
O02F 
0030 


FF80 
0072 


FFDC 
OO1F 


PC 1.0 85.157 16:55 


256 08-15-85 
PAGE 0001 


TRE EPEERRESERESE SEES EL ERE ESE ERS ERE SESE SESE REE EES ELE SESE SERS SE 


LINEAR-PHASE FIR FILTER 
LENGTH-80 BANDPASS FILTER 


SAMPLING FREQUENCY = 10 KHZ 


FILTER CHARACTERISTICS 


BAND 1 BAND 2 
LOWER BAND EDGE 0.0000 1.3750 
UPPER BAND EDGE 1.0000 3.6250 
NOMINAL GAIN 0.0000 1.0000 
NOMINAL RIPPLE 0.0010 0.0200 
MAXIMUM RIPPLE 0.0004 0.0076 
RIPPLE IN DB -68.3965 0.0657 


FILTER STRUCTURE 


-1 -1 -1 
Zz vA Zz 
Ta ataad Reatentanl © tented detente © SeatestenD checked ° Secleie? eel ° eerie? ee 
x(n) | l | | 
| | 
v h(0) v h(l) v h(2) v h(N~-2) 
| | I I 
| | | | 


0--->---0--->---0---> - = -9--->--- 


LER ERERELASESSELSESSELELEEESESLESEE SELES SSE SE SS 


e*eeee ee + + eee He eH Ft HHH HHH eee He HEH EH He HH 


CYCLES | EXECUTION TIME | PROGRAM MEMORY | 
| (MICROSECONDS) | (WORDS) 
w  -------- $---------------- $---------------- +- 
* | I | 
* 90 | 18 10 | 
* | | | 
x 
* (EXCLUDING INITIALIZATION AND I/O) 
REKKKKKREREREKREKREKERREKEKREKERERKEKKAKKKRKKKREAE 
* 
IDT 'FIRBPASS' 
YN EQU 45 
MODE QU 46 
CLOCK EQU 47 
XN EQU 48 
* 
AORG 0 
B START 
* 
CTABLE AORG 32 
CHO DATA >FFDC * -0.107251E-02 * 
CH1 DATA >001F * 0.973976E-03 * 


BAND 3 


4.0000 
5.0000 
0.0000 
0.0010 
0.0004 
-68 .3997 


-~O 


I 
v h(N-1) 
| 
| 
-O--->---9 
y(n) 


KaKKKKKKKAKKE 


DATA MEMORY 


(WORDS) 


RKAKKKKKKKKEK 


FIRBPASS 


0057 
0058 
0059 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 
0068 
0069 
0070 
0071 
0072 
0073 
0074 
0075 
0076 
0077 
0078 
0079 
0080 
0081 
0082 
0083 
0084 
0035 
0086 
0087 
0088 
0089 
0090 
0091 
0092 
0093 
0094 
0095 
0096 
0097 
0098 
0099 
0100 
0101 
0102 
0103 
0104 
0105 
0106 
0107 
0108 
0109 
0110 
0111 
0112 
0113 


0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
002A 
002B 
002C 
002D 
0O02E 
002F 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
003A 
003B 
003C 
003D 
003E 
003F 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 
004A 
004B 
004c 
004D 
O04E 
004F 
0050 
0051 
0052 
0053 
0054 
0055 
0056 
0057 
0058 
0059 
OO5A 
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0051 
FFE9 
FFE6 
FFBA 
FFB4 
004B 
FFF9 
0069 
00A2 
FF6F 
FFFE 
FF70 
FEF4 
0O0CB 
000B 
OOE£6 
0187 
FEES 
000B 
FE7F 
FDBF 
0192 
FFB5 
026A 
0368 
FDC2 
00co 
FCOA 
FAA3 
0347 
FE3D 
0747 
09BB 
FA3D 
052B 
EB59 
DC2A 
2D57 
2D57 
DC2A 
EB59 
052B 
FA3D 
09BB 
0747 
FE3D 
0347 
FAA3 
FCOA 
00co 
FDC2 
0368 
026A 
FFB5 
0192 
FDBF 
FE7F 


CH2 

CH3 

CH4 

CH5 

CH6 

CH7 

CH8 

CH9 

CH10 
CH11 
CH12 
CH13 
CH14 
CH15 
CH16 
CH17 
CH18 
CH19 
CH20 
CH21 
CH22 
CH23 
CH24 
CH25 
CH26 
CH27 
CH28 
CH29 
CH30 
CH31 
CH32 
CH33 
CH34 
CH35 
CH36 
CH37 
CH38 
CH39 
CH40 
CH41 
CH42 
CH43 
CH44 
CH45 
CH46 
CH47 
CH48 
CH49 
CH50 
CH51 
CH52 
CH53 
CH54 
CH55 
CH56 
CH57 
CH58 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


>0051 
>FFE9 
>FFE6 
>FFBA 
>FFB4 
>004B 
>FFF9 
>0069 
>00A2 
>FFO6F 
>FFFE 
>FF70 
>FEF4 
>00CB 
>000B 
>OOE6 
>0187 
>FEES 
>000B 
>FEVF 
>FDBF 
>0192 
>FFBS 
>026A 
>0368 
>FDC2 
>00C0 
>FCOA 
>FAA3 
>0347 
>FE3D 
>0747 
>09BB 
>FA3D 
>052B 
>EBS9 
>DC2A 
>2D57 
>2D57 
>DC2A 
>EBS9 
>052B 
>FA3D 
>09BB 
>0747 
>FE3D 
>0347 
>FAA3 
>FCOA 
>00C0 
>FDC2 
>0368 
>026A 
>FFBS 
>0192 
>FDBF 
>FE7F 


*eeee ee ee eee ee He HHH HEHEHE HHH He HHO Ee EE hh Uh hUhhhUh hh hlUhhUhhlUhhlUhhUhhUhhlUhUh Uh 


PC 1.0 85.157 


0.249065E-02 
-~0.675043E-03 
-0.771385E-03 
-0.212256E-03 
-0.229530E-02 
0.231021E-02 
-0.194902E-03 
0.322896E-02 
0.496452E-02 
-0.440419E-02 
-0.314831E-04 
-0.438169E-02 
-0.815474E-02 
0.621682E-02 
0.342216E-03 
0.704627E-02 
0.119391E-01 
-0.860811E-02 
0.346738E-03 
-0.117293E-01 
-0.175964E-01 
0.122947E-01 
-0.227426E-02 
0.188796E-01 
0.266148E~-01 
-0.175126E-01 
0.586574E-02 
-0.309240E-01 
-0.418954E-01 
0.256315E-01 
-0.137498E-01 
0.568720E-01 
0.76028G6E-01 
-0.450011E-01 
0.403853E-01 
-0.161339E+00 
~0.279963E+00 
0.352454E+00 
0.352454E+00 
-0.279963E+00 
-0.161339E+00 
0.403853E-01 
-0.450011E-01 
0.760286E-01 
0.568720E-01 
-0.137498E-01 
0.256325E-01 
-0.418954E-01 
-0.309240E-01 
0.586574E-02 
-0.175126E-01 
0.266148E~01 
0.188796E-01 
~0.227426E-02 
0.122947E-01 
-0.175964E-01 
~0.117293E-01 


eeepeee ee ee eee Hee HHH HHH HEHEHE HEHE HEH HEHEHE EEUU UU Uhl hlU hl hl hlU UU hl 
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OZOTESWL/OLOZESWLL OW) YUM SION AI/AlA Jo uoneuswsjdwuy “¢ 


tS 


FIRBPASS 


0114 
0115 
0116 
0117 
0118 
0119 
0120 
0121 
0122 
0123 
0124 
0125 
0126 
0127 
0128 
0129 
0130 
0131 
0132 
0133 
0134 
0135 
0136 
0137 
0138 
0139 
0140 
0141 
0142 
0143 
0144 
0145 
0146 
0147 
0148 
0149 
0150 
0151 
0152 
0153 
0154 


0155 
0156 


0157 
0158 
0159 
0160 


0161 


0162 
0163 
0164 
0165 


005B 
005C 
005D 
OOSE 
OOSF 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 
0068 
0069 
006A 
006B 
006C 
006D 
006E 
OO6F 


0070 
0071 


0072 
0073 
0074 
0075 
0076 
0077 
0078 


0079 
OO7A 
007B 
007C 
007D 
OO7E 


00O7F 
0080 
0081 
0082 
0083 
0084 


0085 
0086 
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000B 
FEES 
0187 
O00E6 
000B 
0OCB 
FEF4 
FF70 
PFFE 
FFOF 
00A2 
0069 
FFF9 
004B 
FFB4 
FFBA 
FFE6 
FFE9 
0051 
001F 
FFDC 


000A 
O1F3 


0072 


C807 
CA70 
582E 
E02E 
CA71 
582F 
El2F 


5588 
DO00 
0200 
CB4F 
FCAO 
0020 


CEOS 
FA80 
0084 
FF80 
0080 
8230 


D100 
O3FF 


CH59 
CH60 
CH61 
CH62 
CH63 
CH64 
CH65 
CH66 
CH67 
CH68 
CH69 
CH70 
CH71 
CH72 
CH73 
CH74 
CH75 
CH76 
CH77 
CH78 
CH79 
* 


MD 
SMP 
* 


START 
* 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


DATA 
DATA 


EQU 


>000B 
>FEES 
>0187 
>00E6 
>000B 
>O00CB 
>FEF4 
>FF70 
>FFFE 
>FP6OF 
>00A2 
>0069 
>FFF9 
>004B 
>FFB4 
> FFBA 
>FFE6 
>FFES 
>0051 
>OO1F 
>FFDC 


ee ee eee eH He Heh ehUOhUhUhrhUhhUhHhUhUhhUhMLhUh}DhUm% 


>000A 
>O1F3 


$ 
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0. 346738E-03 
-0.860811E-02 
0.119391E-01 
0.704627E-02 
0.342216E-03 
0.621682E-02 
-0.815474E-02 
-0.438169E-02 
-0.314831E-04 
-0.440419E-02 
0.496452E-02 
0.322896E-02 
-0.194902E-03 
0.231021E~-02 
-0.229530E-02 
-0.212256E-03 
-0.771385E-03 
-0.675043E-03 
0.249065E-02 
0.973976E-03 
-0.107251E-02 


»e Oe ee ee ee Hee HHH HH DS DH DD 


3; SAMPLING RATE OF 10 KHZ * 


* INITIALIZATION OF THE ANALOG INTERFACE BOARD 


*® 


+. 


LOAD 


LDPK 
LACK 
TBLR 
OUT 

LACK 
TBLR 
ouT 


7 
MD 

MODE 
MODE, PAO 
SMP 
CLOCK 
CLOCK, PAl 


FILTER COEFFICIENTS 


LARP 
LRLK 


RPTK 
BLKP 


CNFP 


BIOZ 


LRLK 


ARO 
ARO,>200 


>4F 
CTABLE, *+ 


NXTPT 


WAIT 


XN, PA2 


AR1,>3FF 


; USE ARO FOR INDIRECT ADDRESSING 
3; POINT TO BLOCK BO 


3 80 COEFFICIENTS 


3; USE BLOCK BO AS PROGRAM AREA 
+ BIO PIN GOES LOW WHEN A 


3 NEW SAMPLE IS AVAILABLE 


BRING IN THE NEW SAMPLE XN 


= 


POINT TO THE BOTTOM OF BLOCK Bl 


me 


PIRBPASS 


0166 
0167 
0168 
0169 
0170 
0171 
0172 


0173 
0174 
0175 
0176 
0177 
0178 
0179 


0180 
0181 


0087 


0088 
0089 


008A 
008B 
008sc 


008D 
OO8E 


OO8FP 


0090 
0091 


NO ERRORS, 
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5589 


A000 
CA0O 


CB4F 
5c90 
FFOO 


CE15 
692D 


E22D 


PF80 
0080 


a 


NO WARNINGS 


LARP 


MPYK 
ZAC 


RPTK 
MACD 


YN,1 
YN, PA2 


WAIT 


pc 1.0 85.157 16:55:56 08-15-85 
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; OUTPUT THE FILTER RESPONSE y(n) 
; GO GET THE NEXT POINT 


cs 
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FIRBPASS 


0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 
0050 
0051 
0052 
0053 
0054 
0055 
0056 
0057 


32010 FAMILY MACRO ASSEMBLER 


0000 
0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 


PC2.1 84.107 


20 
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PE SRE SESSSESLE SEES EEE RE RR EEE RESALE SESS RS SESE ERE E SER EEE REE ESS 


SAMP 


x(n) 
Vv 


“ee eee ee eee He HH eH HH HH HH HE He HH HS HH 


LINEAR-PHASE FIR FILTER 
LENGTH-80 BANDPASS FILTER 


LING FREQUENCY = 10 KHZ 


FILTER CHARACTERISTICS 


BAND 1 
LOWER BAND EDGE 0.0000 
UPPER BAND EDGE 1.0000 
NOMINAL GAIN 0.0000 
NOMINAL RIPPLE 0.0010 
MAXIMUM RIPPLE 0.0004 
RIPPLE IN DB -68 .3965 


BAND 2 


1.3750 
3.6250 
1.0000 
0.0200 
0.0076 
0.0657 


FILTER STRUCTURE 


-1 -1 
z Zz 


0--->---0--->---0--->---0---> - 


| | 
| 
h(0) i h(1) Y h(2) 


| | 


O--->---0--->---0---> = 


z 


-9O-77"°" 
| 

| 

v h(N 
| 

| 
-0o---> 


BAND 3 


4.0000 
5.0000 
0.0000 
0.0010 
0.0004 
-68 .3997 


>---0 


| 
| 
-2) v h(N-1) 
| 
| 


----0--->---o 
y(n) 


PES RECS REE RSESEE SELES ETE SES ESE SESE SE SESE RE REE ERE RE SEES EES ESS 


CYCLES 


EXECUTION TIME 
(MICROSECONDS) 


PROGRAM MEMORY 
(WORDS ) 


DATA MEMORY 
(WORDS) 


(EXCLUDING INITIALIZATION AND I/0) 


eee ee +e FH HF H 
~ 
for 
Ww 


@ 


PEPE S SEER EE EE SE SESE SIRS ER SEER SESE SS ESE SESE SESE RE REESE RE RES EE 


IDT 
XN EQU 
XNM1 EQU 
XNM2 EQU 
XNM3 EQU 
XNM4 EQU 
XNMS EQU 
XNM6 EQU 
XNM7 EQU 
XNM8 EQU 
XNM9 EQU 


‘FIRBPASS‘ 


WAIN UNS WN eH O 


FIRBPASS 


0058 
0059 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 
0068 
0069 
0070 
0071 
0072 
0073 
0074 
0075 
0076 
0077 
0078 
0079 
0080 
0081 
0082 
(C083 
0084 
0085 
0086 
0087 
0088 
0089 
0090 
0091 
0092 
0093 
0094 
0095 
0096 
0097 
0098 
0099 
0100 
0101 
0102 
0103 
0104 
0105 
0106 
0107 
0108 
0109 
0110 
0111 
0112 
0113 
0114 
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000A 
000B 
oo0oc 
000D 
OO0E 
OOOF 
0010 
ooll 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
001A 
001B 
oo1c 
001D 
OO1E 
OO1F 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
002A 
002B 
002c 
002D 
OO2E 
002F 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
003A 
003B 
003C 
003D 
OO03E 
003F 
0040 
0041 
0042 


XNM10 
XNM11 
XNM12 
XNM13 
XNM14 
XNM15 
XNM16 
XNM17 
XNM18 
XNM19 
XNM20 
XNM21 
XNM22 


XNM23. 


XNM24 
XNM25 
XNM26 
XNM27 
XNM28 
XNM29 
XNM30 
XNM31 
XNM32 
XNM33 
XNM34 
XNM35 
XNM36 
XNM37 
XNM38 
XNM39 
XNM40 
XNM41 
XNM42 
XNM43 
XNM44 
XNM45 
XNM46 
XNM47 
XNM48 
XNM49 
XNM50 
XNM51 
XNM52 
XNM53 
XNM54 
XNM55 
XNM56 
XNM57 
XNM58 
XNM59 
XNM60 
XNM61 
XNM62 
XNM63 
XNM64 
XNM65 
XNM66 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


EQU, 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


10 
ll 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 


PC2.1 84.107 
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€¢ 


FIRBPASS 


0115 
0116 
0117 
0118 
0119 
0120 
0121 
0122 
0123 
0124 
0125 
0126 
0127 
0128 
0129 
0130 
0131 
0132 
0133 
0134 
0135 
0136 
0137 
0138 
0139 
0140 
0141 
0142 
0143 
0144 
0145 
0146 
0147 
0148 
0149 
0150 
0151 
0152 
0153 
0154 
0155 
0156 
0157 
0158 
0159 
0160 
0161 
0162 
0163 
0164 
0165 
0166 
0167 
0168 
0169 
0170 
0171 
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0043 
0044 
0045 
0046 
0047 
0048 
0049 
004A 
004B 
004c 
004D 
004E 
OO4F 


0050 
0051 
0052 
0053 
0054 
0055 
0056 
0057 
0058 
0059 
OOS5SA 
005B 
005c 
005D 
OOSE 
OOSF 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 
0068 
0069 
006A 
006B 
006C 
006D 
OO06E 
OO6F 
0070 
0071 
0072 
0073 
0074 
0075 
0076 
0077 


0078 
0079 


XNM67 
XNM68 
XNM69 
XNM70 
XNM71 
XNM72 
XNM73 
XNM74 
XNM75 
XNM76 
XNM77 
XNM78 
XNM79 
* 


HO 
Hl 
H2 
H3 
H4 
H5 
H6 
H7 
H8 
H9 
H10 
H1l 
H12 
H13 
H14 
H15 
H16 
H17 
H18 
H19 
H20 
H21 
H22 
H23 
H24 
H25 
H26 
H27 
H28 
H29 
H30 
H31 
H32 
H33 
H34 
H35 
H36 
H37 
H38 
H39 
* 
MODE 
CLOCK 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


EQU 
EQU 
EQu 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


EQU 
EQU 
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FIRBPASS 


0172 
0173 
0174 
0175 
0176 
0177 


0178 
0179 
0180 
0181 
0182 
0183 
0184 
0185 
0186 
0187 
0188 
0189 
0190 
0191 
0192 
0193 
0194 
0195 
0196 
0197 
0198 
0199 
0200 
0201 
0202 
0203 
0204 
0205 
0206 
0207 
0208 
0209 
0210 
0211 
0212 
0213 
0214 
0215 
0216 
0217 
0218 
0219 
0220 
0221 
0222 
0223 
0224 
0225 
0226 
0227 


0000 


0000 
0001 


0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
OOOA 
000B 
oo00c 
000D 
OOOE 
OOOF 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
OO1A 
001B 
oo1c 
001D 
OOlE 
OOlF 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 


32010 FAMILY MACRO 


OO7A 
0078 


F900 
002Cc 


FFDC 
OOlF 
0051 
FFE9 
FFE6 
FFBA 
FFB4 
004B 
FFEF9 
0069 
OOA2 
FFOF 
FFFE 
FF70 
FEF4 
00CB 
000B 
OOE6 
0187 
FEES 
000B 
FE7F 
FDBF 
0192 
FFB5 
026A 
0368 
FDC2 
00Cc0 
FCOA 
FAA3 
0347 
FE3D 
0747 
09BB 
FA3D 
052B 
EBS9 
DC2A 
2D57 


YN 
ONE 
* 


* 


+e +e + He 


* 


CHO 

CH1l 

CH2 

CH3 

CH4 

CHS 

CH6 

CH7 

CH8 

CH9 

CH10 
CH1l1l 
CH12 
CH13 
CH14 
CHL5 
CH16 
CH17 
CH18 
CH19 
CH20 
CH21 
CH22 
CH23 
CH24 
CH25 
CH26 
CH27 
CH28 
CH29 
CH30 
CH31 
CH32 
CH33 
CH34 


EQU 122 
EQU 123 


AORG 0 


B START 


COEFFICIENTS ARE 
STORED IN PROGRAM MEMORY - 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


ASSEMBLER 


(n). 


>FFDC 
>OO1F 
>0051 
>FFES9 
>FFE6 
>FFBA 
>FFB4 
>004B 
>FFF9 
>0069 
>00A2 
>FFOF 
>FFFE 
>FF70 
>FEF4 
>00CB 
>000B 
>O00E6 
>0187 
>FEES 
>000B 
>FET7F 
>FDBF 
>0192 
>FFBS 
>026A 
>0368 
>FDC2 
>00C0 
>FCOA 
>FAA3 
>0347 
>FE3D 
>0747 
>09BB 
>FA3D 
>052B 
>EB59 
>DC2A 
>2D57 


eee eee ee ee He He He HH HH He HEH HHH HH EHH HH hh hUhhUhHhUhHhUhHRhUhhh hh 


PC2.1 84.107 


INITIALLY *. 


-0.107251E-02 
0.973976E-03 
0.249065E-02 

-0.675043E-03 

-0.771385E-03 

-0.212256E-03 

-0.229530E-02 
0.231021E-02 

-0.194902E-03 
0.322896E-02 
0.496452E-02 

-0.440419E-02 

-0.314831E-04 

-0.438169E-02 

-~0.815474E-02 
0.621682E-02 
0.342216E-03 
0.704627E-02 
0.119391E-01 

-0.860811E-02 
0.346738E-03 

-0.117293E-01 

-0.175964E-01 
0.122947E-01 

-0.227426E-02 
0.188796E-01 
0.266148E-01 

-0.175126E-01 
0.586574E-02 

-0.309240E-01 

-0.418954E-01 
0.256315E-01 

-0.137498E-01 
0.568720E-01 
0.760286E-01 

-0.450011E-01 
0.403853E-01 

-0.161339E+00 

-0.279963E+00 
0.352454E+00 


e+ eetee ee ee 6+ FH He Ht He HH HH HH HH HH HEH HE HH HH HE He 
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* 
x 


DUE TO THE SYMMETRY OF THE IMPULSE RESPONSE * 
ONLY HALF OF THE SAMPLES OF THE IMPULSE 
RESPONSE ARE STORED. THIS MEANS THAT 
h(N-l-n) = h 


* 


PAGE 0004 


OZOZESWL/OLOZESWLL OD PIM SiO AI/AlA JO uoneuoursjduyy *¢ 


PIRBPASS 


0228 
0229 
0230 
0231 
0232 
0233 
0234 
0235 
0236 
0237 
0238 
0239 
0240 
0241 
0242 


0243 
0244 
0245 
0246 
0247 


0248 


0249 
0250 
0251 
0252 
0253 
0254 
0255 
0256 
0257 
0258 
0259 
0260 
0261 
0262 
0263 
0264 
0265 
0266 
0267 
0268 
0269 
0270 
0271 
0272 
0273 
0274 
0275 
0276 
0277 
0278 
0279 
0280 
0281 


002A 
002B 


002c 


002D 
002E 


002F 
0030 
0031 
0032 
0033 
0034 
0035 
0036 


0037 
0038 


0039 
003A 
003B 
003C 
003D 
003E 


003F 
0040 


0041 
0042 


0043 
0044 


0045 
0046 


0047 
0048 


0049 
004A 


004B 
004Cc 


004D 
OO4E 


OO04F 
0050 


0051 


32010 PAMILY MACRO ASSEMBLER 


000A 
O1F3 


6E00 


7E01 
507B 


7079 
7129 
7E2B 
6880 
6791 
107B 
F400 
0032 


4878 
4979 


F600 
003D 
F900 
0039 
4200 
7F89 


6A4F 
6D50 


6B4E 
6D51 


6B4D 
6D52 


6B4C 
6D53 


6B4B 
6D54 


6B4A 
6D55 


6B49 
6D56 


6B48 


6D57. 


6B47 
6D58 


6B46 


MD 


LOAD 


DATA >O000A 
DATA >O1LF3 


LDPK 


LACK 
SACL 


LARK 
LARK 
LACK 
LARP 
TBLR 


SUB 


0 


1 
ONE 


ARO , CLOCK 
AR1,>29 
SMP 

ARO 
*-,AR1 
ONE 


BANZ LOAD 


OUT 
OUT 


MODE, PAO 
CLOCK, PAL 


BIOZ NXTPT 


B WAIT 


IN XN,PA2 


ZAC 


LT XNM79 


MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 


HO 


XNM78 
Hl 


XNM77 
H2 


XNM76 
H3 


XNM75 
H4 


XNM74 
HS 


XNM73 
H6 


XNM72 
H7 


XNM71 
H8 


XNM70 


eee » 


* 
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PAGE 0005 


SAMPLING RATE OF 10 KHZ * 


CONTENT OF ONE IS 1 * 


THIS SECTION OF CODE LOADS 
THE FILTER COEFFICIENTS AND 
OTHER VALUES FROM PROGRAM 
MEMORY TO DATA MEMORY 


¢*eee 


INITIALIZATION OF ANALOG * 
INTERFACE BOARD * 


BIO PIN GOES LOW WHEN A * 


NEW SAMPLE IS AVAILABLE * 


BRING IN THE NEW SAMPLE XN * 


DUE TO SYMMETRY h(0) = h(79) * 
x(n-79) * h(79) * 


h(l) = n (78) * 


FIRBPASS 


0282 
0283 
0284 
0285 
0286 
0287 
0288 
0289 
0290 
0291 
0292 
0293 
0294 
0295 
0296 
0297 
0298 
0299 
0300 
0301 
0302 
0303 
0304 
0305 
0306 
0307 
0308 
0309 
0310 
0311 
0312 
0313 
0314 
0315 
0316 
0317 
0318 
0319 
0320 
0321 
0322 
0323 
0324 
0325 
0326 
0327 
0328 
0329 
0330 
0331 
0332 
0333 
0334 
0335 
0336 
0337 
0338 


0052 


0053 
0054 


0055 
0056 


0057 
0058 


0059 
OOSA 


005B 
005c 


005D 
OOSE 


OOSF 
0060 


0061 
0062 


0063 
0064 


0065 
0066 


0067 
0068 


0069 
006A 


0068 
006C 


006D 
006E 


OO06F 
0070 


0071 
0072 


0073 
0074 


0075 
0076 


0077 


32010 FAMILY MACRO ASSEMBLER 


6D59 


6B45 
6D5A 


6B44 
6D5B 


6B43 
6D5C 


6B42 
6D5D 


6B41 
6DSE 


6B40 
6D5F 


6B3F 
6D60 


6B3E 
6D61 


6B3D 
6D62 


6B3C 
6D63 


6B3B 
6D64 


6B3A 
6D65 


6B39 
6D66 


6B38 
6D67 


6B37 
6D68 


6B36 
6D69 


6B35 
6D6A 


6B34 
6D6B 


6B33 


MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


L'tD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
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OZOZESWL/OLOZESWLL 9 TM sioI14 UII/Al JO voneueursjduy “¢ 


gs 


FIRBPASS 


0339 
0340 
0341 
0342 
0343 
0344 
0345 
0346 
0347 
0348 
0349 
0350 
0351 
0352 
0353 
0354 
0355 
0356 
0357 
0358 
0359 
0360 
0361 
0362 
0363 
0364 
0365 
0366 
0367 
0368 
0369 
0370 
0371 
0372 
0373 
0374 
0375 
0376 
0377 
0378 
0379 
0380 
0381 
0382 
0383 
0384 
0385 
0386 
0387 
0388 
0389 
0390 
0391 
0392 
0393 
0394 
0395 


0078 


0079 
OO7A 


007B 
007C 


007D 
OO7E 


OO7F 
0080 


0081 
0082 


0083 
0084 


0085 
0086 


0087 
0088 


0089 
008A 


008B 
o00sc 


008D 
OO8E 


OO8F 
0090 


0091 
0092 


0093 
0094 


0095 
0096 


0097 
0098 


0099 
009A 


009B 
0o09Cc 


009D 


32010 FAMILY MACRO ASSEMBLER 


6D6C 


6B32 
6D6D 


6831 
6D6E 


6B30 
6D6F 


6B2F 
6D70 


6B2E 
6D71 


6B2D 
6D72 


6B2C 
6D73 


6B2B 
6D74 


6B2A 
6D75 


6B29 
6D76 


6B28 
6D77 


6B27 
6D77 


6B26 
6D76 


6B25 
6D75 


6B24 
6D74 


6B23 
6D73 


6B22 
6D72 


6B21 
6D71 


6B20 


MPY 


LTD 
MPY 


LTD 


MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 


H28 


XNM5O 
H29 


XNM49 
H30 


XNM48 
H31 


XNM47 
H32 


XNM46 
H33 


XNM45 
H34 


XNM44 
H35 


XNM43 
H36 


XNM42 
H37 


XNM41 
H38 


XNM40 
H39 


XNM39 
H39 


XNM38 
H38 


XNM37 
H37 


XNM36 
H36 


XNM35 
H35 


XNM34 
H34 


XNM33 
H33 


XNM32 


PC2.1 84.107 


20:41:39 


PAGE 


08-29-85 
0007 


FIRBPASS 


0396 
0397 
0398 
0399 
0400 
0401 
0402 
0403 
0404 
0405 
0406 
0407 
0408 
0409 
0410 
0411 
0412 
0413 
0414 
0415 
0416 
0417 
0418 
0419 
0420 
0421 
0422 
0423 
0424 
0425 
0426 
0427 
0428 
0429 
0430 
0431 
0432 
0433 
0434 
0435 
0436 
0437 
0438 
0439 
0440 
0441 
0442 
0443 
0444 
0445 
0446 
0447 
0448 
0449 
0450 
0451 
0452 


009E 


OO9F 
OOAO 


OOA1l 
O00A2 


00A3 
00A4 


OOA5 
OOA6 


OOA7 
OOA8 


OOA9 
OOAA 


OOAB 
OOAC 


OOAD 
OOAE 


OOAF 
00BO 


O0OB1 
00B2 


00B3 
00B4 


00B5 
00B6 


00B7 
00B8 


00B9 
OOBA 


OOBB 
OOBC 


OOBD 
OOBE 


OOBF 
00co 


00cl 
00C2 


00C3 


32010 FAMILY MACRO ASSEMBLER 


6D70 


6B1F 
6D6F 


6B1E 
6D6E 


6B1D 
6D6D 


6B1C 
6D6C 


6B1B 
6D6B 


6B1A 
6D6A 


6B19 
6D69 


6B18 
6D68 


6B17 
6D67 


6B16 
6D66 


6B15 
6D65 


6B14 
6D64 


6B13 
6D63 


6B12 
6D62 


6B1ll 
6D61 


6B10 
6D60 


6BOF 
6D5F 


6BOE 
6D5E 


6B0D 


MPY H32 


LTD XNM31 
MPY H31 


LTD XNM30 
MPY H30 


LTD XNM29 
MPY H29 


LTD XNM28 
MPY H28 


LTD XNM27 
MPY H27 


LTD XNM26 
MPY H26 


LTD XNM25 
MPY H25 


LTD XNM24 
MPY H24 


LTD XNM23 
MPY H23 


LTD XNM22 
MPY H22 


LTD XNM21 
MPY H21 


LTD XNM20 
MPY H20 


LTD XNM19 
MPY H19 


LTD XNM18 
MPY H18 


LTD XNM17 
MPY H17 


LTD XNM16 
MPY H16 


LTD XNML5 
MPY H15 


LTD XNM14 
MPY H14 


LTD XNM13 
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9S 


OZOZESW.L/OLOZESWLL OF) YIM sI9qTT YI/Ald JO vonewoursjduy *¢ 


FIRBPASS 


0453 00C4 
0454 
0455 00C5 
0456 00CE6 
0457 
0458 00C7 
0459 00C8 
0460 
0461 00C9 
0462 OOCA 
0463 
0464 00CB 
0465 00CC 
0466 
0467 OOCD 
0468 OOCE 
0469 
0470 OOCF 
0471 OODO 
0472 
0473 00D1l 
0474 00D2 
0475 
0476 00D3 
0477 00D4 
0478 
0479 OODS5 
0480 00D6 
0481 
0482 00D7 
0483 00D8 
0484 
0485 00D9 
0486 OODA 
0487 
0488 00DB 
0489 OODC 
0490 
0491 OODD 
0492 OODE 
0493 
0494 OODF 
0495 
0496 O00E0 
0497 
0498 OO0E1 
0499 
0500 O00E2 
00E3 
0501 
0502 
NO ERRORS, 


32010 FAMILY MACRO ASSEMBLER 


6D5D 


6B0C 
6D5C 


6B0B 
6D5B 


6BOA 
6DSA 


6B09 
6D59 


6B08 
6D58 


6B07 
6D57 


6B06 
6D56 


6B05 
6D55 


6B04 
6D54 


6B03 
6D53 


6B02 
6D52 


6B01 
6D51 


6B00 
6D50 


7F8F 
597A 
4A7A 


F900 
0039 


NO WARNINGS 


x 


MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


LTD 
MPY 


APAC 


H13 


XNM12 
H12 


XNM11 
Hll 


XNM1O 
H10 


XNM9 
H9 


XNM8 
H8 


XNM7 
H7 


XNM6 
H6 


XNM5 
H5 


XNM4 
H4 


XNM3 
H3 


XNM2 
H2 


XNM1 
H1 


XN 
HO 


SACH YN,1 


OUT YN,PA2 


B WAIT 


END 


PC2.1 84.107 


20:41:39 08-29-85 
PAGE 0009 


* OUTPUT THE FILTER RESPONSE y(n) * 


* GO GET THE NEXT POINT * 


APPENDIX B 
LENGTH-60 FIR DIFFERENTIATOR 


e.50 2.48 
ewas plage i teeta gdak eats pie re tet Pa Gana wate eager eee rea ta 
4. 38 Pn Cece eee ee ee eRe 
0.28 DAU rr gt 
8.10 2.98 sess AMO ete er Mr hace eel creel ox ts Ne daa bath, 
eho 1.98 : 
a. 000 6.2000 @. 4000 @. 6000 @.8e¢e 1.00e8 4.000 4.2800 4.4060 4.6000 4.8000 5.8000 
FREQUENCY IN KILOHERTZ FREQUENCY IN KILOHERTZ 
MAGNITUDE RESPONSE LOG MAGNITUDE RESPONSE 

2.58 ; : : 22.90 j j ” 

O60 te Ae pee eed ree cage Se ee tae! 
Be: “rte shaken bate ake a hat eeorts ieee ce LM teow 

-~20.AG me” eee aS ee Se a eee ae ee ae ec eC 
Gig a easel pce ah dre ie tite Shh ts iN Rie oa ee hh aaa ” 

w . - * i 
- ~48.06 J.-------- ee rs Sigh she T oars The ee Te ee 

‘va. <“poPalevesiligcvdaue eae rs wet hes bakes Aiea 44 aeee ia 

=60-60 . «Seo ia eA es a eee Ol eae Oe eee 
BoB GE ee SSENOO. lle ohm te, Behe e ot ale Bak se Se Aina es eek BES Oo aghee ta al ot teas A Be eth 
8.2 . a : -106.08 - 

@.6e0 1. e008 2-8@08 3-28000 4.0009 5. e800 a.aaa 1.aaee 2- 800A 23-8088 4-2800 5.6008 
FREQUENCY IN KILOHERTZ FREQUENCY IN KILOHERTZ 


UNIT SAMPLE RESPONSE 


9. a9 1.1888 2- 3600 3.5400 4.7208 5- 99008 
TIME IN MILISECONDS 


3. Implementation of FIR/IIR Filters with the TMS32010/TMS32020 57 


8s 


OZOZESWL/OLOZESWLL OD WM SIO YT/ALA JO voneuoursjdwy -¢ 


FIRDIF 


0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 
0050 
0051 
0052 
0053 
0054 
0055 
0056 
0057 
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0000 
o0o0ol 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
000A 
000B 
oooc 
000D 


PAGE 0001 


RRERKEKRAEHREERREREHRREREEREREERERERREEEERERRREHANAEARKREKEREKRKHEK 


ee eee eee ee ee eee Ree ee HHH DE ee ee Oe 


FIR FILTER 
LENGTH-60 DIFFERENTIATOR 


FILTER CHARACTERISTICS 


SAMPLING FREQUENCY = 10 KHZ 


LOWER BAND EDGE 0.0000 
UPPER BAND EDGE 5.0000 
DESIRED SLOPE 0.4800 
MAX % DEVIATION 0.3171 


FILTER STRUCTURE 


-1 ~1 -1 
z z Zz 


0--->---0--->---0--->---0---> - - -9---->--~-0 
x(n) 


rt | a 

: h(0) h(1) ; h(2) v h(N-2) v h(N-1) 
| I 

I | | | I 


0--->---0--->---0---> = = -9---->---0--->---0 
y(n) 


RREEKCRAEEKEEKEEEEEEEEREREKEREEREREEEKERAEEREREEEHKEKEHEKRKERKKKRKAR 


CYCLES | EXECUTION TIME | PROGRAM MEMORY | DATA MEMORY 
| (MICROSECONDS) | (WORDS ) | (WORDS) 
------- | ----------------| ---------------- | ------------ 
| 
243 «| 48.6 I ll | 120 
| | 


(EXCLUDING I/O AND INITIALIZATION) 


PERE RERESSELSSEE RES EES ESSE SEES EL ESE SESS E SESS EEE SLES SEE ee 


* 


XN 
XNM1 
XNM2 
XNM3 
XNM4 
XNM5 
XNM6 
XNM7 
XNM8 
XNM9 
XNM10 
XNM11 
XNM12 
XNM13 


IDT 'FIRDIF' 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


WONIAU ePWNHH O 


ae 
WN © 


FIRDIF 


0058 
0059 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 
0068 
0069 
0070 
0071 
0072 
0073 
0074 
0075 
0076 
0077 
0078 
0079 
0080 
0081 
0082 
0083 
0084 
0085 
0086 
0087 
0088 
0089 
0090 
0091 
0092 
0093 
0094 
0095 
0096 
0097 
0098 
0099 
0100 
0101 
0102 
0103 
0104 
0105 
0106 
0107 
0108 
0109 
0110 
0111 
0112 
0113 
0114 
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OO00E 
OOOF 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
OO1A 
001B 
oolc 
001D 
OO1LE 
OOlF 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
002A 
002B 
002c 
002D 
002E 
002F 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
003A 
003B 


003C 
003D 
003E 
003F 
0040 
0041 
0042 
0043 
0044 
0045 


XNM14 
XNM15 
XNM16 
XNM17 
XNM18 
XNM19 
XNM20 
XNM21 
XNM22 
XNM23 
XNM24 
XNM25 
XNM26 
XNM27 
XNM28 
XNM29 
XNM30 
XNM31 
XNM32 
XNM33 
XNM34 
XNM35 
XNM36 
XNM37 
XNM38 
XNM39 
XNM40 
XNM41 
XNM42 
XNM43 
XNM44 
XNM45 
XNM46 
XNM47 
XNM48 
XNM49 
XNM50 
XNM51 
XNM5S2 
XNM53 
XNM54 
XNM55 
XNM56 
XNM57 
XNM58 
XNM59 
* 

HO 

Hl 

H2 

H3 

H4 

H5 

H6 

H7 

H8 

H9 


EQuU 
EQU 
EQU 
EQU 
EQu 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQu 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


14 
15 
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6S 


FIRDIF 


0115 
0116 
0117 
0118 
0119 
0120 
0121 
0122 
0123 
0124 
0125 
0126 
0127 
0128 
0129 
0130 
0131 
0132 
0133 
0134 
0135 
0136 
0137 
0138 
0139 
0140 
0141 
0142 
0143 
0144 
0145 
0146 
0147 
0148 
0149 
0150 
0151 
0152 
0153 
0154 
0155 
0156 
0157 
0158 
0159 
0160 
0161 
0162 
0163 
0164 
0165 
0166 
0167 
0168 
0169 
0170 
0171 0000 
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0046 
0047 
0048 
0049 
004A 
004B 
004C 
004D 
OO04E 
OO4F 
0050 
0051 
0052 
0053 
0054 
0055 
0056 
0057 
0058 
0059 
005A 
005B 
o00o5Cc 
005D 
OOSE 
OO5SF 
0060 
0061 

0062 
0063 
0064 

0065 
0066 
0067 

0068 

0069 

006A 
006B 
006C 

006D 
OO6E 
006F 

0070 

0071 

0072 

0073 

0074 

0075 

0076 
0077 


0078 
0079 
OO7A 
0078 


H10 
H1ll 
H12 
H13 
H14 
H15 
H16 
H17 
H18 
H19 
H20 
H21l 
H22 
H23 
H24 
H25 
H26 
H27 
H28 
H29 
H30 
H31 
H32 
H33 
H34 
H35 
H36 
H37 
H38 
H39 
H40 
H41 

H42 
H43 
H44 

H45 

H46 
H47 

H48 

H49 

H50 
H51 
HS2 

H53 

H54 

H55 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


EQU 
EQU 
EQU 
EQU 


70 
71 
72 
73 
74 
75 
76 
77 
78 
79 
80 
81 
82 
83 
84 
85 
86 
87 
88 
89 
90 
91 
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0172 
0173 


0174 
0175 
0176 
0177 
0178 
0179 
0180 
0181 
0182 
0183 
0184 
0185 
0186 
0187 
0188 
0189 
0190 
0191 
0192 
0193 
0194 
0195 
0196 
0197 
0198 
0199 
0200 
0201 
0202 
0203 
0204 
0205 
0206 
0207 
0208 
0209 
0210 
0211 
0212 
0213 
0214 
0215 
0216 
0217 
0218 
0219 
0220 
0221 
0222 
0223 
0224 
0225 
0226 
0227 
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0000 
0001 


0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
OO0OA 
0008 
000c 
000D 
O00E 
OOOF 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
OO1A 
001B 
001Cc 
001D 
OO1E 
OOlF 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
002A 
0028 
002Cc 
002D 
OO2E 
002F 
0030 
0031 
0032 
0033 
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F900 
0040 


0030 
FFC2 
0015 
FFEF 
OOOF 
FFFO 
0010 
FFEE 
0012 
FFEB 
0016 
FFE7 
001B 
FFE] 
0022 
FFD8 
002c 
FFCB 
003D 
FFB6 
0059 
FF90 
OO8E 
FF42 
0108 
FE75 
028B 
FBO4 
ODD6 
837E 
7081 
F229 
O4FB 
FD74 
018A 
FEF? 
0OBD 
PF71 
OO6F 
FFA6 
0049 
FFC2 
0034 
FFD3 
0027 
FFDD 
OOlE 
FFE4 
0018 
FFE 


B START 


Pc2.1 84.107 


* COEFFICIENTS ARE INITIALLY * 
* STORED IN PROGRAM MEMORY * 


* 


CHO 

CH1l 

CH2 

CH3 

CH4 

CHS 

CH6 

CH7 

CH8 

CH9 

CH10 
CH11 
CH12 
CH13 
CH14 
CH15 
CH16 
CH17 
CH18 
CH19 
CH20 
CH21 
CH22 
CH23 
CH24 
CH25 
CH26 
CH27 
CH28 
CH29 
CH30 
CH31 
CH32 
CH33 
CH34 
CH35 
CH36 
CH37 
CH38 
CH39 
CH40 
CH41 
CH42 
CH43 
CH44 
CH45 
CH46 
CH47 
CH48 
CH49 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


>0030 
>FFC2 
>0015 
>FFEF 
>OOOF 
>FFFO 
>0010 
>FFEE 
>0012 
> FFEB 
>0016 
>FFE7 
>001B 
>FFE1L 
>0022 
>FFD8 
>002C 
>FFCB 
>003D 
>FFB6 
>0059 
>FF9O 
>008E 
>FF42 
>0108 
>FE75 
>028B 
>FBO4 
>ODD6 
>B37E 
>7C81 
>F229 
>O4FB 
>FD74 
>018A 
>FEF7 
>00BD 
>FF71 
>006F 
>FFA6 
>0049 
>FFC2 
>0034 
>FFD3 
>0027 
>FFDD 
>O01E 
>FFE4 
>0018 
>FFE9 


* 
* 
* 
x 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
x 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
x 
* 
* 
* 
* 
* 
* 


0.146547E-02 
-0.186717E-02 
0.670857E-03 
-0.507893E-03 
0.476907E-03 
-0.482679E-03 
0.505055E-03 
-0.536698E-03 
0.576256E-03 
-0.624602E-03 
0.681939E-03 
-0.750338E-03 
0.831878E-03 
~0.929373E-03 
0.104702E-02 
-0.119041E-02 
0.136731E-02 
-0.158880E-02 
0.187070E-02 
-0.223732E-02 
0.272579E-02 
-0.339682E-02 
0.435422E-02 
-0.578642E-02 
0.806880E-02 
-0.120382E-01 
0.198777E-01 
-0.389339E-01 
0.108105E+00 
-0.972714E+00 
-CH29 * 
-CH28 
-CH27 
-CH26 
-CH25 
-CH24 
-CH23 
-CH22 
-CH21 
-CH20 
-CH19 
-CH18 
-CH17 
~CH16 
-CHIS 
-~CH14 
~CH13 
~-CH12 
-CH11 
-CH10 


ee ee ee Oe ee eH He HH HE He FH 


ee ePeeenee ve ee ee eee He HHH HHH eH ee He 
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* FIRDIF 


0228 
0229 
0230 
0231 
0232 
0233 
0234 
0235 
0236 
0237 
0238 
0239 
0240 
0241 
0242 
0243 
0244 
0245 
0246 
0247 
0248 
0249 
0250 
0251 
0252 
0253 


0254 
0255 
0256 
0257 
0258 
0259 
0260 


0261 


0262 
0263 
0264 
0265 
0266 
0267 
0268 
0269 
0270 
0271 
0272 
0273 
0274 
0275 
0276 


0277 
0278 
0279 
0280 


0034 
0035 
0036 
0037 
0038 
0039 
003A 
003B 
003C 
003D 


O003E 
003F 


0040 


0041 
0042 


0043 
0044 
0045 
0046 
0047 
0048 
0049 
004A 


004B 
004c 


004D 
004E 
004F 
0050 
0051 
0052 


0053 
0054 


0055 


0056 
0057 


0058 
0059 


005A 
0058 


005C 
005D 
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0014 
FFED 
0011 
FFEF 
OOOF 
FFFO 
0010 
FFEA 
003D 
FFCF 


OO0A 
01F3 


6E00 


7E01 
507B 


7079 
713C 
JE3F 
6880 
6791 
107B 
F400 
0046 


4878 
4979 


6880 
F600 
0052 
F900 
004E 
4200 


703B 
7177 


7F89 


6A91 
6D90 


6B81 
6D90 


F400 
0058 


7F8F 


597A 


CH50 
CH51 
CH52 
CH53 
CH54 
CHS5 
CH56 
CHS7 
CH58 
CH59 
* 


MD 
SMP 
* 


START 
* 


LOAD 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


DATA 
DATA 


LDPK 0 


LACK 1 
SACL ONE 


LARK ARO,CLOCK 


LARK AR1 
LACK SMP 
LARP ARO 


>0014 
>FFED 
>0011 
>FFEF 
>000F 
>FFFO 
>0010 
>FFEA 
>003D 
>FFCF 


>000A 
499 


60 


TBLR *-,AR1 


SUB ONE 


BANZ LOAD 


OUT MODE 
OUT CLOC 


LARP ARO 
BIOZ NXT 


B WAIT 


IN XN,PA 


LARK ARO ,XNM59 


LARK AR1 
ZAC 


LT *-,AR 


7PAO 
K,PA1 


PT 


2 


7459 


1 


MPY *-,ARO 


LTD *,AR 
MPY *-,A 


BANZ LOO 


APAC 


SACH YN, 


1 
RO 


P 


1 
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-CH9 
~-CH8 
-CH7 
-CH6 
~CH5 
~CH4 
-CH3 
~CH2 
-CH1l 
-~CHO 


e*eeneee bee ee FH 
+e eee © H HH FH 


* SAMPLING RATE OF 10 KHZ * 


* CONTENT OF ONE IS 1 * 


THIS SECTION OF CODE LOADS 
THE FILTER COEFFICIENTS AND 
OTHER VALUES FROM PROGRAM 
MEMORY TO DATA MEMORY 


+ & % & 
+e & 


* INITIALIZATION OF ANALOG * 
* INTERFACE BOARD * 


* SET ARP TO ARO * 
* BIO PIN GOES LOW WHEN A * 


* NEW SAMPLE IS AVAILABLE * 
* BRING IN THE NEW SAMPLE XN * 


* ARO POINTS TO THE INPUT SEQUENCE * 
* AR1 POINTS TO THE IMPULSE RESPONSE * 


* ACCUMULATE LAST MULTIPLY * 


FIRDIF 32010 FAMILY MACRO ASSEMBLER 
0281 * 

0282 OO5E 4A7A OUT YN,PA2 

0283 bs 

0284 OOSF F900 B WAIT 

0060 004E 
0285 * 
0286 END 


NO ERRORS, NO WARNINGS 
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* OUTPUT THE FILTER RESPONSE y(n) * 


* GO GET THE NEXT POINT * 


APPENDIX C 


FOURTH-ORDER LOWPASS IIR FILTERS 


MAGNITUDE RESPONSE PHASE RESPONSE 
-2 : : a : 3.8 : y : 
ce Bee (050 arm Sty A On Pe ee en Re By | heat eines can eee ae eas 
ee ae ee ee ee ee oe ee oe ee | 

i] 1-8 er er re ee ee) rr Or Or Or Sr Sr Or 2 Or 2 
[.  - (hs eee ee ee ee a pe \ bn eee) Leh HSL SW ee tere eg es Be - 

1 o 

' F650 ack ee ete Rane etree Pa [ee a sl eee 

I « 

@.6 eg eae ee ee ei OE ee ce i 6 ee i ee hie ee te ge Re Sans SE Se. ge See oe = 

H €-1-8 ee i. ae fees Scr cee Mc ec eer ir er ee 
6.4 a yb g-los sbi yet ie fe Maren bh Gress ee ee Jy =: 

I meet AR See Pee da oie ed a ees ake Site ee eee 
| NC? ee Rn ee ee a ' SS eae ie ea an A ei BO Pb ee Seed Be See ee dy i Be Wn se Ro EM GTN Dee Ee 
8.8 — __, eee -4.8 ~ - - ; 

@. 99a 1.0000 2.0000 3.9800 4.0000 5.0000 @.aee 1.9000 2.9000 3- 8008 4.0080 5.0000 
FREQUENCY IN KILOHERTZ FREQUENCY IN KILOHERTZ 
LOG MAGNITUDE RESPONSE GROUP DELAY 
18.2 ° : 1-€ : ; 
aoe ee my ee teres ee eS Crees eee ere aoe rere a: | Crete Poe eer or ee eae 
eiekoe. Pee he Nene es PSG po aa we MOP pals Paw eee mee he begs ne Coe eee tee es) © eee ee er Oe ree 
4 
a -~20.0a eek a lei aie ik erie, a Se Re er a ae . peat eee ae = = 1.8 Bo8. Bg ei een ee BO eR ge, hee eS er A Bite Bet xt fo ce i Ee we a a 
fl [o) 
© 30 |: ce ci OY Co ac i a oi we > re a cS CC SC a 
S 5 
O-46.0G fr ede fe NR eee ok eC Ce cc Sc a 
rc 
50308.) breton aloe whikng oA bE oak da. ae CMO 4 oa S44 ve Ona Ware igadh het dee oer ecfee eho od ase als one ing 
B6Os00" Pics ak olathe Maes Aap RR ace eS ge haga Oso Wie sd atan, notice Fa ee ce Ae be ells RO oo tn 
-78.00 - -0-8 
@. 408 1.9000 2-e008 3.0800 4-9008 5.9000 QO. 68 1.0006 2-8880 3- G89 4.0008 5.0080 
FREQUENCY IN KILOHERTZ FREQUENCY IN KILOHERTZ 


3. Implementation of FIR/IIR Filters w 


@.a8e 1-6808 


2-8880 3-8880 4-80600 5.8860 
TIME IN MILLISECONDS 


ith the TMS32010/TMS32020 61 


c9 
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PAGE 0001 PAGE 0002 

o0ol HR KIKI KKK KK EHR KEKE REE KEKEKREREEAKHAKKRKKKKAK 0058 * 

0002 * 0059 TEESE SEERSLESES ELE SESE SESE SESE SESE SES ESSER SESE SESE SES SSS ES SSS 

0003 * FOURTH-ORDER IIR 0060 * 

0004 * ELLIPTIC LOWPASS FILTER 0061 IDT 'IIR4DIR' 

es : 0062 0000 DN EQU 0 

0006 * DIRECT-FORM II STRUCTURE 0063 0001 DNMl EQU 1 

0007 . 0064 0002 DNM2 EQU 2 

0008 * FILTER CHARACTERISTICS 0065 0003 DNM3  EQU 3 

0009 = 0066 0004 DNM4 EQU 4 

0010 * SAMPLING FREQUENCY = 10 KHZ 0067 * 

0011 . 0068 0005 Al EQU 5 

0012 * BAND 1 BAND 2 0069 0006 A2 EQU 6 

0013 * 0070 0007 A3 EQU 7 

0014 * LOWER BAND EDGE 0.00000 2.75000 0071 0008 a4 EQU 8 

0015 * UPPER BAND EDGE 2.50000 5.00000 0072 * 

0016 * NOMINAL GAIN 1.00000 0.00000 0073 0009 BO EQU 9 

0017 * NOMINAL RIPPLE 0.06000 0.06000 0074 OOOA Bl EQU 10 

0018 * MAXIMUM RIPPLE 0.05617 0.05514 0075 000B B2 EQU 11 

0019 * RIPPLE IN DB 0.47469 -25.17089 0076 o00c B3 EQU 12 

0020 * 0077 000D B4 EQU 13 

0021 * 0078 * 

0022 * FILTER STRUCTURE 0079 OOOE MODE EQU 14 

0023 * 0080 OOOF CLOCK EQU 15 

0024 , 0081 0010 YN EQU 16 

0025 * b 0082 0011 XN EQU 17 

0026 * 0 0083 0012 ONE EQU 18 

0027 * Om rn 0 0 0 0084 * 

0028 * x(n) | } -1 | y(n) 0085 0000 AORG 0 

0029 * . a vz b * 0086 * 

0030 * | 1 | 1 | 0087 0000 F900 B START 

0031 * O77 Hf 2-70 -- - => ----0 0001 OOOD 

0032 - | } <1 | 0088 * 

0033 * . a vz b * 0089 * COEFFICIENTS ARE INITIALLY * 

0034 * | 2 | 2 | 0090 * STORED IN PROGRAM MEMORY * 

0035 * O----<€----0---->----0 0091 * 

0036 * | } -1 { 0092 0002 3845 CAl DATA >3845 * 0.4396070 * 

0037 * * a vz b 7 0093 * 

0038 * 3 4 3 | 0094 * * A2 = -1.172416. THE -1.0 TERM * 

0039 : G=ESSt nos sOssshpese=0 0095 0003 E9EE CaA2 DATA >E9EE * IS IMPLEMENTED WITH A SUB AND * 

0040 * | | -1 | 0096 * * A2 CONTAINS -0.172416 = >EQEE. * 

0041 * . a ar 4 b a 0097 * 

0042 * | 4 | 4 | 0098 0004 3167 CA3 DATA >3167 * 0.3859772 * 

0043 i On- +k 8-0 == ----0 0099 0005 DDCl ca4 DATA >DDC1 * -0.2675277 * 

0044 * 0100 * 

0045 : 0101 0006 17FA CBO DATA >17FA  * 0.1873279 * 

pues i 0102 0007 2802 CBl DATA  >2B02 * 0.3360168 * 

0047 RHEE KER KEKE KKK IRI REE IHRE EKRKEEEKEKKEKREKEKKKAKEKK 0103 0008 3D1F CB2 DATA >3D1F * 0.4775291 * 

0048 * 0104 0009 2AFF CB3 DATA >2AFF * 0.3359135 * 

0049 * CYCLES | EXECUTION TIME | PROGRAM MEMORY | DATA MEMORY 0105 OOOA 17F3 CB4 DATA >17F3 * 0.1871291 * 

0050 * | (MICROSECONDS) | (WORDS) | (WORDS) 0106 * 

0051 Seas Poses Testes eee aa one Pea Seesaemaies 0107 000B 000A MD DATA >000A 

0052 * | | | 0108 000C O1F3 SMP DATA 499 * SAMPLING RATE OF 10 KHZ * 

0053 * 24 | 4.8 | 24 | 16 0109 * 

0054 * | i | 0110 000D 6£00 START LDPK 0 

0055 BO ptr ere rt rn nn ee + Olll * 

ope : 0112 000E 7E01 LACK 1 

0057 * (EXCLUDING I/O AND INITIALIZATION) 0113 OOOF 5012 SACL ONE * CONTENT OF ONE IS 1 * 


OZOTESWLL/OLOZESILL 9) YM SiO YM/ALA JO uoneyuowsduly *¢ 


€9 


IIR4DIR 


0114 
0115 
0116 
0117 
0118 
0119 
0120 
0121 


0122 
0123 
0124 
0125 
0126 
0127 
0128 
0129 
0130 
0131 
0132 
0133 


0134 


0135 
0136 
0137 
0138 
0139 
0140 
0141 
0142 
0143 
0144 
0145 
0146 
0147 
0148 
0149 
0150 
0151 
0152 
0153 
0154 
0155 
0156 
0157 
0158 
0159 
0160 
0161 
0162 
0163 
0164 
0165 
0166 
0167 


0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 


0018 
0019 
OO1A 
001B 
001C 
001D 


OOlE 
OOlF 


0020 
0021 
0022 
0023 


0024 


0025 


0026 
0027 


0028 
0029 


002A. 


002B 
002C 


002D 
002E 


002F 
0030 
0031 
0032 


0033 
0034 


0035 


32010 FAMILY MACRO ASSEMBLER 


LOAD 


LARK ARO,CLOCK 


LARK AR1,10 
LACK SMP 
LARP ARO 
TBLR *-,ARL 
SUB ONE 
BANZ LOAD 


ZAC 

SACL DN 
SACL DNM1 
SACL DNM2 
SACL DNM3 
SACL DNM4 


OUT MODE,PAO 
OUT CLOCK,PA1 


BIOZ NXTPT 


B WAIT 


IN XN,PA2 


LAC XN,15 


LT DNM1 
MPY Al 


LTA DNM2 


MPY A2 
SUB DNM2,15 


LTA DNM3 
MPY A3 


LTA DNM4 
MPY A4 


APAC 

SACH DN,1 
ZAC 

MPY B4 


LTD DNM3 
MPY B3 


LTD DNM2 


20:44:36 
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PC2.1 84.107 


* THIS SECTION OF CODE LOADS 
* THE FILTER COEFFICIENTS AND 
* OTHER VALUES FROM PROGRAM 

* MEMORY TO DATA MEMORY 


+e te & 


* THIS SECTION SETS THE * 
* INITIAL STATE OF THE * 
* FILTER TO ZERO * 


* INITIALIZATION OF ANALOG * 
* INTERFACE BOARD = 


* BIO PIN GOES LOW WHEN A * 


* NEW SAMPLE IS AVAILABLE * 


* BRING IN THE NEW SAMPLE XN * 


* IMPLEMENTATION OF SYSTEM POLES * 


* d(n-1) * a * 

1 
* THIS SECTION IS EQUIVALENT TO 
* -1.172416 * DNM2. THE -1.0 
* TERM IS IMPLEMENTED WITH THE 
* SUB DNM2,15 AND A2 CONTAINS 
* ~0.,172416 = >EQEE. 


+e et 


* IMPLEMENTATION OF SYSTEM ZEROES * 


* d(n-3) * b * 
3 


08-29-85 


IITR4DIR 


0168 0036 
0169 
0170 0037 
0171 0038 
0172 
0173 0039 
0174 003A 
0175 
0176 003B 
0177 
0178 003C 
0179 
0180 003D 
0181 
0182 O03E 
003F 
0183 
0184 
NO ERRORS, 


32010 FAMILY MACRO ASSEMBLER 


6D0B MPY B2 
* 
6B01 LTD DNML 
6DOA MPY Bl 
* 
6B00 LTD DN 
6D09 MPY BO 
* 
7F8F APAC 
* 
5910 SACH YN,1l 
* 
4A10 OUT YN,PA2 
* 
F900 B WAIT 
0020 
* 
END 


NO WARNINGS 


PC2.1 84.107 


* FINISHED FILTER * 


20:44:36 08-29-85 
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* OUTPUT THE FILTER RESPONSE y(n) * 


* GO GET THE NEXT POINT * 


OZOZESWL/OLOZESWL OU YUM SiON U/AlA JO uonejwousjduy *¢ 


IIR4CAS 


0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 
0050 
0051 
0052 
0053 
0054 
0055 
0056 
0057 
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0000 
0001 
0002 


PAGE 0001 


POO RESPECT OSCTOSCSCSCLSS SSIS SS SS SESE SEC ESE SESE EES EER EER SEE EEE SE SS 


FOURTH-ORDER IIR 
ELLIPTIC LOWPASS FILTER 


CASCADE STRUCTURE WITH 
SECOND-ORDER DIRECT~FORM II SUBSECTIONS 


FILTER CHARACTERISTICS 


SAMPLING FREQUENCY = 10 KHZ 


BAND 1 BAND 2 
LOWER BAND EDGE 0.00000 2.75000 
UPPER BAND EDGE 2.50000 5.00000 
NOMINAL GAIN 1.00000 0.00000 
NOMINAL RIPPLE 0.06000 0.06000 
MAXIMUM RIPPLE 0.05617 0.05514 
RIPPLE IN DB 0.47469 -25.17089 


FILTER STRUCTURE 


b y (n) b 
01 1 02 

On - 0-9 0 0 = 0 - $$ $0 - = 0-0 
x(n) | { -1 I | } -1 | y(n) 

a a vz b * “a vzb * 

I ll | ll | | 12] 

O--~ f= - 07-9 - 0 O---€-=-0--->---0 

{ 1 -1 | | | -1 | 

= ta vzb * oak vz b * 

| 21 | 21) | 22 | 22| 


0---<-~-0--->---0 0---<---0--->---0 


PERC ORE SE SEER SE SESS SESS SEES SLES LER EEE EE EE RS ESE RES SESE SESE ES | 


- eee eee He He HEH HEHEHE HHH eH HE HEHE eH HHH HH He EH HH OH 


CYCLES | EXECUTION TIME | PROGRAM MEMORY | DATA MEMORY 
| (MICROSECONDS) | (WORDS) | (WORDS) 
Ro ween | ---------------- | ---------------- | ------------- 
* | | | 
* 27 | 5.4 | 27 | 18 
| | | 


Re a a a a a a an a a ee ee ee ee ee ee 


* (EXCLUDING I/O AND INITIALIZATION) 
we ‘ 


RIKER RRR KEKEERERERERKREREKKEAREEERKKEKKEKRKKRKKKKAKKKKREKK 


IDT ‘IIR4CAS'* 
D2N EQU 0 
D2NM1 EQU 1 
D2NM2 EQU 2 
* 


ITR4CAS 


0058 
0059 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 
0068 
0069 
0070 
0071 
0072 
0073 
0074 
0075 
0076 
0077 
0078 
0079 
0080 
0081 
0082 
0083 
0084 


0085 
0086 
0087 
0088 
0089 
0090 
0091 
0092 
0093 
0094 
0095 
0096 
0097 
0098 
0099 
0100 
0101 
0102 
0103 
0104 
0105 
0106 
0107 
0108 
0109 
0110 
0lll 
0112 
0113 


0000 


0000 
0001 


0002 
0003 
0004 


0005 
0006 


0007 
0008 
0009 


OO0A 
000B 


000C 
000D 


OOOE 


OOOF 
0010 


ooll 
0012 
0013 


32010 FAMILY MACRO ASSEMBLER 


0003 
0004 
0005 


0006 
0007 
0008 


0009 
000A 


000B 
000c 
000D 


OOOE 
OOOF 


0010 
0011 
0012 
0013 
0014 


F900 
OOOE 


1FO05 
2B75 
LEFD 


394D 
D889 


62F1 
26DB 
62ED 


FEF7 
90EE 


000A 
O1F3 


6£00 


7E01 
5014 


7011 
710B 
7EOD 


DIN 
D1INM1 
D1INM2 
* 


BOl 
Bll 
B21 
* 

All 
A21 
* 

BO2 
Bl2 
B22 
* 

Al2 
A22 
* 


MODE 
CLOCK 
YN 

XN 
ONE 


** % 


* 
CBOl 
CBll 
CB21 
* 

CAll 
CA21 
* 

CBO2 
CB12 
CB22 
* 

CAl12 
CA22 
* 


MD 
SMP 
* 


START 
* 


EQU 
EQU 
EQU 


Nl & Ww 


EQU 6 
EQU 7 
EQU 8 


EQU 9 
EQU 1 


EQU 11 
EQU 12 
EQU 13 


EQU 14 
EQU 15 


EQU 16 
EQU 17 
EQU 18 
EQU 19 
EQU 20 


AORG 0 


B START 


DATA 
DATA 
DATA 


DATA 
DATA 


DATA 
DATA 
DATA 


DATA 
DATA 


DATA 
DATA 


LDPK 0 


LACK 1 
SACL ONE 


LARK ARO,CLOCK 


LARK AR1l 
LACK SMP 


>1F05 
>2B75 
>1LEFD 


>394D 
>D889 


>62F1 
>26DB 
>62ED 


>FEF7 
>90EE 


>O00A 
499 


rll 


* 


* 
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COEFFICIENTS ARE INITIALLY * 
STORED IN PROGRAM MEMORY * 


0.242342 * 
0.339521 * 


0.242117 * 


0.447687 * 
-0.308310 * 


0.772990 * 
0.303581 * 
0.772887 * 
-0.008080 * 
-0.867723 * 


SAMPLING RATE OF 10 KHZ * 


CONTENT OF ONE IS 1 * 


THIS SECTION OF CODE LOADS * 
THE FILTER COEFFICIENTS AND * 
OTHER VALUES FROM PROGRAM bol 


OZOZESWL/OLOZESWLL 2) TIM SION YI/AlA JO uonejuowsjduy -¢ 


$9 


IITR4CAS 


0114 
0115 
0116 
0117 


0118 
0119 
0120 
0121 
0122 
0123 
0124 
0125 
0126 
0127 
0128 
0129 
0130 


0131 


0132 
0133 
0134 
0135 
0136 
0137 
0138 
0139 
0140 
0141 
0142 
0143 
0144 
0145 
0146 
0147 
0148 
0149 
0150 
0151 
0152 
0153 
0154 
9155 
0156 
0157 
0158 
0159 
0160 
0161 
0162 
0163 
0164 
0165 
0166 
0167 


0014 
0015 
0016 
0017 


0018 
0019 
001A 
001B 
001C 
001D 
OO1E 
OoOlF 


0020 
0021 


0022 
0023 
0024 
0025 
0026 
0027 


0028 
0029 


002A 
002B 


002c 
002D 
QO2E 
002F 


0030 
0031 


0032 
0033 


0034 
0035 


0036 
0037 


0038 


0039 


32010 FAMILY MACRO ASSEMBLER 


6880 LOAD LARP ARO 
6791 TBLR *-,AR1 
1014 SUB ONE 
F400 BANZ LOAD 
0014 
* 
7E89 ZAC 
5000 SACL D2N 
5001 SACL D2NM1 
5002 SACL D2NM2 
5003 SACL DIN 
5004 SACL DINM1 
5005 SACL DINM2 
* 
4810 OUT MODE,PAO 
4911 OUT CLOCK,PA1 
* 
F600 WAIT BIOZ NXTPT 
0026 
F900 B WAIT 
0022 
* 
4213 NXTPT IN XN,PA2 
* 
2F13 LAC XN,15 
* 
6A04 LT D1INM1 
6D09 MPY All 
* 
6c05 LTA D1NM2 
6DOA MPY A21 
* 
7F8F APAC 
* 
5903 SACH D1N,1 
* 
7TF89 ZAC 
* 
6D08 MPY B21 
* 
6B04 LTD DINM1 
6D07 MPY Bll 
® 
6B03 LTD DIN 
6D06 MPY BOl 
* 
* 
* 
6CcOl LTA D2NM1 
6D0E : MPY Al2 
6C02 LTA D2NM2 
6DOF MPY A22 
* 
TF8F APAC 
* 
5900 SACH D2N,1 


*» 


* 
* 


*® 


*® 


* 


* 


® 
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PAGE 0003 
MEMORY TO DATA MEMORY * 


THIS SECTION SETS THE * 
INITIAL STATE OF THE * 
FILTER TO ZERO : 


INITIALIZATION OF ANALOG * 
INTERFACE BOARD * 


BIO PIN GOES LOW WHEN A * 


NEW SAMPLE IS AVAILABLE * 


BRING IN THE NEW SAMPLE XN * 


START FIRST CASCADE SECTION * 


d (n-1l) *% a * 
1 ll 


FINISHED FIRST CASCADE SECTION * 


START SECOND CASCADE SECTION * 


d (n-l1) * a * 
2 12 


IITR4CAS 


0168 
0169 
0170 
0171 
0172 
0173 
0174 
0175 
0176 
0177 
0178 
0179 
0180 
0181 
0182 
0183 
0184 
0185 
0186 


003A 
003B 


003C 
003D 


OO03E 
003F 


0040 
0041 


0042 


0043 
0044 
0187 
0188 
NO ERRORS, 
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7F89 ZAC 
* 
6D0D MPY B22 
* 
6B01 LTD D2NM1 
6DOC MPY B12 
* 
6800 LTD D2N 
6D0B MPY BO2 
* 
7P8F APAC 
* 
5912 SACH YN,1 
* 
* 
4Al2 OUT YN, PA2 
* 
F900 B WAIT 
0022 
* 
END 


NO WARNINGS 


* 


* 


20:43:59 


PC2.1 84.107 
: PAGE 0004 


FINISHED SECOND CASCADE SECTION * 
AND FILTER * 


OUTPUT THE FILTER RESPONSE y(n) * 


GO GET THE NEXT POINT * 
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PAGE 0001 PAGE 0002 

000ol PEERESESERSESEELESLESE SESE EEE REL ELSES ES ER ESLER E RSE RS ERE REE EES ET TY 0058 * H i | 

0002 * 0059 * 28s 5.6 | 28 | 18 

0003 * FOURTH-ORDER IIR 0060 * | | J 

0004 * ELLIPTIC LOWPASS FILTER 0061 Bo wee nn nnn nn on on no en nn en ne ene 

0005 * 0062 * 

0006 * PARALLEL STRUCTURE 0063 * (EXCLUDING I/O AND INITIALIZATION) 

0007 * * 

0008 * FILTER CHARACTERISTICS aoe RRR REE RRR RE RE EEE EREEREEREE ARERR ER RERERERERER RAEN 

0009 * 0066 * 

0010 * SAMPLING FREQUENCY = 10 KHZ 0067 * 

0011 bd 0068 IDT 'IIR4PAR' 

0012 * BAND 1 BAND 2 0069 0000 D2N EQU 0 

0013 * 0070 0001 D2NMl1 EQU 1 

0014 * LOWER BAND EDGE 0.00000 2.75000 0071 0002 D2NM2 EQU 2 

0015 * UPPER BAND EDGE 2.50000 5.00000 0072 0003 DIN EQU 3 

0016 * NOMINAL GAIN 1.00000 0.00000 0073 0004 DINMl EQU 4 

0017 * NOMINAL RIPPLE 0.06000 0.06000 0074 0005 DINM2 EQU 5 

0018 * MAXIMUM RIPPLE 0.05617 0.05514 0075 * 

0019 * RIPPLE IN DB 0.47469 -25.17089 0076 0006 GOl EQU 6 

0020 Se 0077 0007 Gll EQU 7 

0021 bg 0078 * 

0022 * FILTER STRUCTURE 0079 0008 All EQu 8 

0023 * 0080 0009 A2l EQU 9 

0024 * Cc 0081 * 

0025 * $----------------- e-------------- = + 0082 OOOA G02 EQU 10 

0026 * | | 0083 000B Gi2 EQU 11 

0027 * ig g p(n) v 0084 * 

0028 * | 01 1 ] 0085 oo0c Al2 EQU 12 

0029 ba © Santee Seateatiaal © Detentan Oeeetetenl © Lenteettan® aeatertenten © tentestecten? edectentead * leche Dated ° teteatee Decentead °) 0086 0O0OO0D A22 EQU 13 

0030 * x(n) | | } -1 I | y(n) 0087 * 

0031 * * a vz g x { 0088 OOOE C EQU 14 

0032 * | | ll sf ll | | 0089 * 

0033 * | O----<--=-0---->----0 | 0090 OOOF MODE EQU 15 

0034 * | | | <-1 | 0091 0010 CLOCK EQU 16 

0035 * | a a vz I 0092 0011 YN EQU 17 

0036 * v | 21 | . 0093 0012 XN EQU 18 

0037 * | 0----<----0 | 0094 0013 ONE EQU 19 

0038 * | | 0095 0014 Pl EQU 20 

0039 * { | 0096 * 

0040 * | g p(n) | 0097 0000 AORG 0 

0041 * | 02 2 | 0098 * 

0042 * +--->---0---->----0---- > ----0--->---+ 0099 0000 F900 B START 

0043 * | | -1 | 0001 000D 

0044 * Zz a Vv Zz g . 0100 * 

0045 * | 12 | 12 | 0101 * COEFFICIENTS ARE INITIALLY * 

0046 * O-~--<----0---->----0 0102 * STORED IN PROGRAM MEMORY * 

0047 * | } -1 0103 * 

0048 * - a vz 0104 = 

0049 * | 22 | 0105 0002 C750 cGOl DATA >C750 * ~0.4428446 * 

0050 * 0----<----0 0106 0003 5F2C cGll DATA >5F2C * 0.743559] * 

0051 “ 0107 * 

0052 > 0108 0004 394D CAll DATA >394D * 0.447687 * 

0053 HREM EEEREKE EERE ERKEREREREKKEKEEERKREKEAKKHKKEKN 0109 0005 D889 CA21l DATA >D889 * 0.308310 * 

0054 * 0110 * 

0055 * CYCLES | EXECUTION TIME | PROGRAM MEMORY | DATA MEMORY 0111 0006 F721 CGO2 DATA >F721 * -0,0693031 * 

0056 * | (MICROSECONDS) | (WORDS) | (WORDS ) 0112 0007 EFAE CG12 DATA >EFAE * -0.1274959 * 

0057 twee ee $---------------- tenn enn nee ----- = +------------ 0113 * 


OZOZESWL/OLOZESILL 2) YM SJONIY YII/AlA JO uoneuowajduy -¢ 


L9 


ILR4PAR 


0114 
0115 
0116 
0117 
0118 
0119 
0120 
0121 
0122 
0123 
0124 
0125 
0126 
0127 
0128 
0129 
0130 
0131 
0132 
0133 


0134 
0135 
0136 
0137 
0138 
0139 
0140 
0141 
0142 
0143 
0144 
0145 
0146 


0147 


0148 
0149 
0150 
0151 
0152 
0153 
0154 
0155 
0156 
0157 
0158 
0159 
0160 
0161 
0162 
0163 
0164 
0165 
0166 
0167 


0008 
0009 


OOOA 


0008 
000c 


000D 


OOOE 
OOOF 


0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
OOLA 
001B 
oolCc 
001D 
OO1E 


OO1F 
0020 


0021 
0022 
0023 
0024 
0025 
0026 


0027 
0028 


0029 
002A 


002B 
002Cc 
002D 
002E 


002F 
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FEF7 
9OEE 


5988 


OOOA 
O1F3 


6E00 


7EO1 
5013 


7010 
TL0A 
TEOC 
6880 
6791 
1013 
F400 
0013 
7F89 
5000 
5001 
5002 
5003 
5004 
5005 


480F 
4910 


F600 
0025 
F900 
0021 
4212 
2F12 


‘ 
6A05 
6D09 


6B04 
6D08 


7F8F 
5903 
TF89 
6D07 


6B03 


CA12 
CA22 
* 

cc 

* 


MD 
SMP 
* 


START 
* 


LOAD 


DATA 
DATA 


DATA 


DATA 
DATA 


LDPK 


LACK 
SACL 


LARK 
LARK 
LACK 
LARP 
TBLR 


>FEF7 
>90EE 


>5988 


>000A 
>O1F3 


0 


1 
ONE 


ARO ,CLOCK 
AR1,10 
SMP 

ARO 
*-,AR1 


SUB ONE 


BANZ 


ZAC 

SACL 
SACL 
SACL 
SACL 
SACL 
SACL 


LOAD 


D2N 
D2NM1 
D2NM2 
DIN 
DINM1 
D1INM2 


OUT MODE, PAO 
OUT CLOCK,PA1 


BIOZ 


NXTPT 


B WAIT 


IN XN,PA2 


LAC XN,15 


LT DINM2 
MPY A21 


LTD DINM] 
MPY All 


APAC 


SACH 


ZAC 


DIN,1 


MPY Gil 


LTD DIN 


+e * & 


* 
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-0.008080 * 

-0.867723 * 

0.699476 * 
SAMPLING RATE OF 10 KHZ * 
CONTENT OF ONE IS 1 * 
THIS SECTION OF CODE LOADS * 
THE FILTER COEFFICIENTS AND * 
OTHER VALUES FROM PROGRAM * 

* 


MEMORY TO DATA MEMORY 


THIS SECTION SETS THE * 
INITIAL STATE OF THE * 
FILTER TO ZERO * 


INITIALIZATION OF ANALOG * 
INTERFACE BOARD * 


BIO PIN GOES LOW WHEN A * 


NEW SAMPLE IS AVAILABLE * 


BRING IN THE NEW SAMPLE XN * 


START FIRST PARALLEL SECTION * 


d (n-2) * a * 
1 21 


IIR4PAR 


0168 0030 
0169 
0170 0031 
0171 
0172 0032 
0173 
0174 0033 
0175 
0176 0034 
0177 0035 
0178 
0179 0036 
0180 0037 
0181 
0182 0038 
0183 
0184 0039 
0185 
0186 003A 
0187 
0188 003B 
0189 
0190 003C 
0191 003D 
0192 
0193 003E 
0194 003F 
0195 
0196 0040 
0197 
0198 0041 
0199 
0200 
0201 0042 
0202 
0203 0043 
* 0044 
0204 
0205 
NO ERRORS, 
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6D06 
7F8F 
5914 
2F12 


6A02 
6D0D 


6B01 
6DOC 


7F8F 
5900 
2F14 
6D0B 


6B00 
6DO0A 


6COE 
6D12 


TF8F 


$911 


4All 


F900 
0021 


NO WARNINGS 


* 


MPY GOl 
APAC 


SACH Pl,1l 
LAC XN,15 


LT D2NM2 
MPY A22 


LTD D2NM1 
MPY Al2 


APAC 


SACH D2N,1 


LAC P1,15 
MPY G12 


LTD D2N 
MPY GO2 


LTA C 
MPY XN 


APAC 


SACH YN,1 


OUT YN,PA2 


B WAIT 


END 


PC2.1 84.107 20:45:18 
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* FINISHED FIRST PARALLEL SECTION * 
* START SECOND PARALLEL SECTION * 


* ad (n-2) * a * 
2 22 


* FINISHED SECOND PARALLEL SECTION * 
* AND FINISHED FILTER 


* OUTPUT THE FILTER RESPONSE y(n) * 


* GO GET THE NEXT POINT * 
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INTRODUCTION 


The Fourier transform converts information from the time 
domain into the frequency domain. It is an important 
analytical tool in such diverse fields as acoustics, optics, 
seismology, telecommunications, speech, signal processing, 
and image processing. In discrete-time systems, the Discrete 
Fourier Transform (DFT) is the counterpart of the 
continuous-time Fourier transform. Since the DFT is 
computation-intensive, it had relatively few applications, 
even with modern computers. The Fast Fourier Transform 
(FFT) is the generic name for a class of computationally 
efficient algorithms that implement the DFT and are widely 
used in the field of Digital Signal Processing (DSP). 

Recent advances in VLSI hardware, such as the Texas 
Instruments TMS320 family of digital signal processors, have 
further enhanced the popularity of the FFT. This application 
report describes the implementation of FFT algorithms using 
the TMS32020 processor, which has features particularly 
suited to digital signal processing. This report begins with 
a discussion of the development of the DFT algorithm, 
leading to the derivation of the FFT algorithm. Special 
attention is given to various FFT implementation aspects, 
such as scaling. Although this report refers to radix-2 and 
radix-4 FFT only, the implementation techniques described 
are applicable to all FFT algorithms in general.!-3 Specific 
examples of FFT implementations on the TMS32010 
processor are contained in the book by Burrus and Parks.4 
To expedite TMS32020 FFT code development, two macro 
libraries are included in the appendices for both the direct 
and indirect memory addressing modes. TMS32020 source 
code examples are also given for a 256-point (both radix-2 
and radix-4) and a 1024-point complex FFT, along with some 
system memory considerations for implementing large FFTs. 
The FFT source code can be found in Appendices A through 
G. 


DEVELOPMENT OF THE DFT ALGORITHM 


The Discrete Fourier Transform (DFT) is the discrete- 
time version of the continuous-time Fourier transform. The 
continuous-time Fourier transform or frequency spectrum 
of an analog signal x(t) is 


[o ) 
X(w) = \ x(tle—iwtdt (1) 
oo 


where, in general, both x(t) and X(w) are complex functions 
of the continuous-time variable t and the frequency variable 
w, respectively. The continuous-time signal x(t) is converted 
to a discrete-time signal x(nT) by sampling it every T 
seconds. When there is no ambiguity, the sampling period 


T notation is dropped and the discrete signal is represented 
by x(n). The Fourier transform of the discrete signal is given 
by 


co 


X(w) = 


n=—o 


x(n)e—jJwn (2) 


where w represents normalized frequency and takes on values 
between 0 and 27. X(w) is periodic with period 27 and, as 
a result, it is sufficient to consider its values only between 
O and 27.2 The periodicity of X(w) is a direct result of the 
sampled nature of x(n). In general, sampling in the time 
domain is associated with periodicity in the frequency domain 
and, conversely, sampling in the frequency domain is 
associated with periodicity in the time domain. This property 
is a basic result in Fourier theory, and forms the foundation 
of the DFT. 

Assume that a signal x(n) consists of N samples. Since 
no restriction is imposed on what happens outside the interval 
of N points, it is convenient to assume that the signal is 
periodically repeated. Under this assumption, and because 
of the above correspondence of sampling and periodicity, 
the Fourier transform becomes discrete with the distance 
between successive samples equal to the fundamental 
frequency of the signal in the time domain. This distance 
is 27/N in normalized frequency units. The result is the DFT, 
given by 


N-1 
X(k)= DO  x(n) wek k= 0,1,... 


n=0 


where Wy = e7J27/N, and Wy is known as the phase or 
twiddle factor. Equation (3) is generally referred to as an 
N-point DFT. Because the number of complex 
multiplications and additions required is approximately N2 
for large N, the total number of arithmetic operations 
required for a given N increases rapidly with the value of 
N. In fact, the excessively large amount of computations 
required to compute the DFT directly when N is large has 
directly prompted alternative methods for computing the DFT 
efficiently. Most of these methods make use of the inherent 
symmetry and periodicity of the above twiddle factor, as 
shown in Figure 1 for the case where N=8. 

Figure 1 shows that the following symmetry and 
periodicity relationships are true: 


Symmetry Property: Wi = —wyt 2) (4) 


Periodicity Property: wk = wh +k (5) 
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Figure 1. Symmetry and Periodicity of the Twiddle 
Factor for N=8 


In the next section, these relationships are utilized in 
the derivation of the radix-2 FFT algorithm. 


DERIVATION OF THE FFT ALGORITHM 


A more efficient method of computing the DFT that 
significantly reduces the number of required arithmetic 
operations is the so-called decimation-in-time (DIT) FFT 
algorithm.2 With the FFT, N is a factorable number that 
allows the overall N-point DFT to be decomposed into 
successively smaller and smaller transforms. The size of the 
smallest transform thus derived is known as the radix of the 
FFT algorithm. Thus, for a radix-2 FFT algorithm, the 
smallest transform or ‘‘butterfly’’ (basic computational unit) 
used is the 2-point DFT. Generally, for an N-point FFT, 


there are N resultant frequency samples corresponding to N 
time samples of the input signal x(n). For a radix-2 FFT, 
N is a power of 2. 

The number of arithmetic operations can be reduced 
initially by decomposing the N-point DFT into two N/2-point 
DFTs. This means that the input time sequence x(n) is 
decomposed into two N/2-point subsequences (hence the 
name, decimation-in-time), which consist of its even- 
numbered and odd-numbered samples with time indices 
expressed mathematically as 2n and 2n+1, respectively. 
Substituting these time indices into the original DFT equation 
gives 


N/2-1 N/2-1 
Xk) = SY x(n werk+ YP xn+1) weet Dk 
n=0 n=0 


(6) 
N/2-1 N/2-—1 
= yO x(2n) wank + wk > x(2n+1) wank 
n=0 n=0 
Since 


W2, = [e-i@z/N)2 = [e-ix(NI2)I2 = Wry 


equation (6) can be written as 


N/2-1 
Xk) = YO xn) wi, 
n=0 
N/2-1 
+ Wk DD x(2n+1) Wek, 
n=0 
(7) 
= Y(k) + WK Z(k) k = 0,1,...,.N—1 
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where Y(k) is the first summation term and Z(k) is the second 
summation term. 

Y(k) and Z(k) are further seen to be the N/2-point DFTs 
of the even-numbered and odd-numbered time samples, 
respectively. In this case, the number of complex 
multiplications and additions is approximately N + 2(N/2)2 
because, according to (7), the N-point DFT is split in two 
N/2-point DFTs, which are then combined by N complex 
multiplications and additions. Thus, by splitting the original 
N-point DFT into two N/2-point DFTs, the total number of 
arithmetic operations has been reduced. This reduction is 
illustrated in Figure 2. 

Implicit in the above derivation is the periodicity of 
X(k), Y(k), and Z(k). X(k) is periodic in k with a period 


N, while Y(k) and Z(k) are both periodic in k with a period 
N/2. Consequently, despite the fact that the index k ranges 
over N values from 0 to N—1 for X(k), both Y(k) and Z(k) 
must be computed for k between 0 and (N/2)—1 only. The 
periodicity of Y(k) and Z(k) is also assumed in Figure 2. 

Although (7) can be used to evaluate X(k) for 0 < k 
=< N-1, further reduction in the amount of computation is 
possible when the symmetry property (4) and periodicity (5) 
of the twiddle factor are utilized to compute X(k) separately 
over the following ranges: 


lst Half of Frequency Spectrum: 0 < k < (N/2)-1 


2nd Half of Frequency Spectrum: (N/2) s k s (N-1) 
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Figure 2. First DIT Decomposition of an N-Point DFT 
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Equation (7), for N/2 < k s N-1, can be rewritten as 


N/2—1 
X(k+N/2) = YO x(2n) whit N/2) 
n=0 


Ne N/2-1 

+ 

+ Wr > 
n=0 


n(k+N/2 
x(2n+1) WRUCTN/2) 


(8) 
where 0 < k S (N/2)-1 


Since 


n(k-+N/2) —" yyn(N/2) nk =. .-j2an ynk _. yank 
Wn? = Wn” Wr =e 2" Wy = Wu 


and 


x(2n) 
x(O) 


x(2) N/2-POINT DFT 
Y(k) 


x(4) 


N/2-1 


nk 
Da x(2n) Ww2 


n=0 
x(N — 2) 


x(2n + 1) 
x(1) 
N/2-POINT DFT 
x(3) Z(k) 


N/2-1 
nk 
» x(2n + 1) Wn2 
n=0 
x(N— 1) 


equation (8) can be rewritten as 


N/2-1 
X(k+N/2) = YO x(2n) WRK, 
n=0 (9) 
N/2-1 
-wkK YD x(@n+1) Wik, 


n=0 


= Y(k) — WE Zk) k = 0,1,...,(N/2)-1 
Therefore, (7) can be used to compute the first half of the 
frequency spectrum X(k) for the index range 0 < k s 
(N/2)—1, while equation (9) can be used to compute the 
second half of the frequency spectrum X(k + N/2). 

Figure 3 depicts the situation when the symmetry 
property of the twiddle factor is used to compute X(k). The 
above decimation process and symmetry exploitation can 
reduce the DFT computation tremendously. By further 
decimating the odd-numbered and even-numbered time 
samples in a similar fashion, four N/4-point DFTs can be 
obtained, resulting in a further reduction in the DFT 
computation. Consequently, to arrive at the final radix-2 DIT 
FFT algorithm, this decimation process is repetitively carried 
out until eventually the N-point DFT can be evaluated as a 
collection of 2-point DFTs or butterflies. 
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Figure 3. Decomposition of a DFT Using the Symmetry Property 
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RADIX-2 DECIMATION-IN-TIME (DIT) FFT 
BUTTERFLY 


In the radix-2 DIT FFT algorithm, the time decimation 
process passes through a total of M stages where N = 2M 
with N/2 2-point FFTs or butterflies per stage, giving a total 
of (N/2)log2N butterflies per N-point FFT. 

For the case of an 8-point DFT implemented using the 
radix-2 DIT FFT algorithm discussed in the previous pages, 
the input samples are processed through three stages. Four 
butterflies are required per stage, giving a total of twelve 
butterflies in the radix-2 implementation. Each butterfly is 
a 2-point DFT of the form depicted in Figure 4. P and Q 
are the inputs to the radix-2 DIT FFT butterfly. In general, 
the inputs to each butterfly are complex as is also the twiddle 
factor. 


Pp @) ———+ P+O Wh 


Q @)———> P-O Wh 
kk 
Wn -1 


Figure 4. A Radix-2 DIT FFT Butterfly Flowgraph 


As shown in Figure 4, the outputs P’ and Q’ of the 
radix-2 butterfly are given by 


_ k 
P =P +QWy 


k (10) 
Q=P-QWy 


While Figure 4 actually uses signal flowgraph 
nomenclature, another commonly used symbol for a radix-2 
butterfly is shown in Figure 5. 


P k 
P+Q Wa 


k 
Ww 
rey N 


k 

P-Q Wa 

Figure 5. A Simplified Radix-2, DIT FFT 
Butterfly Symbol 


For an explanation of the various notational conventions 
in use, the reader is referred to reference [3]. Both the 
flowgraph nomenclature and the butterfly symbol are used 
interchangeably in this report. 


Implementation of the FFT Butterfly with Scaling 

In the computation of the FFT, scaling of the 
intermediate results becomes necessary to prevent overflows. 
The TMS32020 processor has features optimized for digital 
signal processing and a number of on-chip shifters for 
scaling. In particular, the input scaling shifter, the 32-bit 
double-precision ALU and accumulator, and its output 
shifters are used extensively for scaling. 

To see why scaling is necessary, observe that from the 
general equation of an N-point DFT (3), application of 
Parseval’s theorem gives 


N-1 (Nal 
Y x%n) = — [x(k |2 (11) 
n=0 N x=0 
or 
, N=] , Nel 
NiJ— yD xq@ = [= D ix} a2) 
N n=0 N ,=0 


i.e., the mean-squared value of X(k) is N times that of input 
x(n). Consequently, in computing the DFT of the input 
sequence x(n), overflows may occur when fixed-point 
arithmetic is employed without appropriate scaling. To see 
how overflows can actually occur in FFT computations, 
consider the general radix-2 butterfly in the mth stage of an 
N-point DIT FFT as shown in Figure 6. 


Pm @) ————rms1 


om—— @——eoms 
Wa -1 


Figure 6. Signal Flowgraph of a Butterfly 
at the mth Stage 


From Figure 6, the final form of the FFT can be written 
as 


Pm+1 = Pm + Wy Qm 


(13) 
Qm+1 = Pm - Wx Qm 


where Pm and Qm are the inputs, and Pm +1 and Qm + 1 
are the outputs of the mth stage of the N-point FFT, 
respectively. In general, Pm, Qm, Pm+1, and Qm+1 are 
complex as is the twiddle factor. The twiddle factor can be 
expressed as 


4. Implementation of Fast Fourier Transform Algorithms with the TMS32020 75 


wk = ¢ J@7/NK = cos(x) — j sin(X) (14) 


where X = (27/N)k andj = v~—1. 


The inputs Pm and Qm can be expressed in terms of 
their real and imaginary parts by 


Pm = PR + j Pl (15) 
Qm = QR + j QI 


By substituting the values from (13) and (14), equation 
(12) becomes 


Pm+1 = PR + j PI + (QR cos(X) + QI sin(X) 
+ j (QI cos(X) — QR sin(X)) 
= (PR + QR cos(X) + QI sin(X)) 
+ } (PI + QI cos(X) — QR sin(X)) 
(16) 
Qm+1 = PR + j PI — (QR cos(X) + QI sin(X)) 
— j (QI cos(X) — QR sin(X)) 
= (PR — QR cos(X) + QI sin(X)) 
+ j (PI — QI cos(X) — QR sin(X)) 


Although the inputs of each butterfly stage have real 
and imaginary parts with magnitudes less than one, the real 
and imaginary parts of the outputs from (15) can have a 
maximum magnitude of 


1 + 1 sin(45) + 1 cos(45) = 2.414213562 


To avoid the possibility of overflow, each stage of the 
FFT is scaled down by a factor of 2. In this way, if an FFT 
consists of M_ stages, the output is scaled down by 
2M =N, where N is the length of the FFT. Even with 
scaling, overflow is possible because of the maximum 
magnitude value for complex input data. This possibility is 
avoided by scaling down the input signal by a factor of 
1.207106781, and then scaling up the output of the last FFT 
stage by the same factor. This additional scaling is not 
implemented in the code of the appendices, because the input 


signal is assumed real (i.e., the imaginary part is zero), and 
the above maximum value cannot be attained. The maximum 
value for a real input is 2. 

Using (15), the TMS32020 butterfly code is given in 
Figure 7. It is assumed that all input and output data values 
are in Q15 format; i.e., they are expressed in two’s- 
complement fractional arithmetic with the binary point 
immediately to the right of the sign bit (15 bits after the binary 
point). This code incorporates one stage of scaling (i.e., 
scaling by two) for the implementation of the general radix-2 
DIT FFT butterfly with the 16-bit sine and cosine values of 
the twiddle factor also stored in Q15 format. Note that in 
performing fractional multiplications, the product of two 
16-bit Q15 fractions is a 32-bit double-precision fraction in 
Q30 format with two sign bits. This result is illustrated in 
Figure 8, where S stands for sign bit. 

The code for a general radix-2 DIT FFT is given in 
Figure 7. In the comment section, ACC, P-REGISTER, and 
T-REGISTER represent the on-chip 32-bit accumulator, 
32-bit product register, and 16-bit temporary register of the 
TMS32020 processor, respectively. For more information 
about the TMS32020 processor and its architecture, see the 
TMS32020 User’s Guide.5 

The first block in the butterfly code of Figure 7 (starting 
with the label INIT) is for general system initialization. The 
second block of code (starting with the label BTRFLY) takes 
advantage of the double sign bits to provide a ‘“‘free’’ divide- 
by-2 scaling in calculating the term (1/2)(QR COS(X) + QI 
SIN(X)), which is the scaled real part of the product of the 
twiddle factor and Qm. In addition, since the current contents 
of memory location QR are no longer required for subsequent 
calculations, QR is also used as a temporary storage for this 
term. 

The third block of code calculates the term (1/2) 
(QI COS(X) — QR SIN(X)), which is the scaled imaginary 
part of the product of the twiddle factor and Qm. By 
completing this calculation, QI is also freed as a temporary 
storage for this term. 

The fourth block of code calculates the real parts of 
Pm + 1 and Qm + 1 and provides the divide-by-2-per-stage 
scaling function to avoid signal overflows. To perform this 
function, the input binary scaling shifter of the TMS32020 
is used. 
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* TMS32020 CODE FOR A GENERAL RADIX-2 DIT FFT BUTTERFLY y 


HHI KIKI KK IK HH KI IK IK IK KK IK KICK KI IK I I IK I HK IK I KIKI HK KK KK HI IKI KKK IKK KK KKK KK 


EQUATES FOR THE REAL AND IMAGINARY PARTS OF Xm(P) AND Xm(Q). 
THE LOCATIONS PR, PI, QR, AND QI ARE USED BOTH FOR THE INPUT 
AND THE OUTPUT DATA. 


+ ££ & 


* 


PR EQU 0 Re(Pm) STORED IN LOCATION 0 IN DATA MEMORY 
PI = EQUSsd Im(Pm) STORED IN LOCATION 1 IN DATA MEMORY 
QR EQU 2 Re(Qm) STORED IN LOCATION 2 IN DATA MEMORY 
Ql EQ 3 Im(Qm) STORED IN LOCATION 3 IN DATA MEMORY 
x 
* EQUATES FOR THE REAL AND IMAGINARY PARTS OF THE TWIDDLE FACTOR. 
* 
COSX EQU 4 COS(X) STORED IN LOCATION 4 IN DATA MEMORY 
SINX EQU 5 SIN(X) STORED IN LOCATION 5 IN DATA MEMORY 
* 
* INITIALIZE SYSTEM. 
* 

AORG >20 
INIT SPM 0 NO SHIFT AT OUTPUTS OF P-REGISTER 

SSXM SELECT SIGN-EXTENSION MODE 

ROVM RESET OVERFLOW MODE 

LDPK 4 CHOOSE DATA PAGE 4 


* 


* CALCULATE (QR COS(X) + QI SIN(X)); STORE RESULT IN QR. 
* 


BIRFLY LT QR LOAD T-REGISTER WITH QR 
MPY COSX P-REGISTER = (1/2) QR COSX 
LTP QI ACC= (1/2) QR COSX ; LOAD T-REGISTER WITH QI 
MPY SINX P-REGISTER = (1/2) QI SINX 
APAC ACC= (1/2)(QR COSX+QI SINX) 
MPY COSX P-REGISTER = (1/2) QI COSX 
LT QR LOAD T-REGISTER WITH QR 
SACH QR QR = (1/2)(QR COSX+QI SINX) 


* CALCULATE (QI COS(X) - QR SIN(X)); STORE RESULT IN QI. 
* 


PAC ACC= (1/2) QI COSX 

MPY SINX P-REGISTER = (1/2) QR SINX 
SPAC ACC= (1/2)(QI COSX - QR SINX) 
SACH QI QI = (1/2)(QI COSX - QR SINX) 


* CALCULATE Re(Pm+1) AND Re(Qm+1); STORE RESULTS IN PR AND QR. 
* 


LAC PR,14 ACC= (1/4)PR 
ADD QR, 15 ACC= (1/4)(PR + QR COSX + QI SINX) 


SACH PR, 1 PR = (1/2)(PR + QR COSX + QI SINX) 
SUBH QR ACC= (1/4)(PR - QR COSX + QI SINX) 
SACH QR, 1 QR = (1/2)(PR - QR COSX + QI SINX) 


Figure 7. TMS32020 code for a General Radix-2 DIT FFT Butterfly 
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* CALCULATE Im[Pm+1] AND Im[Qm+1]; STORE RESULTS IN PI AND QI. 
* 


LAC PI,14  ~— ACC= (1/4)PI 
ADD QI,15 ACC= (1/4)(PI 
SACH PI, 1 PI = (1/2)(PI 
SUBH QI ACC= (1/4)(PI 
SACH QI, QI = (1/2)(PI 


+ + 


QI COSX - QR SINX) 
QI COSX - QR SINX) 
QI COSX - OR SINX) 
QI COSX - QR SINX) 


Figure 7. TMS32020 Code for a General Radix-2 DIT FFT Butterfly (concluded) 
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Ss. @—15 BITS —e x S. @e— 15 BITS ——> = 


MULTIPLIER 
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Q30 


SS. e— 30 BITS —e» 


PRODUCT 


Figure 8. Multiplication of Two Q15 Numbers 


Initially, the contents of PR are scaled down by a factor 
of 4 (equivalent to a 14-bit left-shift). Note that the shift or 
scaling function is being performed while the contents of PR 
are being loaded into the 32-bit accumulator. Since the 
TMS32020 has a 32-bit double-length accumulator, no 
accuracy is lost in this binary scaling process. To generate 
the final result of Re(Pm-+ 1), the contents of QR must be 
added to the contents of the accumulator with a 1-bit right- 
shift (equivalent to a 15-bit left-shift). This means adding 
(1/4)(QR COS(X) + QI SIN(X)) to (1/4)PR, which is the 
current value held in the accumulator. The upper-half of the 
accumulator is then stored in PR with a 1-bit left-shift to yield 
the term (1/2)(PR + QR COS(X) + QI SIN(X)), which is 
precisely Re(Pm + 1) scaled down by 2. This shift or scaling 
function is being performed while the contents of the upper 
half of the accumulator are loaded into PR. At this point, 
the accumulator still has a value equal to (1/4)(PR + QR 
COS(X) + QI SIN(X)). Hence, to obtain the final result of 
Re(Qm + 1), the unscaled contents of QR must be subtracted 
from the accumulator. The upper-half of the accumulator is 
again stored in QR with a 1-bit left-shift to yield the term 
(1/2)(PR + QR COS(X) + QI SIN(X)), which is precisely 
Re(Qm + 1) scaled down by 2. 

In a similar fashion, the fifth block of code calculates 
the imaginary parts of Pm +1 and Qm + 1. Note that all the 
scaling functions performed so far have come ‘‘free’’ with 
the architecture of the TMS32020.5 

In summary, the data values are scaled down by right- 
shifting the 16-bit words as they are loaded into the 32-bit 
accumulator. In this way, full precision is still maintained 
in all calculations. The right-shifts are implemented by a 
corresponding number of left-shifts into the upper half of 
the accumulator. On the other hand, if the accumulator had 
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been single precision or 16 bits wide, all scaling operations 
would have resulted in a loss of accuracy. 


In-Place FFT Computations 

In the butterfly implementation, the set of input 
registers in data memory (PR, PI, QR, and QI) for the two 
complex inputs Pm and Qm are used for holding the two 
complex outputs Pm + 1 and Qm + 1, respectively. When the 
same set of input registers is used as output registers for 
holding the FFT results, the FFT computation is said to be 
performed in-place. Therefore, FFTs implemented on the 
TMS32020 using the general butterfly routine are performed 
in-place. 

As a general rule, an in-place FFT computation means 
that a total of 2N memory locations are required for an N- 
point FFT since the inputs to the FFT can be complex. On 
the other hand, a total of up to 4N memory locations is 
required for not-in-place computations. 

Another attractive feature of the butterfly routine is that 
temporary or scratch-pad registers are not needed for 
intermediate results or calculations. Where coefficient 
quantization and other finite wordlength effects are not 
critical, 13-bit sine and cosine values can be used instead 
of 16-bit values addressed by the MPY instruction. In this 
way, the registers COSX and SINX for the twiddle factors 
can be dispensed with altogether. For this purpose, the 
MPYK instruction, which has a 13-bit signed constant 
embedded in its opcode, can be employed instead of the MPY 
instruction in the butterfly code. In Appendix A, two FFT 
macros (NORM 1 and NORM2) illustrate the use of the MPY 
and MPYK instructions, respectively. Appendices A and B 
contain macro libraries that perform the same tasks, but in 
Appendix A they use direct addressing while ir Appendix 
B they use indirect addressing. 
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Bit-Reversal/Data Scrambling 
As shown in Figure 9, the input time samples x(n) are 
not in order, i.e., they are scrambled. Such data scrambling 
or bit reversal is a direct result of the radix-2 FFT derivation. 
On closer inspection, it is seen that the index of each input 
sample is actually bit-reversed, as shown in Table 1. 
Therefore, the input data sequence must be 


prescrambled prior to executing the FFT in order to produce 
in-order outputs. To perform bit reversal on the 8-point FFT, 
shown in Figure 9, the pairs of input samples, [x(1) and x(4)] 
and [x(3) and x(6)], must be swapped. On the other hand, 
Figure 10 has in-order input samples by rearranging the 
ordering of all the butterflies. However, the outputs are now 
bit-reversed. 


Table 1. Bit-Reversal Algorithm for an 8-Point Radix-2 DIT FFT 


NOOO AWN =O 


| INDEX | BIT PATTERN BIT-REVERSED PATTERN BIT-REVERSED INDEX 


STAGE 1 STAGE 2 STAGE 3 
x(0) x(0) 
wo z 
x(4) e X(1) 
wo o> 
x(2) Ce: = po x(2) 
e 
x6) —“2 s¢5 X(3) 
x(1) = eS x(4) 
wo e & 
x(5) 7 ° X(5) 
x(3) “ x(6) 
Ww 6 
x(7) 2 X(7) 


LEGEND FOR TWIDDLE FACTOR: Wo =Ws wi =Ws w2 =W5 w3 =Ws 


Figure 9. An In-Place DIT FFT with In-Order Outputs and Bit-reversed Inputs 


STAGE 1 STAGE 2 STAGE 3 
x(0) x(0) 
e 
x(1) : x(4) 
x(2) ‘ . x(2) 
<= 
x(3) CxO) x(6) 
xsi eS x(1) 
x(5) wo 7 SS guy x(5) 
x(6) 2 : X(3) 
e@ 
x(7) Wo x(7) 


LEGEND FOR TWIDDLE FACTOR: wo =W3 


=w! —w2 —-w3 
W1=Weg W2=Ws W3=We 


Figure 10. An In-Place DIT FFT with In-Order Inputs but Bit-Reversed Outputs 
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In general, bit reversal or data scrambling must be 
performed either at the input stage on the time samples 
(Figure 9) or at the output stage on the frequency samples 
(Figure 10). Bit reversal can be performed in-place. Such 
a process generally requires the use of one temporary data 
memory location. 

Because of its double-precision accumulator and its 
versatile instruction set, the TMS32020 processor can 
perform in-place bit reversal or data scrambling without the 
use of a temporary data memory location. For example, the 
TMS32020 code for swapping input data locations x(1) and 
x(4) is given in Figure 11. 

Although bit-reversal can be regarded as a separate task 
performed either at the input or output stage of an FFT 
implementation, some FFT algorithms exist with bit reversal 
as an integral part.5 Such algorithms are said to be in-place 
and in-order, and they tend to have higher execution speeds 
than that of the FFT and bit-reversal algorithms executed 
separately. 


A Numerical Example: 8-Point DIT FFT 

To illustrate the concept of the FFT, a numerical 
example of an 8-point, decimation-in-time FFT is presented. 
The input signal is a square pulse with four samples equal 
to 0.5 and four samples equal to zero, as shown in Figure 
12(a). The broken line in Figure 12 represents the envelope 
of the plotted signal. Figure 12(b) plots the magnitude of the 
computed FFT, where the number next to each sample 
indicates its magnitude. The choice of the amplitude for this 
example is arbitrary, but it is restricted to be less than 1 since 
it assumed that the numbers handled by the processor are 
in QIS format. 


x(n) 


0.5 


0123 45 67 n 


(a) TIME-DOMAIN SIGNAL 


(b) FOURIER TRANSFORM MAGNITUDE 


Figure 12. Time-Domain Signal and the Magnitude 
of Its FFT 


The FFT of this time signal is computed by an 8-point 
DIT FFT as shown in Figure 13. On the left side, the samples 
x(n) of the time signal are arranged in their normal order. 
On the right side, the computed samples X(k) of the FFT 
are in bit-reversed order. Since the computation produces 
complex numbers, all the numerical values are presented as 
(R, I), where R is the real part and I is the imaginary part 
of the complex number. Figure 13 shows also the numerical 
values computed in the intermediate stages. 


KICK HHA HK KHAKI KIKI KI KK IKK IKK KKK IK IKKE KK KKK KKK KKK KKK IKK KKK KKKKKKKKEKE 


* TMS32020 CODE FOR THE BIT REVERSAL OF x(1) AND x(4) . 


HI K HHI KIKI KH I KI HK IK HI KIKI IKK KK KIKI IK KK KKK II KI KKK KI KEKE IKK KKKKKKKKKKK 


* 


BITREV ZALH RX] 


ADDS RX4 
SACL =RX1 
SACH  RX4 
ZALH IX] 
ADDS IX4 
SACL IX] 
SACH  IX4 


LOAD REAL PART OF x(1) IN UPPER ACCUMULATOR 
LOAD REAL PART OF x(4) IN LOWER ACCUMULATOR 
STORE REAL PART OF x(4) IN REAL PART OF x(1) 
STORE REAL PART OF x(1) IN REAL PART OF x(4) 
LOAD IMAG PART OF x(1) IN UPPER ACCUMULATOR 
LOAD IMAG PART OF x(4) IN LOWER ACCUMULATOR 
STORE IMAG PART OF x(4) IN IMAG PART OF x(1) 
STORE IMAG PART OF x(1) IN IMAG PART OF x(4) 


Figure 11. TMS32020 Code for the Bit Reversal of x(1) and x(4) 
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x(0) = (0.5,0) © D 
x(1)=(0.5,0)© 
x(2)=(0.5,0)Q 
x(3) =(0.5,0)Q 
x(4) =(0,0)© 
x(5) =(0,0)Q 


Ws, =1 
x(6)=(0,0)Q 


ee 1) X(0) = (2,0) 


O X(2) = (0,0) 


of) X(1) = (0.5, — 1.2071) 


<2 
Pia. 


NO X(5) = (0.5,0.2071) 
<x (0.5, -0.5) 


OD X(3) = (0.5, -0.2071) 


(0.5.0.5) a ae 


We =| We J 
x(7)=(0,0)Q e PO X(7) = (0.5,1.2071) 
(0.5.0.5) 
Figure 13. Numerical Example of an 8-Point DIT FFT without Scaling 
Table 2 shows the values of the twiddle factor Wi in the FFT. The other four are related to them through the 


fori = 0,1,...,7. Of these factors, only the first four are used symmetry property (see equation (4)). 


Table 2. Numerical Values of W x where i = 0, 1, ..., 7 


TWIDDLE FACTOR VALUE 


e 7 4_ _ 
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0.7071 —j 0.7071 


— 0.7071 —j 0.7071 


— 0.7071 +j 0.7071 
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Figure 13 has demonstrated the need for scaling. 
Without scaling, the intermediate results can attain values 
greater than or equal to 1. This would cause overflows in 
an implementation that uses Q15 numbers. Therefore, scaling 
is applied as mentioned earlier. Figure 14 shows exactly the 
same example, but now every stage is scaled by 1/2. No 
overflows occur with this implementation. The final output 
is the same as in Figure 13 but scaled by 1/8. 


Special Butterflies 


Although any N-point FFT (where N is a power of 2) . 


(0. 25 0) 
x(0) =(0.5,0)Q—/2 


x(1) =(0.5,0)Q 
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ae De SS 


(0.26.0) 
0 
We 
x(6) =(0,0)0—2 D 


(0.25,0) 


x(7) =(0,0)© 


e 
(0.25,0) 


5 RD X(4) = (0,0) 
LY Wi We 
Tas Joos J 
512 


can be directly impleniented with the general butterfly only, 
special butterflies are normally used in order to increase the 
FFT execution speed. 

Special butterflies can be coded by taking advantage 
of certain sine and cosine values of the twiddle factor. For 
instance, when the angle X takes on values such as 0, 90, 
180, and 270 degrees, butterflies require much less code. 
Other special butterflies can also be coded for angles such 
as 45, 135, 225, and 315 degrees. Examples of these special 
butterflies can be found in nine macros located in Appendix 
A. 


. 25,0) 
) X(0) = (0.25,0) 


D X(2) = (0,0) 
(0,0) ee 
2 
“3 ae. 
. 2 BO X(6) = (0,0) 
(0,0) -1 
(0.125 —0.125) 


o£) X(1) = (0.125, — 0.3018) 


O X(3) = (0.125, - 0.0518) 


IO X(5) = (0.125,0.0518) 


e PE X(7) = (0.125,0.3018) 
(0.125,0.125) 


Figure 14. Numerical Example of an 8-Point DIT FFT with Scaling 
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An interesting point to be noted is that the first two 
stages of an N-point radix-2 FFT can be performed 
simultaneously with a special radix-4 butterfly to enhance 
execution speed. This special radix-4 butterfly is depicted 
in Figure 15 with the corresponding code (MACRO 9) listed 
in Appendix A. . 

The special radix-4 butterfly actually consists of four 
separate radix-2 butterflies. The radix-4 butterfly is further 
seen to be a 4-point DFT. 

Together with the general butterfly, these special 
butterflies greatly improve the execution speed of an FFT 
algorithm. An example of the use of such butterflies for an 


A , 

4 B1 
B B’ B2 
Cc C’ B3 
D D’ B4 


8-point DIT FFT is given in Figure 16. Since the FFT 
implementation is, in general, highly modular, the code in 
Figure 16 has been structured into a number of macro calls, 
including a macro for bit reversal. 

During assembly time, the TMS32020 Macro 
Assembler fully expands these macros into in-line code.5 The 
first two stages of the 8-point DIT FFT are implemented by 
the special radix-4 DIT FFT macro COMBO. The last stage 
consists of the special radix-2 DIT FFT macros ZERO, 
PIBY4, PIBY2, and PISBY4. These macros can be found 
in Appendix A. The difference from the general radix-2 DIT 
butterfly is that the angle X of the twiddle factor takes on 
the values 0, 45, 90, and 135, respectively. 


A B1 
B B2 
Cc B3 
D B4 


Figure 15. The Equivalence of Four Radix-2 Butterflies to one Radix-4 Butterfly 
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* 


AN 8-POINT DIT FFT 


* 


KIKI KIKI IKK IKK IKK IKK II KKK IK KIKI KKK IKKE KKK KKKKKKKKKKKKKKKKKKKKK KEKE 


XOR EQU 00 
X01 EQU 01 
X1R EQU 02 
X11 EQU 03 
X2R EQU 04 
X21 EQU 05 
X3R EQU 06 
X31] EQU 07 
X4R EQU 08 
X4] EQU 09 
X5R EQU 10 
X51 EQU 11 
X6R EQU 12 
X61 EQU 13 
X7R EQU 14 
X71 EQU 15 
W EQU 16 
AORG >20 
WTABLE DATA >5A82 


VALUE FOR SIN(45) OR COS(45) 


Figure 16. TMS32020 Code for an 8-Point DIT FFT Implementation 
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x 


* INITIALIZE SYSTEM 
x 


INIT SPM 0 
SSXM 
ROVM 
LOPK 
LALK 
TBLR 
x 


* MACRO FOR INPUT BIT REVERSAL 
x 
BITREV $MACRO PR,PI,QR,QI 


4 
WTABLE 
W 


ZALH =:PR 
ADDS :QR: 
SACL ~: PR: 
SACH — :QR: 
ZALH se: 
ADDS  :QI: 
SACL: 
SACH  :QI: 
$END 


x 


NO SHIFT AT OUTPUTS OF PR 
SELECT SIGN-EXTENSION MODE 
RESET OVERFLOW MODE 

CHOOSE DATA PAGE 4 

GET TWIDDLE FACTOR ADDRESS 
STORE SIN(45) OR COS(45) IN W 


* FFT CODE WITH BIT-REVERSED INPUT SAMPLES 
* 


FFT8PT BITREV 2,3,8,9 
BITREV 6,7,12,13 


* 


* FIRST & SECOND STAGES COMBINED WITH DIVIDE-BY-4 INTERSTAGE SCALING 
* 


COMBO XOR,XOI,X1R,X11,X2R,X21,X3R,X31 
COMBO X4R,X41,X5R,X51,X6R,X61,X7R, X71 


* 


* THIRD STAGE WITH DIVIDE-BY-2 INTERSTAGE SCALING 
* 


ZERO =XOR,XOI,X4R, X41 
PIBY4 X1R,X11,X5R,X5I,W 
PIBY2 X2R,X21,X6R,X61 
PI3BY4 X3R,X31,X7R,X71,W 


Figure 16. TMS32020 Code for an 8-Point DIT FFT Implementation (concluded) 


RADIX-4 DECIMATION-IN-FREQUENCY (DIF) 
FFT 


The implementation described thus far is that of a 
radix-2 FFT using Decimation In Time (DIT). The 
decimation-in-time FFT is calculated by breaking the input 
sequence x(n) into smaller and smaller sequences and 
computing their FFTs. In an alternate approach, the output 
sequence X(k), which represents the Fourier transform of 
x(n), can be broken down into smaller subsequences that are 
computed from x(n). This method is called Decimation In 
Frequency (DIF). Computationally, there is no real 
difference between the two approaches. DIF is introduced 
here for two reasons: (1) to give the reader a broader 


understanding of the different methods used for the 
computation of the FFT, and (2) to allow a comparison of 
this implementation with the FORTRAN programs provided 
in the book by Burrus and Parks.4 The programs from that 
book were the basis for the development of the radix-4 FFT 
code on the TMS32020. 

In a radix-4 FFT, each butterfly has four inputs and 
four outputs instead of two as in the case of radix-2 FFT. 
As shown in the following equations, this is advantageous 
because the twiddle factor W has special values when the 
exponent corresponds to multiples of 7/2. The end result is 
that the computational load of the FFT is reduced, and the 
radix-4 FFT is computed faster than the radix-2 FFT. 
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To introduce the radix-4 DIF FFT, equation (3) is 
broken into four summations. These four summations 
correspond to the four components in radix-4. The choice 
of having N/4 consecutive samples of x(n) in each sum is 
dictated by the choice of Decimation In Frequency (DIF). 


N-1 (N/4)-1 
Xk) = YD xmwk= PY xn) wk 
n=0 n=0 
(N/2)—1 (3N/4)—1 
+ dy x(n) wrk + dy x(n) wrk 
n=NA4 n=N/2 
N-1 (N/4)-1 
+ YD xawk= YY x) wk 
n=3N/4 n=0 
(N/4)-1 
+ wNk/4 yy x(n+N/4) wok 
n=0 
(17) 
(N/4)—1 
+ wNki2 > x(n+N/2) wk 
n=0 


ang “| 
+ Wi > 


n=0 


x(n+3N/4) wrk 


k = 0,1,....N-1 


From the definition of the twiddle factor, it can be shown that 
WhK4 = (jk, WANK? = (1k, and waNK4 = (jk 


where j is the square root of — 1. With this substitution, (17) 
can be rewritten as 


(N/4)-1 
Xk = yp 
n=0 


+(—1)kK x(n+N/2)+()K x(n +3N/4)] wok 


[x(n) +(—j)k x(n+N/4) (18) 


Equation (18) is not yet an FFT of length N/4, because 
the twiddle factor depends on N and not on N/4. To make 
it an N/4-point FFT, the sequence X(k) is broken into four 
sequences (decimation in frequency) for the cases where 
k = 4r, 4r+1, 4r+2, and 4r+3. 

Introducing this segmentation, and remembering that 


nr nr 
WN" = WN 


the following four equations (19) are derived from (18) 


(N/4)-1 
X(4r) = yp 
n=0 


[x(n) + x(n+N/4) 


nr 
+ x (n+N/2) + x(n+3N/4)] WO, WEE, 


(N/4)-1 
X@rt+l) = YS kx(n) —j x(n+N/4) 
n=0 
— x(n+N/2) + j x(n+3N/4)] Wh WN/4 
(N/4)-1 
X(4r+2) = YO [x(n) — x(n+N/4) (19) 
n=0 


2 
+ x(n+N/2) — x(n+3N/4)] Wr WN/4 


(N/4)—1 
X(4r+3) = 2. 
n=0 


— x(n+N/2) —j x(n+3N/4)] wn wit, 


[x(n) + j x(n+N/4) 
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Each one of these equations is now an N/4-point FFT 
that can be computed by repeating the above procedure until 
N=4. Note that the factors Wy, Wi, Wel, and W3 
are considered part of the signal. In general, an N-point FFT 
(where N is a power of 4) can be reduced to the computation 
of four N/4-point FFTs by transforming the input signal x(n) 


into an intermediate signal y(n), as suggested by (19). Figure 
17 shows the corresponding radix-4 DIF butterfly, which 
generates one term for each sum in (19). 

For simplicity, the notation of Figure 18 is often used 
instead of that of Figure 17 for the butterfly of radix-4 DIF 
FFT. 


xin Q S y O yin) 
SF 
x("*3) SY wy Ov(n+4) 
SY OX 
x("*3) BRS wa Ov(n+¥) 
x(n+F) WA : WN O v(n+ 


Figure 17. Radix-4 DIF Butterfly 


Oo y(n) 


Where n=0,1...., .71 


Figure 18. Alternate Form of the Radix-4 DIF Butterfly 
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Figure 19 shows an example of a 64-point, radix-4 DIF 
FFT. 

Note that the inputs are normally ordered while the 
outputs are presented in a digit-reversed order. The principle 
of digit reversal is the same as in radix-2 FFT, but now the 
digits are 0, 1, 2, and 3 (quaternary system) instead of 0 and 
1 (binary system). The code for digit reversal is the same 
as that shown in Figure 11. For example, the datapoint 
occupying location 132 (quaternary number corresponding 
to decimal 30) exchanges positions with the datapoint at 


COON OA WH = © 


\N 
WA 
My 
BRN 
\ 


location 231 (corresponding to the decimal 45). 

Another important point of the radix-4 algorithm 
regards scaling. Since each stage of the radix-4 algorithm 
corresponds to two stages of the radix-2 algorithm, equivalent 
results are obtained by dividing the output of each stage of 
the radix-4 algorithm by 4. 

Appendix E contains the implementation of a 256-point, 
radix-4 DIF FFT on the TMS32020. This implementation 
follows the one described in FORTRAN code in the book 
by Burrus and Parks.4 
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Figure 19. A 64-Point, Radix-4 DIF FFT 
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SYSTEM MEMORY AND I/O 
CONSIDERATIONS 


Unlike non-realtime FFT applications where data 
samples to be transformed are assumed to already be in data 
memory, realtime FFT applications demand careful 
considerations of data input/output and system memory 
utilization. 

The TMS32020 has 544 words of on-chip data RAM, 
organized into two 256-word blocks (BO and B1) and one 
32-word block (B2) that can be used as scratch-pad 
locations.5 In non-realtime applications, this memory 
configuration allows a 256-point complex FFT to be easily 
performed (see Appendix C). However, for realtime FFT 
applications, input/output data buffering is generally 
required. 

For small transform sizes, up to 128-point complex (or 
256-point real) FFTs, the double-buffering technique, shown 
in Figure 20, can be used for realtime applications without 
the need of any external data memory. The on-chip RAM 
blocks BO and B1 are organized into Buffer A and Buffer 
B, respectively. 


BUFFER B: 


BUFFER A: 


INPUT : x(n) 


INPUT : x(n) 


OUTPUT : X{(k) 


OUTPUT : X(k) 


INPUT/OUTPUT 


DATA CHANNELS 


Figure 20. Input/Output Double-Buffering 


1/0 CHANNELS 


START POINTER 


(N+M] COMPLEX-WORD CIRCULAR BUFFER 


Consider a 128-point complex FFT. Realtime input data 
to be transformed can be grouped into ‘‘frames’’ of 256 
words (128 complex inputs) read into either Buffer A or 
Buffer B, depending on which one is not currently being used 
by the FFT program. The idea is to use the two on-chip RAM 
blocks BO and B1 alternatively as I/O and transform buffers. 

Assuming that the frame of data in Buffer A, the current 
transform buffer, is being transformed in-place, a software 
flag is then set to indicate that Buffer B can now be used 
as the current I/O buffer. This means that while time-domain 
data is read into Buffer B, the current I/O buffer, previous 
transformed data in Buffer B must be transferred out at the 
same time to make room for the incoming data. This can 
be acomplished efficiently if the I/O transfers are sequential 
and organized in a back-to-back manner (i.e., an output 
operation followed by an input operation). 

Resetting the flag indicates that the roles of Buffer A 
and Buffer B are now reversed. In this case, Buffer B now 
has a full frame of input data ready to be transformed while 
Buffer A has a full frame of transformed data (spectral 
samples) ready to be transferred out to make room for more 
incoming time-domain data. The setting of the software flag 
is often implemented as an I/O device service routine (DSR) 
or as an interrupt handler in the case of interrupt-driven I/O. 

Although this double-buffering technique is also 
applicable to larger transforms with the use of external 
memory, the actual memory required can be optimized if 
the transform time for an N-point FFT is shorter than the 
time to assemble a frame of N complex input data samples. 
For this purpose, the circular-buffer technique, shown in 
Figure 21, can be used. 

Instead of a doubte-buffer size of 2N, a circular-buffer 
size of N+M < 2N can be used where M < N and M 
depends on the system input data rate in general. For 
example, M is chosen to be no less than 8T for an 8-kHz 
input sampling frequency and an N-point complex FFT with 
a transform time of T ms. 


END POINTER 


Figure 21. Input/Output Circular-Buffering for Large FFTs 
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A set of pointers is used to manage the data in the 
circular buffer. The start pointer is set at the beginning of 
the current frame, whereas the end pointer always indicates 
the current input data position in the circular buffer. Both 
pointers ‘‘wrap around’’ at the end of the circular buffer. 

When a complete frame of input data has been 
collected, the set of pointer values is passed to the FFT 
program to transform the frame of data. For the next frame 
of input data, the start pointer points to the location 
immediately following the last location of the previous frame. 
As before, the end pointer for the current frame tracks the 
location of the next input data, and the whole process is 
repeated. 

To decrease execution time, a large N-point FFT can 
be divided into smaller 256-point complex FFTs and executed 
256 complex points at a time utilizing the on-chip RAM, as 
shown in Figure 22. Note that the system is still collecting 
incoming time-domain data samples and storing them in the 
external circular buffer while the FFT program is executing 
with internal data RAM. When 256 complex points have been 
processed, the FFT program returns them to the external 


buffer while fetching the next set of 256 samples for 
execution. 

This scheme takes advantage of the fact that off-chip 
data accesses take two cycles each while on-chip data 
accesses take one cycle each. Certain instructions (e.g., 
SACL and SACH) even take three cycles to execute when 
operating on external RAM. To speed execution, off-chip 
data blocks can be efficiently moved into on-chip data 
memory via the BLKD (block move from data memory to 
data memory) instruction, which executes in a single cycle 
when used in the repeat mode with the repeat counter having 
a maximum count of 256. 


IMPLEMENTING LARGE FFT’S 


Figure 23 shows the memory configurations and 
transfers for a 1024-point complex FFT computed as four 
256-point complex FFTs. A kernel 256-point complex FFT 
can operate on a group of 256 complex points at one time 
using on-chip RAM. Data transfers between on-chip and off- 
chip RAM are efficiently performed via the RPTK and 
BLKD instructions. 


CIRCULAR BUFFER 


ONE 


1/0 CHANNELS TO BE 


END POINTER 


COMPLETE 
FRAME OF DATA 


TRANSFORMED 


ON-CHIP DATA 
MEMORY OF 
512 WORDS 


Figure 22. Use of On-Chip Memory to Speed FFT Execution 


2K-WORDS OFF-CHIP RAM 


256 COMPLEX POINTS 
256 COMPLEX POINTS 


COMPLEX FFT 
256 COMPLEX POINTS EXECUTED 4 TIMES 
256 COMPLEX POINTS TO FORM 
BLOCK 1024-POINT 
MOVES COMPLEX FFT 


512-WORDS ON-CHIP RAM 


KERNEL 256-POINT 


Figure 23. Execution of a 1024-Point Complex FFT with On-Chip RAM 
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Figure 24 shows a more detailed block diagram of a as a subroutine for this purpose. Appendix D contains a 
1024-point radix-2 complex FFT. It can be seen that 512 listing for the 1024-point complex FFT performed with the 
butterflies must be performed at each stage. The first eight help of on-chip RAM. However, due to the size of the 
stages have a total of 4096 butterflies computed by four 1024-point FFT program, the user may find it necessary to 
256-point FFTs. The 256-point FFT in Appendix C is used subdivide the code into smaller sections prior to assembly. 


STAGES 1-8 STAGE 9 STAGE 10 


x(0) —— 256-POINT X(0) 
. COMPLEX FFT 
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x(255) BUTTERFLIES 
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Figure 24. A 1024-Point Complex FFT Using a 256-Point Kernel for Stages 1-8 
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HIGHER-RADIX FFT’S 


The same decomposition principle as for radix-2 FFT 
algorithms applies to higher radices as well. Table 3 shows 
the computation requirements’ for a 4096-point FFT using 
various radices that are a power of two. : 

The main benefit of using higher-radix algorithms is 
the reduced amount of arithmetic operations required for the 
FFT computation. Beyond the use of radix-8 algorithms, 
however, the point of diminishing returns rapidly approaches. 
Memory addressing, data scaling, and program control 
become more and more complicated. On the other hand, a 
suitable combination of the radix-2, radix-4, and radix-8 
algorithms becomes a flexible and efficient ‘‘mixed-radix’”’ 
algorithm for most FFT applications. Reference [4] contains 
some useful FORTRAN routines for higher-radix FFT 
algorithms. 


REAL TRANSFORMS VIA COMPLEX FFT’S 


In practice, many signals are real functions of time, 
whereas the FFT algorithm has been derived for complex 
signals. This means that for real inputs, the imaginary parts 
of the complex entries are simply set to zero. This results 
in a certain amount of redundancy. To utilize the bandwidth 
of the FFT algorithm more effectively, one can use the fact 
that the frequency spectrum of a real signal is a hermitian 
function (i.e., the real part is an even function while the 
imaginary part is an odd function).!1 For example, two 
N-point real FFTs can be computed simultaneously with a 
single N-point complex FFT.8.9,10 On the other hand, a 
single N-point complex FFT can also be utilized effectively 
to perform a 2N-point real FFT.10 Such algorithms 
substantially reduce the amount of computation required for 
real FFTs. 

Other efficient algorithms exist that further utilize the 
properties of real signals. In particular, the FFTs of four N- 
point real symmetric (even) and antisymmetric (odd) 
sequences can be computed with just one N-point complex 
FFT.2 Alternatively, the same N-point complex FFT can be 
used to compute the FFTs of four N-point real symmetric 
(even) sequences simultaneously .2 


INVERSE FFT 


The inverse FFT is given by the equation 


N-1 
x(n) = (IN) YY X(K) WO™ n=0,1,...,N-1 (20) 
k=0 


where X(k) is the Fourier transform of the time-domain signal 
x(n). Note that (20) is essentially the same equation as (1), 
which represents the ‘‘forward’’ FFT, with two important 
differences: the scaling factor is (1/N), and the exponent of 
the twiddle factor is the negative of the one in equation (1). 
Because of the similarity between (20) and (1), the 
implementation of the inverse FFT is very straightforward. 
The code can be derived from that given in the appendices 
by applying the following modifications. 

If the forward FFT is implemented with scaling, the 
resulting values in the frequency domain are (1/N)X(k) and 
not X(k). Hence, for the inverse FFT, no scaling must be 
applied in order to get back the original signal. On the other 
hand, if the forward FFT has not been scaled, the inverse 
FFT must be scaled. This scaling can be performed all at 
one point as suggested by (20), or at every stage as described 
earlier in the forward FFT. 

The negative exponent of the twiddle factors implies 
that the values of sin(X) will have the opposite sign from 
that in the forward FFT. Therefore, one way to implement 
the inverse FFT is to have an additional table with the 
negatives of sin(X). Another method is possible if the 
complex conjugate of (20) is considered. 


N-1 
xn) = (UN) YX) WO n=0,1,....N-1 (21) 
k=0 


In (21), the asterisk indicates complex conjugate. In 
this form, there is no need to have an additional table for 
sin(X). Instead, the inverse FFT is implemented by applying 
the forward FFT on the complex conjugate of X(k) (with 
appropriate scaling). The complex conjugate of the resulting | 
time signal is the desired result. Note that if x(n) is real, this 
last step is not necessary since, in this case, x(n) = x(n). 


Table 3. Computational Requirements for Higher-Radix FFT Algorithms 


RADIX — 2 (N=212) 


RADIX-4 (N=48) 
RADIX—8 (N=84) 


RADIX-16 (N= 163) 


ALGORITHM NUMBER OF REAL NUMBER OF REAL 
MULTIPLICATIONS ADDITIONS 


139,266 


126,978 


126,978 


125,442 
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Table 4. FFT Performance for a TMS32020 Implementation 


FFT 


ALGORITHM | SIZE TYPE 


FFT PERFORMANCE TIMING 


Table 4 provides the FFT timing performance for the 
TMS32020 code in the appendices. The source code 
examples included in Appendices C through G are not 
optimized for any specific application since they have been 
designed to emphasize clarity rather than code optimization. 
The key feature of these codes is that they do not require 
any scratch-pad (temporary) memory locations. 
Consequently, these codes should be useful in memory- 
critical applications. For time-critical applications, the codes 
can be optimized for better execution time. Higher execution 
speed is achieved by using straightline instead of looped code. 
The tradeoff for this optimization is the larger program 
memory requirements of the straightline code. 


DESCRIPTION OF THE APPENDICES 


At the end of this report, there are five appendices with 
TMS32020 code implementing several FFTs. The contents 
of the appendices are the following: 


Appendix A: N-point, radix-2, DIT FFT (9 macros) 

Appendix B: N-point, radix-2, DIT FFT using indirect 
addressing (7 macros) 

Appendix C: 256-point, radix-2 DIT FFT 

Appendix D: 1024-point, radix-2 DIT FFT 

Appendix E: 256-point, radix-4 DIF FFT 

Appendix F: 128-point, radix-2 DIF FFT (looped code) 

Appendix G: 256-point, radix-2 DIF FFT (looped code) 


SUMMARY 


The purpose of this report has been to develop an 
understanding of the underlying principles in FFT 
implementations with the TMS32020 processor. The book 
by Burrus and Parks* contains examples of FFT 
implementations on the TMS32010 processor, the first 
member of the TMS320 family. 

This report has discussed the development of the DFT 
algorithm, leading to the derivation of the FFT algorithm. 
The implementation of the radix-2 DIT FFT algorithm was 
covered in detail, and the radix-4 DIF FFT algorithm was 
also explained. Special attention was given to various FFT 
implementation aspects, such as scaling, system memory, 
and input/outpput considerations. 


RADIX-2 128-Pt Looped 21,879 4.375 ms 
RADIX-2 256-Pt Looped 42,416 8.483 ms 
RADIX-2 256-Pt Straight-Line 22,595 4.519 ms 
RADIX-2 1024-Pt Straight-Line 159,099 31.8198 ms 
RADIX-4 | _256-Pt__| Straight-line | 15,851 =| = S MHz_~—s| 3.1102 ms_—sd 


EXECUTION 


CLOCK TIME 


CYCLES 


The TMS32020 digital signal processor offers many 
advantages for the implementation of FFT algorithms. Its 
200-ns cycle time and special features, such as the single- 
cycle multiplication, allow high execution speed. The 544 
16-bit words of on-chip memory permit the implementation 
of a 256-point complex FFT without access to external 
memory, thus further reducing execution time. Furthermore, 
special instructions, such as RPTK and BLKD, allow the 
quick transfer of data from external to internal memory, so 
that portions of large FFTs can be implemented with the on- 
chip RAM. Due to the flexibility of the TMS32020, the 
designer can trade-off program memory with execution 
speed. 
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APPENDIX A 


RAKKKKAKKKAKKKKAKKEEKKAKKEREKEKEKAREKREEEREKREEEEEEREKK KKK RAK 


THIS TMS32020 FFT MACRO LIBRARY CONTAINS A TOTAL OF 9 
MACROS FOR IMPLEMENTING A GENERAL RADIX-2 DIT N-POINT 
FFT. ALL DIT BUTTERFLIES ARE IMPLEMENTED WITH DYNAMIC 
SCALING TO AVOID ARITHMETIC OVERFLOWS. TWO'S-COMPLEMENT 
FIXED-POINT FRACTIONAL ARITHMETIC IS USED THROUGHOUT. 
WHILE THESE MACROS ARE NOT NECESSARILY OPTIMIZED FOR 
SPEED, THEY ARE SO STRUCTURED THAT NO INTERMEDIATE OR 
TEMPORARY REGISTERS ARE REQUIRED FOR THEIR EXECUTION. 
CONSEQUENTLY, THESE MACROS ARE PARTICULARLY USEFUL IN 
APPLICATIONS WHERE MEMORY SPACE PROVES TO BE CRITICAL. 
IN ADDITION, ALL FFT COMPUTATIONS ARE PERFORMED 
IN-PLACE AND THE REAL AND IMAGINARY PARTS OF ALL COMPLEX* 
INPUTS ARE ASSUMED TO BE IN CONSECUTIVE DATA MEMORY x 
LOCATIONS. x 
k 
k 
KKKKKKKKKKKKKKKREKKKREKEKKEKEKKKEKKEKEREKRERREREKKEKKERKKKKKKK 
* k 
KKRKEKKEKKREKKEKKKKKKKKKRKKEKKKEKEKREKREKEKRKERKAK KKK ARAKK RRA 


+ * FF OF FF OF FF OF 


+ ee FF + + EF HF FOF HF HH HF HF FH 


x * 
x K *x 
x Pp. - P= P + QxW * 
x . ' N x 
* . : x 
x ~ : x 
x . ' * 
* ie] * 
x ' by x 
x ' > x 
* ' K , x 
x ' oW ° ‘ K x 
x Q N Q= P - QxW x 
x N x 
* k 
x * 
* NOTATION FOR IN-PLACE RADIX-2 DIT FFT BUTTERFLY x 
x x 


KKKKKKKEKKKEKERKKEKKKKERKKEKKKKEKKKKKEKKKEEKKREKKEKREKRKEKKKKKKKR 
* * 
KHKKKKKKKKEKKKEKIKKKEKEKKREKKKEKEKEKKEKKEKKEKEKERKEREREKKKKAKKKKK 


* 
x M 
* IN A RADIX-2 N-POINT FFT, THERE ARE M PASSES WHERE N =2 
* WITH [N/2] 2-POINT BUTTERFLIES PER PASS, GIVING A TCTAL 
* OF [N/2]LOG N BUTTERFLIES PER FFT. 

* 2 

* 


+ + Fe HF + 


KEKKKKKKKKKEKKKEKKKEARKERKEKKKEKERERERKEKEREKRERERERERERERERKEKK KA 


KEIAIREKAIKIRIKIKR IRI RARER KIARA IAAI EER ERK 


MACRO 1: W =1, k=0. 
N 


P=(PR+jPI) P+Q=(PR+QR) +3 (PI+QI) 
\/ 
/\ 

Q=(QR+}QI) P-Q=(PR-QR)+j(PI-QI) 


-j(2(pi)/N)k 
W =e =COS[(2(pi)/N)k}~jSIN[(2(pi)/N)k] 
N 
=WR+jWI 


=1 


ALL OUTPUT SAMPLES ARE SCALED DOWN BY 2 TO ACCOMMODATE 
A 1-BIT OVERFLOW. HOWEVER, NO OVERFLOWS WILL OCCUR 
FOR FRACTIONAL INPUTS OF THE FORM X : -1 <= X<1l. A 
TOTAL OF 10 INSTRUCTIONS IS USED. EXECUTION 

TIME IS EQUAL TO 10 MACHINE CYCLES. 


+ * FFF FF FOF FO OO 
a a ee ee ee 


HRA RIKER IKARIA ARR RI III IRE 
ZERO SMACRO PR,PI,QR,QI 
* 


x CALCULATE Re[P+Q] AND Re[P-Q] 
* 
LAC :PR:,15 ACC := (1/2) (PR) 
ADD :QR:,15 ACC := (1/2) (PR+QR) 
SACH :PR: ™ PR := (1/2) (PR+QR) 
SUBH :QR: ACC := (1/2) (PRt+OR)- (QR) 
SACH :QR: QR := (1/2) (PR-QR) 


+ + 


CALCULATE Im[P+Q] AND Im[P-Q] 


LAC :PI:,15 Acc := (1/2) (PI) 

ADD :Q1:,15 Acc := (1/2)(PI+QI) 

SACH :PI: PI := (1/2)(PI+QI) 

SUBH :Q1: ACC := (1/2) (PI+QI)-(QT) 
SACH :QI: QI. := (1/2) (PI-QI) 

SEND 
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RAKKEEKKKREKRREKEREKEERKRERRERKREREKEERERERKREREKEEERREERKERKARKK 


MACRO 2: W =-1, k=N/2. 
N 


P=(PR+jP1) P-Q=(PR-QR)+j (PI-QI) 
\/ 
/\ 

Q=(QR+35QI) P+Q=(PR+OR)+j (PI+QI) 


~}(2(pi)/N)k 
a =COS[(2(pi)/N)k]-JjSIN([(2(pi)/N)k] 
=WR+jWI 


=-] 


ALL OUTPUT SAMPLES ARE SCALED DOWN BY 2 TO ACCOMMODATE 
A 1-BIT OVERFLOW. HOWEVER, NO OVERFLOWS WILL OCCUR 
FOR FRACTIONAL INPUTS OF THE FORM X : -1 <= X <1. A 
TOTAL OF 10 INSTRUCTIONS IS USED. EXECUTION 

TIME IS EQUAL TO 10 MACHINE CYCLES. 


+e ee ee FH eH HH HHH HH ee He OH 
++ ee ee eH He HH HH HH FH HH HF HF FH OH HH OH 


RRAERRKRERRRERIEIRIIR AISI IPRA AISI ISAIAEIIIAI IISA IA II IIA 
PI SMACRO PR,PI,QR,QI 
* 


* CALCULATE Re[P+Q] AND Re[P-Q] 

* 
LAC :PR:,15 ACC := (1/2)(PR) 
SUB :QR:,15 ACC := (1/2) (PR-QR) 
SACH :PR: PR := (1/2) (PR-QR) 
ADDH :QR: ACC := (1/2) (PR-QR)+(QR) 
SACH :QR: QR := (1/2) (PR+QR) 

* 

4 CALCULATE Im[{P+Q] AND Im[P-Q] 

* 
LAC :P1:,15 ACC := (1/2)(PI) 
SUB :QT:,15 ACC := (1/2)(PI-QI) 
SACH sPI: PI := (1/2) (PI-QI) 
ADDH :QI: ACC := (1/2) (PI-QI)+(QI) 
SACH :QI: QI := (1/2) (PI+QI) 
SEND 


KKKKKAKKEKREKRKKAKREKRERKEEKEKRERERERREEREERKREREEKREKRKKKK KAKA KKK 


MACRO 3: W =-j, k=N/4. 
N 


P=(PR+jPI) P-jQ=(PR+QI)+j (PI-QR) 
\/ 
/\ 
Q=(QR+jQI) P+jQ=(PR-QI)+j (PI+QR) 


-3(2(pi)/N)k 
W =e =COS[(2(pi)/N)k)-jSIN[ (2(pi)/N)k]) 
N 
=WR+jWI 


=-j 


ALL OUTPUT SAMPLES ARE SCALED DOWN BY 2 TO ACCOMMODATE 
A 1-BIT OVERFLOW. HOWEVER, NO OVERFLOWS WILL OCCUR 
FOR FRACTIONAL INPUTS OF THE FORM X : -1 <= X<1. A 
TOTAL OF 11 INSTRUCTIONS IS USED. EXECUTION 

TIME IS EQUAL TO 11 MACHINE CYCLES. 


WARNING: THIS MACRO REQUIRES THE INPUT SAMPLES QR AND 
QI TO BE IN CONSECUTIVE DATA MEMORY LOCATIONS 
OF ASCENDING ORDER. THE FOLLOWING STEPS ARE 
USED TO IMPLEMENT THIS MACRO: 


(1) [PI-QR} ----- > [PI] 
(2) [PI+QR] ----- > [QR} 
(3) [PR+QI] ----- > [PR] 
(4) [PR-QI] ----- > [Acc] 
(5) (QR) = ----- > (QT) 
(6) [ACC] ----- > [QR} 


+e + ee ee He He HH + HH HF He He HH HH SE HH HH eH ee 
a a ee ee ee ee eo 


REKKKRAAKAREREAERERR IIIA AIIISIAIAIIIISD ISIS SII AISI SISA IAAI 
PIBY2 $MACRO PR,PI,QR,QI 
* 


a CALCULATE Re[P+jQ] AND Re[P-3Q] 

* 
LAC :PI:,15 ACC := (1/2)(PI) 
SUB :QR:,15 ACC := (1/2)(PI-QR) 
SACH PI: PI := (1/2)(PI-QR) 
ADDH :QR: ACC := (1/2)(PI-QR)+(QR) 
SACH :QR: QR := (1/2)(PI+QR) 


* 


CALCULATE Im[{P+jQ] AND Im[{P-5Q] 


LAC :PR:,15 ACC := (1/2) (PR) 
ADD :Q1:,15 ACC := (1/2)(PR+QI) 
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SACH :PR: PR := (1/2) (PR+QI) 
SUBH :QI1: ACC := (1/2) (PR+QI)-(QI) 
DMOV :QR: QR -> QI 

SACH :QR: QR := (1/2)(PR-Q1) 
SEND 


KKKKKKEKEKEKEKRKEKKKEKRKAKKEREREKAKAKKEKRKKKEKREREKEKEEEKKRERAKKKKK 


MACRO 4: W =j, k=3N/4. 
N 


P=(PR+jPI) | P+jQ=(PR-QI)+j (PI+OR) 
\/ 
/\ 

Q=(QR+jQI) P-jQ=(PR+QI)+3 (PI-QR) 


-3(2(pi)/N)k 
W =e =COS[(2(pi)/N)k}-J3SIN[(2(pi)/N)k] 
N 
=WR+jWI 


=j 


ALL OUTPUT SAMPLES ARE SCALED DOWN BY 2 TO ACCOMMODATE 
A 1-BIT OVERFLOW. HOWEVER, NO OVERFLOWS WILL OCCUR 
FOR FRACTIONAL INPUTS OF THE FORM X : -1 <= X <1. A 
TOTAL OF 10 INSTRUCTIONS ARE USED SUCH THAT EXECUTION 
TIME IS EQUAL TO 10 MACHINE CYCLES. 


“WARNING: THIS MACRO REQUIRES THE INPUT SAMPLES QR AND 
QI TO BE IN CONSECUTIVE DATA MEMORY LOCATIONS 
ASCENDING ORDER. THE FOLLOWING STEPS ARE USED 
TO IMPLEMENT THIS MACRO: 


(1) [PI-QR] ----- > [PI] 
(2) [PI+QR] ----- > [QR] 
(3) [PR+QI] ----- > [PR] 
(4) [PR-QI] ----- > [acc] 
(5) (QR) = ----- > (QT) 
(6) [Acc] ----- > [QR] 


ee FF FF ee ee OO OO OOOOH OOOH 
+ ee ee eH FH FH HH HH HH HH OH OH HH OK OH OH OOO 


HKKKKKKKARKKAKKKKRERRRIERRIERER IRAE RRA IKIREREIERRIA 
PI3BY2 $MACRO PR,PI,QR,QI 
; 


x CALCULATE Re[P+jQ] AND Re[P-jQ] 

* 
LAC :PI:,15 ACC := (1/2)(PI) 
ADD :QR:,15 ACC := (1/2) (PI+QR) 
SACH :PI: PI := (1/2) (PI+QR) 
SUBH :QR: ACC := (1/2) (PI+QR)-(QR) 


SACH :QR: QR (1/2) (PI-OR) 


* CALCULATE Im[P+jQ] AND Im[P-jQ] 


LAC :PR:,15 ACC := (1/2)(PR) 

SUB :Q1:,15 ACC := (1/2) (PR-QI) 
SACH :PR: PR := (1/2) (PR-QI) 
ADDH :Q1: ACC := (1/2) (PR-QI)+(QT) 
DMOV :QR: QR -> QI 

SACH :QOR: QR := (1/2) (PR+tQI) 
SEND 


KAKKEKKERERKEKEKRKKEKEREKREREEREEREKEERREREREKEREREEKKRKKKKKKKKKRAKK 
MACRO 5: k=N/8. 
P=(PR+jPI)  P+QaW=(PR+Re[QawW])+j (P1l+Im[QawW]) 
, 
Q=(QR+jQI)  P-QseW=(PR-Re[QaW])+j (PI-Im[QxwW]) 
-j(2(pi)/N)k 


W =e =COS((pi)/4)-jSIN( (pi) /4)=WR+jwWI 
N 


LET W=!COS( (pi) /4) }=!SIN((pi)/4) | 
THEN [QxW]=(QR+QI) xW+j (QI-QR) xW 
Re [Q*W ]=(QI+QR) xW 


Im[Q*W ]}=(QI-QR) aW 


ALL OUTPUT SAMPLES ARE SCALED DOWN BY 2 TO ACCOMMODATE 
A 1-BIT OVERFLOW. HOWEVER, NO OVERFLOWS WILL OCCUR 
FOR FRACTIONAL INPUTS OF THE FORM X : -1 <= X <1. A 
TOTAL OF 20 INSTRUCTIONS ARE USED SUCH THAT EXECUTION 
TIME IS EQUAL TO 20 MACHINE CYCLES. THIS MACRO 
REQUIRES W TO BE THE ABSOLUTE VALUE (MAGNITUDE) OF 
COS((pi)/4) AND SIN((pi)/4). THE SIGNS OF THESE TRIG 
FUNCTIONS HAVE BEEN TAKEN CARE OF IN THE CODE. 


eee eH eH EH HF OH OF Ke HO OH HO OH OO OH OOOO OOD 
+e eee eH He eH eH He He He HE eH HE He Ee HH Ee 


HRAKAREKEKKKKARERAKER ERR IAAI RAR ARIERERIERREREREREKRERARER 
PIBY4 SMACRO PR,PI,QR,QI,W 
* 


LT 3W: T-REGISTER :=W=COS(PI/4)=SIN(PI/4) 
LAC :Q1:,14 ACC := (1/4)(QI) 

SUB :Q0R:,14 ACC := (1/4) (QI-OR) 

SACH :QI:,1 QI := (1/2) (QI-QR) 

ADD :QR:,15 ACC := (1/4) (QI+QR) 

SACH :QR:,1 QR := (1/2)(QI+QR) 

LAC :PR:,14 ACC := (1/4) (PR) 

MPY :QR: P-REGISTER := (1/4) (QI+QR)*xW 

APAC ACC := (1/4) [PR+(QI+QR)*W] 


SACH :PR:,1 PR := (1/2){PR+(QI+OR)«W] 
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L6 


SPAC ACC := (1/4) (PR) 

SPAC ACC := (1/4) [PR-(QI+QR)«W] 
SACH :QR:,1 QR := (1/2)[PR-(QI+0R)*W] 
LAC :PI:,14 ACC := (1/4)(PI) 

MPY :QT: P-REGISTER := (1/4) (QI-QR) «Ww 
APAC ACC := (1/4) [PI+(QI-QR)«W] 
SACH :PI:,1 PI := (1/2)[PI+(QI-QR) Ww] 
SPAC ACC := (1/4)(PI) 

SPAC ACC := (1/4) [PI-(QI-QR)*W] 
SACH :QI:,1 QI := (1/2)[PI-(QI-QR)«W] 
SEND 


HAKKRKAKKKAAKKRERKAEREKIEKERAR ERE EKER KAEKEKRARKEKAEKREIKERERARAKEKR 
MACRO 6: k=3N/8. 

P=(XR+jXI) X+QaW=(XR+Re [QxW] )+5 (XI-Im[Q*W] ) 
\/ 
/\ 
Q=(QR+jQI) X-QxW=(XR-Re (QawW])+j (XI+Im[QxW]) 


-3(2(pi)/N)k 
W =e =COS (3(pi)/4)-jSIN(3(pi)/4) 


=WR+jWI 
LET W={COS(3(pi)/4)|=|SIN(3(pi)/4) | 
THEN [QxW]=(QI-QR)*W-3 (QI+OR) aW 
RE [Q4W]=(QI-OR) AW 


IM[Q*W )=(QI+QR) xW 


ALL OUTPUT SAMPLES ARE SCALED DOWN BY 2 TO ACCOMMODATE 
A 1-BIT OVERFLOW. HOWEVER, NO OVERFLOWS WILL OCCUR 

FOR FRACTIONAL INPUTS OF THE FORM X: -1 <= X<1. A 
TOTAL OF 20 INSTRUCTIONS ARE USED SUCH THAT EXECUTION 
TIME IS EQUAL TO 20 MACHINE CYCLES. THIS MACRO 
REQUIRES W TO BE THE ABSOLUTE VALUE (MAGNITUDE) OF 
COS(3(pi)/4) AND SIN(3(pi)/4). THE SIGNS OF THESE TRIG 
FUNCTIONS HAVE BEEN TAKEN CARE OF IN THE CODE. 


i eS 
ee ee a a a a a a a SS 


RHRREKEREAKAEEKRARERARRRREREREREK AKIRA IIKIERERERERAEERERERERER 
PI3BY4 $MACRO PR,PI,QR,QI,W 
* 


LT :W: T-REGISTER :=W=COS(PI/4)=SIN(PI/4) 
LAC :Q1:,14 ACC := (1/4)(QI) 

SUB :QR:,14 ACC := (1/4)(QI-QR) 

SACH :QI:,1 QI := (1/2) (QI-QR) 

ADD :QR:,15 ACC := (1/4) (QI+QR) 


SACH :QR:,1 QR := (1/2)(QI+QR) 


LAC :PR:,14 ACC := (1/4)(PR) 

MPY :QI: P-REGISTER := (1/4)(QI-QR)xW 
APAC ACC := (1/4) [PR+(QI-QR)«wW] 
SACH :PR:,1 PR := (1/2) [PR+(QI-QR)«W] 
SPAC ACC := (1/4)(PR) 

SPAC ACC := (1/4)[PR-(QI-QR)«W] 
MPY :QR: P-REGISTER := (1/4)(QI+QR) AW 
SACH :QR:,1 QR := (1/2) [PR-(QI-QR) *W] 
LAC :PI:,14 ACC := (1/4)(PI) 

SPAC ACC := (1/4) [PI-(QI+QR)aW] 
SACH :PI:,1 PI := (1/2) [PI-(QI+QR)«W] 
APAC ACC := (1/4) (PI) 

APAC ACC := (1/4) (PI+(QI+QR)«W] 
SACH :Q1:,1 QI := (1/2) [PI+(QI+QR) «W] 
SEND 


RAKKKKKKKKKKKKKKKKKKKKKRKKKKK KRHA RRR KK 
MACRO 7: A GENERAL RADIX-2 DIT FFT 'BUTTERFLY' 
P=(PR+jPI) P+QxW=(PRtRe[Q*W])+j (PI-Im[Q*W]) 

wd 
/\ 
Q=(QR+5QI) P-QaW=(PR-Re [Qa ] )+j (PI+Im[QxW] ) 


+ ee FF OF OF OF 


-j(2(pi)/N)k 
W =e =COS(0)-jSIN(O) 
N 
=WRtjWI 


ALL OUTPUT SAMPLES ARE SCALED DOWN BY 2 TO ACCOMMODATE 
A 1-BIT OVERFLOW. HOWEVER, NO OVERFLOWS WILL OCCUR 
FOR FRACTIONAL INPUTS OF THE FORM X : -1 <= X< 1. A 
TOTAL OF 23 INSTRUCTIONS ARE USED SUCH THAT EXECUTION 
TIME IS EQUAL TO 23 MACHINE CYCLES. THIS MACRO 
REQUIRES W TO BE THE ABSOLUTE VALUE (MAGNITUDE) OF 
COS(O) AND SIN(O). THE SIGNS OF THESE TRIG FUNCTIONS 
HAVE BEEN TAKEN CARE OF IN THE CODE. 


+e ee eH eH Fe Oe FH OF OH HO OO OO 


+e + * +e eee ee 


* 
RERKKKAIRKE RIKKI IARI IIKRIIAI IIA IIIS IAAI IIA IIA A 
NORM1 SMACRO PR,PI,QR,QI,WR,WI 

* 


x CALCULATE QR*WR + QI*WI AND STORE RESULT IN QI 
x 
LT =:QI: T-REGISTER := QI 
MPYK :WI: P-REGISTER := (1/16) (QI*WI) 
LTP :QR: ACC := (1/16)(QI*WI); LOAD T WITH QR 
MPYK :WR: P-REGISTER := (1/16) (QR*WR) 
APAC ACC := (1/16) (QRAWR+QI*WI ) 
SFR ACC := (1/32) (QRAWR+QI*WI) 
SACH :QR:,4 QR := (1/2) (QRAWR+QIAWI) 
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* MPY :WI: P-REGISTER := (1/2) (QI*WI) 


fk CALCULATE QI*WR - QRAWI AND STORE RESULT IN QR APAC ACC := (1/2) (QRAWR+QI WT) 
* MPY :WR: P-REGISTER := (1/2) (QI*WR) 

MPYK -:WI: P-REGISTER := (1/16) (-QR«WI) LT = :QR: LOAD T-REGISTER WITH OR 

LTP :QI: ACC := (1/16)(-QR&WI); T-REGISTER := QI SACH :QR: QR += (1/2) (QRAWR+QI*WI) 

MPYK :WR: P-REGISTER := (1/16) (QI*WR) e 

SFR ACC := (1/32) (QI*WR-QR«WI) x _ 

SACH :QI:,4 QI := (1/2) (QI*WR-QR&WI) PAC ACC= (1/2) (QI*WR) 
x MPY  :WI: P-REGISTER := (1/2) (QRAWI) 
tk CALCULATE Re[P+}Q] & Re[P-jQ] STORE RESULTS in PR & OR SPAC ACC= (1/2) (QIAWR-QR*WI) 
* SACH :QI: QI = (1/2) (QIAWR-QRAWI) 

LAC :PR:,14 ACC := (1/4)PR * : 

ADD :QR:,15 ACC := (1/4) [PR+(QR*WR+Q1*WI) } k CALCULATE Re[Pm+1] & Re[Qm+1] STORE RESULTS in PR & QR 

SACH :PR:,1 PR := (1/2) [PR+(QRAWR+QIAWI) ] x 7 

SUBH :QR: ACC := (1/4) [PR-(QR*WR+QI4WI) ] LAC :PR:,14 = ACC= (1/4)PR 

SACH :QR:,1 QR := (1/2) [PR-(QRAWR+QIAWI) ] ADD :QR:,15 ACC= (1/4) (PR+(QR*WR+QI4WI) } 
* SACH :PR:,1 PR = (1/2) [PR+(QR*WR+QIAWI ) ] 
* CALCULATE Im[P+jQ] & Im[{P-jQ] STORE RESULTS in PI & QI SUBH :QR: ACC= (1/4) [PR-(QRAWR+QIAWT) ] 
* SACH :QR:,1 QR = (1/2) [PR-(QR*WR+QIWI) } 

LAC :PI:,14 ACC := (1/4)PI ai 

SACH :PI:,1 PR := (1/2) [PI+(QI*WR-QR#WI) ] x 7 

SUBH :QI: ACC := (1/4) [PI-(QI*WR-QR&WI) ] LAG aFie 18 ACC= (1/4)PI 

SACH :QI:,1 QI := (1/2) [PI-(QI*WR-QR*WI) ] ADD :QI:,15 ACC= (1/4) [PI+(QI*WR-QRAWT) ] 

SEND SACH :PI:,1 PI = (1/2) [PI+(QI*WRRORAWI) ] 

SUBH :QI: ACC= (1/4) [PI-(QI*WR-QRAWI) } 
SACH :QI:,1 QI = (1/2) [PI-(QI*WR-QR&WI) } 

KEKKKEKKEKKEKREKKKEEEKAKKKKKKKKKK KKK KAKA SEND 
x x 
k MACRO 8: A GENERAL RADIX-2 DIT FFT 'BUTTERFLY' 
* * 
* P=(PR+jPI) P+QaW=(PR+Re [Qaw ] )+j (PI-Im[Q*W]) * PTS. 4-4-4. 4. £22 tt tstetetet tetatatetatatakatatetatetotetatetetatatstelatatetatatetetatatetatetatatatetatatatotel 
* \/ * x x 
k /\ * * MACRO 9: * 
* Q=(QR+jQI) P-QxW=(PR-Re[QxW])+j (PI+Im[QaW]) x * is 
A ; k A A! Bl * 
7 -3(2(pi)/N)k ‘ ‘i as a " 
x W =e =COS (0)-jSIN(0) * * / \ \/ B2 * 
* N * * B B ! \ / \ / x 
* =WR+jWI x * c c'i/\/ \ ‘ 
* * x \/ /\ B3 * 
* * * / \ / \ * 
x ALL OUTPUT SAMPLES ARE SCALED DOWN BY 2 TO ACCOMMODATE x * D D' Ba * 
* A 1-BIT OVERFLOW. HOWEVER, NO OVERFLOWS WILL OCCUR x * x 
* FOR FRACTIONAL INPUTS OF THE FORM X: -1 <= X<1. A & x * 
* TOTAL OF 22 INSTRUCTIONS ARE USED SUCH THAT EXECUTION x * A = R1+j11 * 
* TIME IS EQUAL TO 22 MACHINE CYCLES. THIS MACRO k * x 
x REQUIRES W TO BE THE ABSOLUTE VALUE (MAGNITUDE) OF * * B = R2+j12 - 
* COS(0) AND SIN(O). THE SIGNS OF THESE TRIG FUNCTIONS x * . : 
* HAVE BEEN TAKEN CARE OF IN THE CODE. x * C = R3+j13 e 
x * * 
KKKKKKKKKEKRERKEKKKEEREKEKKKAAKKKKKKKKKKKR RAK aR KKK x D= R4+) 14 = 
NORM2 $MACRO PR,PI,OR,QI,WR,WI * . 
* * A' = (R1+R2) + j(I1+1I2) * 
* CALCULATE QRAWR + QI*WI AND STORE RESULT IN QR . if 
* x B' = (R1-R2) + j(I1-12) * 

LT —_:QR: T-REGISTER := QR * 3 

MPY :WR: P-REGISTER := (1/2) (QR«WR) * C' = (R3+R4) + j(R3+R4) . 

LTP :Q1: ACC := (1/2) (QR*WR); T-REGISTER := QI . % 
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D! 


B2 = B'- 


B3 


REAL(B1) 
IMAG(B1) 


REAL (B2) 
IMAG(B2) 


REAL (B3) 
IMAG (B3) 


REAL (B4) 
IMAG (B4) 


++ * + + FH HF HH HF HF FH HF EH HF HE FH HF HF HK HF HF HF HF HF HF HF FH Fe 


(R3-R4) + j(R3-R4) 


Bl = A‘+C! 


jD' 


A'-C! 


B4 = B'+jD' 


nou 


((R1+R2) + (R3+R4)) /2 
((I1+I2) + (13+14)) /2 


((R1-R2) + (13-14)) /2 
((I1-12) - (R3-R4)) /2 


((R1+R2) - (R3+R4)) /2 
((11412) - (13+14)) /2 


((R1-R2) - (13-14)) /2 
((11-12) + (R3-R4)) /2 


THE FIRST TWO STAGES OF A RADIX-2 N~POINT DIT FFT CAN BE 
IMPLEMENTED WITH A SPECIAL RADIX-4 BUTTERFLY WHICH HAS A 
UNITY TWIDDLE FACTOR USED TO SPEED UP THE EXECUTION TIME 
OF THE FFT WITH THE ABOVE EQUATIONS. 
EQUATIONS, ALL INPUT VALUES MUST BE WITHIN THE RANGE 
-1 <= X < 1.0. TOTAL NUMBER OF INTRUCTIONS IS 37. 

EXECUTION TIME IS EQUIVALENT TO 39 MACHINE CYCLES. 


WHEN USING THESE 


+ eee ee eee ee ee eH ee eH HH eH He eH eH OD 


REKREKRKEREREKRERAIAKKK KKK KKK RK 


RAEKKKEKKREKREREKERERERERERKERERERERRRREREREKKKKKKKRKKKKKRRKKRK 


THE FOLLOWING STEPS ARE USED TO IMPLEMENT THE SPECIAL 
‘COMBO' FOR THE FIRST TWO STAGES OF AN 


RADIX-4 MACRO 
N-POINT RADIX- 


R3 (R3+R4)/1 
13 (13+14)/1 


R4 (R3-R4)/1 
14 (13-14)/1 


+e + + + * HF HH HF HSH FH He HF HF FH HH KF HF 
- 
- 
Lo) 
- 


x 


2 DIT FFT. 


(R1+R2)/1 
(11+12)/1 


[(R1-R2)+(13-14) ]/2 
((I1-12)-(R3-R4) ]/2 


(R3+R4)/1 
(13+I4)/1 


[ (R1-R2)- (13-14) ]/2 
R4--->14:=(R3-R4)/1 
[ (11-12)+(R3-R4) ]/2 


[ (R1+R2)+(R3+R4) ]/2 
{(11+12)+(13+14) }/2 


{ (R1-R2)+(13-14) J/2 
{(11-12)-(R3-R4) J/2 


{ (R1+R2)-(R3+R4) J] /2 
[(11+12)-(13+14)}/2 


((R1-R2)-(13-14)]/2 


((11-12)+(R3-R4) J/2 


eee ee ee eee HF HHH eS 


RREAIRERIKKAAEAAIIIIIIAA IAAI IDI III ISAS IIIS AISI IAI SSSI ISAS IAAI 
R1,I1,R2,12,R3,13,R4,14 


COMBO $MACRO 


» 


»* 


* 


CALCULATE PARTIAL 


LAC :R3:,14 
ADD :R4:,14 
SACH :R3:,1 
SUB :R4:,15 
SACH 2R4:,1 
LAC :13:,14 
ADD :14:,14 
SACH :13:,1 
SUB :14:,15 
SACH :14:,1 
CALCULATE PARTIAL 
LAC :R1:,14 
ADD :R2:,14 
SACH :R1:,1 
SUB :R2:,15 
ADD :14:,15 
SACH :R2: 
SUBH I4: 
DMOV :R4: 
SACH :R4: 
LAC :11:,14 
ADD :12:,14 
SACH :I1l:,1 
SUB :12:,15 
SUB :14:,15 
SACH :12 
ADDH I4: 
SACH 14: 


CALCULATE PARTIAL 


LAC :R1:,15 
ADD :R3:,15 
SACH :R1: 
SUBH :R3: 
SACH :R3: 
LAC :I1:,15 
ADD :13:,15 
SACH :Il: 
SUBH 213: 
SACH :13: 
SEND 


TERMS 


FOR R3,R4,I13 AND 14 


u 


(1/4) (R3) 

(1/4) (R3+R4) 

(1/2) (R3+R4) 

(1/4) (R3+R4)- (1/2) (R4) 
(1/2) (R3-R4) 

(1/4) (13) 

(1/4) (13+14) 

(1/2) (13+14) 

(1/4) (13+14)-(1/2) (14) 
(1/2) (13-14) 


FOR R2,R4,12 AND I4 


eo 68 le 


oe oe “- oe oe eo 
tune tt oon nt net noe 


(1/4) (R1) 

(1/4) (R1+R2) 

(1/2) (R1+R2) 

(1/4) (R1+R2)- (1/2) (R2) 
(1/4) [(R1-R2)+(13-14) ] 
(1/4) [(R1-R2)+ (13-14) } 
(1/4) [(R1-R2)- (13-14) ] 
R4 = (1/2) (R3-R4) 
(1/4) [ (R1-R2)- (13-14) ] 
(1/4) (11) 

(1/4) (11+12) 

(1/2) (11412) 

(1/4) (11412) -(1/2) (12) 
(1/4) [ (11-12) -(R3-R4) ] 
(1/4) [(11-12)-(R3-R4) } 
(1/4) [(11-12)+(R3-R4) ] 
(1/4) [(11-12)+(R3-R4) } 


FOR R1,R3,11 AND 13 


(1/4) (R1+R2). 

(1/4) [(R1+R2)+(R3+R4) ) 
(1/4) { (R1+R2)+(R3+R4) } 
(1/4) [(R1+R2)-(R3+R4) } 
(1/4) [ (RI1+R2) -(R3+R4) } 
(1/4) (11+12) 

(1/4) [(11+12)+(13+14) } 
(1/4) [(11+12)+(13+14) ] 
(1/4) { (11+12)-(13+14) } 
(1/4) [(114+12)-(13+14) } 
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FFT MACRO LIBRARY (INDIRECT ADDRESSING) 


4. Implementation of Fast Fourier Transform Algorithms with the TMS32020 
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APPENDIX _ 8B 


REKRKEKKERKEREKKKKEKKEKEREKERERERKEREREEREKKEKEKEREKKKKKKKKKKAK 


THIS TMS32020 FFT MACRO LIBRARY CONTAINS A TOTAL OF 7 
MACROS FOR IMPLEMENTING A GENERAL RADIX-2 DIT N-POINT 
FFT USING INDIRECT ADDRESS FORMAT OF THE TMS32020. ALL 
DIT BUTTERFLIES ARE IMPLEMENTED WITH DYNAMIC SCALING TO 
AVOID ARITHMETIC OVERFLOWS. 2'S COMPLEMENT FIXED-POINT 
FRACTIONAL ARITHMETIC IS USED THROUGHOUT. WHILE THESE 
MACROS ARE NOT NECESSARILY OPTIMISED FOR SPEED, THEY 
ARE SO STRUCTURED THAT NO INTERMEDIATE OR TEMPORARY 
REGISTERS ARE REQUIRED FOR THEIR EXECUTION. HENCE, 
THESE MACROS ARE PARTICULARLY USEFUL IN APPLICATIONS 
WHERE MEMORY SPACE PROVES TO BE CRITICAL. IN ADDITION, 
ALL FFT COMPUTATIONS ARE DONE IN PLACE AND THE REAL AND 
IMAGINARY PARTS OF ALL COMPLEX INPUTS ARE ASSUMED TO BE 
IN CONSECUTIVE DATA MEMORY LOCATIONS. 


* OF Fe eH ee FF Oe OH OH OH OD 
+e + Fe HHH HH HHH 


KEKEKKKAKEKKKEKREREREREEKEERKEKEREREKRKEEREREEKEKRKKEEKKKKEKEKKKEKK 
k * 
kkkkkkkkkkkkkhKkRKhkRkKkRK RRR RK KKK RR KKK 


K 


Pp. . P= P + QawW 
“x ‘ N 


NOTATION FOR RADIX-2 DIT FFT BUTTERFLY 


+e + + FF HF Fe FF HH 
, 
+e e+ FH FF FH HH HF HF HF OF 


KKKEKKEKEKEKKKKEKKKKEREKERERKEEKRERKEKEKKERKEKREEKEREREKEKREKRKKEKKKKKK 
* * 
KREKKKKEKKEKERKKEKKREREREKERERERREKREKEEKKEKREKEKKEKEREREREREKKKKER 


* * 
* Mk 
* IN A RADIX-2 N-POINT FFT THERE ARE M PASSES WHERE N= 2 * 
* WITH [N/2] 2-POINT BUTTERFLIES PER PASS, GIVING A TOTAL x 
* OF [N/2]LOG N BUTTERFLIES PER FFT. * 
x 2 * 
* * 


RKEKKKEKKEKEKKEKKEKEKKREKKEKEKKEREREREREREKREKEKKKKKRKRKRR KKK 


MACRO 1: W=1, k=0. 

P=(PR+jPI) P+Q=(PR+QR)+j (PI+QI) 
\/ 
i\ 
Q=(QR+5QI) — P-Q=(PR-QR)+j(PI-Q1) 


~j(2(pi)/N)k 
W=e =COS[ (2(pi)/N)k)-jSIN[(2(pi)/N)k} 


=WRtJWI =1 


ALL OUTPUT SAMPLES ARE SCALED DOWN BY 2 TO ACCOMMODATE 
A 1-BIT OVERFLOW. HOWEVER, NO OVERFLOWS WILL OCCUR 
FOR FRACTIONAL INPUTS OF THE FORM X : -1 <= X <1. A 
TOTAL OF 12 INSTRUCTIONS ARE USED SUCH THAT EXECUTION 
TIME IS EQUAL TO 14 MACHINE CYCLES. 


MACRO ENTRY CONDITION: ARP MUST POINT AT AR1 
MACRO EXIT CONDITION : ARP MUST POINT AT AR1 


MEMORY ADDRESS BIAS : BIAS FOR DATA MEMORY PAGE 4 


+e ee eH He He HH HHH OHHH HH HH 
+e + eH FF HF FH FH HH He HH HH eH HH HH 


RAKKKKAKKAEKERRAARHAK KE REKEREREK IKEA KIIKERERKRAERERRERER KAKA 
ZEROI $MACRO PR,QR,BIAS 
* 


* INITIALISE AUXILIARY REGISTERS 

* 
LRLK AR1,:PR:+:BIAS: AR1 POINTS TO PR 
LRLK AR2, :QR:+:BIAS: AR2 POINTS TO OR 


+ + 


CALCULATE Re[{P+Q] AND Re[{P-Q] 


LAC *,15,AR2 ACC := (1/2)(PR) 

ADD x,15,ARL ACC := (1/2) (PR+QR) 

SACH *+,0,AR2 PR := (1/2)(PR+OR) 

SUBH k ACC := (1/2) (PR+QR)-(QR) 
SACH x+,0,ARL QR := (1/2)(PR-OR) 


* 


CALCULATE Im{P+Q] AND Im[P-Q] 


LAC *,15,AR2 ACC := (1/2)(PI) 

ADD *,15,ARl ACC := (1/2)(PI+QI) 
SACH x-,0,AR2. PI := (1/2)(PI+QI) 

SUBH x ACC := (1/2)(PI+QI)-(QI) 
SACH z-,0,ARL QI. := (1/2)(PI-QI) 

SEND 
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col 


REKRKKKEKRKEEEKEREREREREREKEEREREREKEKRERERERERERERERERKKKKKRKREK 


e+ He ee Re FO OH OHHH OHHH 


x 


A 1-BIT OVERFLOW. 


MEMORY ADDRESS BIAS 


MACRO 2: W=-j, k=N/4. 


P=(PR+jPI) P-jQ=(PR+QI)+j(PI-QR) 


\/ 
/\ 


Q=(QR+5QI) P+jQ=(PR-QI)+j(PI+QR) 


-j(2(pi)/N)k 


ze =COS[(2(pi)/N)k]-jSIN[(2(pi)/N)k] 


=WRtIWI =-j 


ALL OUTPUT SAMPLES ARE SCALED DOWN BY 2 TO ACCOMMODATE 
HOWEVER, NO OVERFLOWS WILL OCCUR 
FOR FRACTIONAL INPUTS OF THE FORM X : -1l <= X < l. 

TOTAL OF 13 INSTRUCTIONS ARE USED SUCH THAT EXECUTION 
TIME IS EQUAL TO 15 MACHINE CYCLES. 


WARNING: THIS MACRO REQUIRES THE INPUT SAMPLES QR AND 
QI TO BE IN CONSECUTIVE DATA MEMORY LOCATIONS 
ASCENDING ORDER. THE FOLLOWING STEPS ARE USED 


TO IMPLEMENT THIS MACRO: 


(1) [PI-QR] ----~- > [PI) 
(2) [PI+QR] ----~- > [QR] 
(3) [PR+#QI] ----- > [PR] 
(4) [PR-QI] ----- > [acc] 
(5) [QR] = ----= > {QI} 
(6) [Acc] ----- > [QR] 


MACRO ENTRY CONDITION: ARP MUST POINT AT AR1 
MACRO EXIT CONDITION : ARP MUST POINT AT AR1 


: BIAS FOR DATA MEMORY PAGE 4 


eee FF FH EH HHH HHH HEHEHE He eH HE EH Ee EE Hh EH 


KREKEKEKKKKKKERREKEEKERERREREREREKEREREREERKEEREERKKEEKKKKKKEKK 


PBY2I 
* 
x 
* 


* 


LRLK 


SMACRO PR,OR,BIAS 


INITIALISE AUXILIARY REGISTERS 


CALCULATE Re[P+jQ] AND Re[P-jQ] 


LAC *,15,AR2 ACC := (1/2) (PI) 

SUB *,15,AR1 ACC: := (1/2) (PI-QR) 
SACH x-,0,AR2 PI := (1/2)(PI-QR) 
ADDH x ACC := (1/2) (PI-QR)+(QR) 
SACH *+,0,ARL QR = := (1/2) (PI+QR) 


CALCULATE Im[P+jQ] AND Im[P-jQ] 


LAC *,15,AR2 ACC := (1/2) (PR) 
ADD *,15,AR1 ACC := (1/2) (PR+QI) 
SACH *,0,AR2 PR := (1/2) (PR+QI) 


AR1,:PR:+:BIAS:+1 AR1 POINTS TO PI 
LRLK AR2,:QR:+:BIAS: | AR2 POINTS TO QR 


SUBH x 
DMOV * 

SACH *,0,AR1 
SEND 


KKKKKEKKEKKKEKEKKEKKEKKKKKK 


MACRO 3: k=N 
P=(PR+}PI)  P+Q 

\/ 

 e 
Q=(QR+3QI) P-Q 
-j(2(pi)/N)k 

W =e 
N 

=W 
LET w=] 
THEN [QxW J=( 
Re [QxW }=( 
Im[{QaW ]=( 


ALL OUTPUT SAMPLES ARE 
A 1-BIT OVERFLOW. HOW 
FOR FRACTIONAL INPUTS 


TIME IS EQUAL TO 24 MA 
W TO BE THE ABSOLUTE V. 
SIN( /4). THE SIGNS 0 
TAKEN CARE OF IN THE C 


MACRO ENTRY CONDITION: 
MACRO EXIT CONDITION 


MEMORY ADDRESS BIAS 


+ ee HF FH HH KH HF HF HF OF KF HF KF HF FH HH HH HE HHH EE EE 


KAIKKRRERIERIAIKERERERER ER 
PBY4I $MACRO PR,QR, 
* 


TOTAL OF 22 INSTRUCTIONS ARE USED SUCH THAT EXECUTION 


: ARP MUST POINT AT AR1 


: BIAS FOR DATA MEMORY PAGE 4 


ACC := (1/2) (PR+QI)-(QI) 
-> QI 


QR Q 
QR := (1/2)(PR-QI) 


KEKKEKKKEKKEREREKEEEREREKEREKKKKKK KK 
/8. 


xW= (PR+Re [QaW] )+j (PI+Im[QaW] ) 
aW= (PR-Re [QW] )+j (PI-Im(Q*W]) 


=COS( (pi) /4)-jSIN( (pi) /4)=WR+jwI 
R+jWI 
COS((pi)/4) }=|SIN((pi)/4) | 


QR+QI) xW+j (QI-QR) «W 
QI+QR) aw 
QI-QR) «W 


SCALED DOWN BY 2 TO ACCOMMODATE 
EVER, NO OVERFLOWS WILL OCCUR 
OF THE FORM X : -1 <= X¥ <1. A 


CHINE CYCLES. THIS MACRO REQUIRES 
ALUE (MAGNITUDE) OF COS( /4) AND 
F THESE TRIG FUNCTIONS HAVE BEEN 
ODE. 


ARP MUST POINT AT AR1 


+ FF OF OF FO OF OO OO OHHH HHH 


KAIKAAKKKKREKAREREREERERERERERERK IKK 
BIAS 


LRLK AR1,:QR:+:BIAS:+1 AR1 POINTS TO QI 
LRLK AR2,:PR:+:BIAS: | AR2 POINTS TO PR 

k 
LAC x-,14 ACC := (1/4) (QT) 
SUB at, 14 ACC := (1/4) (QI-QR) 
SACH n~,1 QI := (1/2)(QI-OR) 
ADD *,15 ACC := (1/4) (QI+OR) 
SACH *,1,AR2 QR := (1/2) (QI+OR) 
LAC *,14,ARO ACC := (1/4)(PR) 
LT *,AR1 T  :=W=COS(PI/4)=SIN(PI/4) 
MPY x ,AR2 P := (1/4) (QI+QR) *W 
APAC ACC := (1/4) [PR+(QI+OR) *W] 
SACH a+,1,AR1 PR := (1/2) [PR+(QI+OR)*W] 
SPAC Acc := (1/4) (PR) 
SPAC ACC := (1/4) [PR-(QI+QR) *W] 
SACH x+,1,AR2 QR. := (1/2) [PR-(QI+QR) *W] 
LAC *,14,AR1 ACC: := (1/4) (PI) 


vol 
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MPY 

APAC 
SACH 
SPAC 
SPAC 
SACH 
SEND 


x, AR2 P -:= (1/4) (QI-QR)«W 
ACC := (1/4) {PI+(QI-QR) «W] 
*-,1,ARL PI := (1/2){PI+(QI-QR)*W] 
ACC := (1/4) (PI) 
ACC := (1/4) [{PI-(QI-QR) «W] 
= ( 


*x-,1,AR1 QI 1/2) [PI-(QI-QR) xW] 


KEKKKKKAKKKKKEKKKKKKEKRKERERAKREEKEKKEKKKEKKEARKEERAKKKKK KKK RRA 


LET 


THEN 


SIN(3 /4). 


Ce ee 


MACRO 4: k=3N/8. 


P=(PR+jPI) P+QuW=(PR+Re [QxW]) +3 (PI+Im[QxW] ) 


\/ 
/\ 


Q=(QR+jQI) P-QuW=(PR-Re[QxW])+j (PI+Im[QeW] ) 


~3(2(pi)/N)k 


=C0S(3(pi)/4)-jSIN(3(pi)/4) 
=WR+jWI 
W=|COS(3(pi)/4) |=|SIN(3(pi)/4)} 
[QW ]=(QI-QR) *W-j (QI+QR) *W 


RE[Q*W]=(QI-QR) «W 
IM(Q*WJ=(QI+QR) xW 


ALL OUTPUT SAMPLES ARE SCALED DOWN BY 2 TO ACCOMMODATE 
A 1-BIT OVERFLOW. HOWEVER, NO OVERFLOWS WILL OCCUR 

FOR FRACTIONAL INPUTS OF THE FORM X : -1 <= X <1. A 
TOTAL OF 22 INSTRUCTIONS ARE USED SUCH THAT EXECUTION 
TIME IS EQUAL TO 24 MACHINE CYCLES. THIS MACRO REQUIRES 
W TO BE THE ABSOLUTE VALUE (MAGNITUDE) OF COS(3 /4) AND 


THE SIGNS OF THESE TRIG FUNCTIONS HAVE BEEN 


TAKEN CARE OF IN THE CODE. 


MACRO ENTRY CONDITION: ARP MUST POINT AT AR1 
MACRO EXIT CONDITION : ARP MUST POINT AT AR1 


MEMORY ADDRESS BIAS : BIAS FOR DATA MEMORY PAGE 4 


+e + FH eH FH HF HF HF HF HF FH HF FF FF HF HF FH HF FH HF HF eH HF HH 


KKREKEKKEKRREKKEKKEKKEREKEKKEKKEKREEKREKKEKEREKEKEKERKKKKKKKK KAKA 


P3BY4I SMACRO 
* 


PR,QR,BIAS 


AR1L,:QR:+:BIAS:+1 AR1 POINTS TO QI 


AR2,:PR:+:BIAS: | AR2 POINTS TO PR 
x-,14 ACC := (1/4)(QI) 
a+, 14 ACC := (1/4) (QI-QR) 
x-,1 QI := (1/2)(QI-QR) 
*,15 ACC := (1/4) (QI+QR) 
x+,1,AR2 QR := (1/2) (QI+QR) 
*,14,ARO ACC := (1/4)(PR) 
*,AR1 T  :=W=COS( /4)=SIN( /4) 
*-,AR2 P  := (1/4) (QI-QR)#W 

ACC := (1/4) [PR+(QI-QR) xW] 


*x+,1,AR1 PR (1/2) [PR+(QI-QR) xW] 


ACC := (1/4)(PR) 

ACC := (1/4) [PR-(QI-QR)«W] 
P= := (1/4)(QI+QR) aw 

QR := (1/2) [PR-(QI-QR)«W] 
ACC := (1/4)(PI) 

ACC := (1/4) [PI~(QI+OR)*W] 
PI := (1/2) [PI-(QI+QR)*W] 
ACC := (1/4)(PI) 

ACC := (1/4) [PI+(QI+OR) «W] 
QI := (1/2) [PI+(QI+QR) xwW] 


KAKKKEKKAKKKAKRKKERERKAKRKEKEKEKKEREKREKERAREREREREREKEKERKKKAAKERKEK 


i a ae 


MACRO 5: 

A = RI+jI1 
B = R2+jI2 
C = R3+513 
D = R4+j14 
A' = 


B4 


REAL(B1) 
IMAG(B1) 


REAL (B2) 
IMAG(B2) 


REAL (B3) 
IMAG(B3) 


A! Bl 
\ / 
\/  B2 
B'\ /\ / 
C/N /\ 
/\ B3 
/ \ 
D! B4 


(R1+R2) + j(11+12) 


(R1-R2) + j(1I1-12) 


(R3+R4) + j(R3+R4) 


(R3-R4) + j(R3-R4) 


A'+cC' 
B'-3D' 
A'-C! 


B'+D! 


((R1+R2) 
((11+12) 


((R1-R2) 
((11-12) 


((R1+R2) 
((I1+1I2) 


+ (R3+R4)) /2 
+ (I3+1I4)) /2 


+ (I13-14)) /2 
(R3-R4)) /2 


(R3+R4)) /2 
(I3+14)) /2 


Le a a ae 
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Sol 


+e + ee eH eH He HF He HF 


REAL (B4) 
IMAG(B4) 


-1 <= X¥ < 1.0. 


((R1-R2) - (13-14)) /2 
((11-I2) + (R3-R4)) /2 


THE FIRST TWO STAGES OF A RADIX-2 N-POINT DIT FFT CAN BE 
IMPLEMENTED WITH A SPECIAL RADIX~-4 BUTTERFLY WHICH HAS A 
UNITY TWIDDLE FACTOR USED TO SPEED UP THE EXECUTION TIME 
OF THE FFT WITH THE ABOVE EQUATIONS. WHEN USING THESE 
EQUATIONS, ALL INPUT VALUES MUST BE WITHIN THE RANGE 
TOTAL NUMBER OF INTRUCTIONS IS. 39. 
EXECUTION TIME IS EQUIVALENT TO 41 MACHINE CYCLES. 


ee ee Fe HH He He 


HAKKAR IIAEAIARIIIIIA AAAI AERA REE REREAERER ERE 


KAIIAIAAIIAAAAIKAAAIAAAIAIAAAIAII AIA AAI IIIA IRAE REAR 


R3 
13 


R4 
14 


+e ee ee FF HF HH HHH HHH HEHE SE 


RADIX-4 MACRO 
N-POINT RADIX-2 DIT FFT. 


(R3+R4) /1 
(13+14)/1 


(R3-R4)/1 
(13-14)/1 


‘COMBO' 


REAL TERMS (R'S). 


THE FOLLOWING STEPS ARE USED TO IMPLEMENT THE SPECIAL 
FOR THE FIRST TWO STAGES OF AN 


(R1+R2)/1 
(114+12)/1 


{(R1-R2)+(I3-14) ]/2 
{(11-12)-(R3-R4) ]/2 


(R3+R4)/1 
(13+I4)/1 


[ (R1-R2)-(13-14) ]/2 
R4--->14:=(R3-R4)/1 
[(11-12)+(R3-R4) ]/2 


IN THE FOLLOWING CODE, ALL IMAGINARY TERMS (I'S) ARE 
ASSUMED TO IN LOCATIONS CONSECUTIVE TO THOSE OF THE 
THEREFORE, THE ADDRESS OF EACH 
IMAGINARY TERM (I) IS REPRESENTED AS (R+1). 


{(R1+R2)+(R3+R4) }/2 
{(11+12)+(13+14) ]/2 


{(R1-R2)+ (13-14) ]/2 
[(I1-12)-(R3-R4) ]/2 


[ (R1+R2)- (R3+R4) ]/2 
{(11+12)-(13+14) ]/2 


{(R1-R2)~ (13-14) ]/2 


[(11-12)+(R3-R4) ]/2 


+ Fe FF HH HF OF OH EH OHH HH HH 


HARRI IRIK IAAI IRA AEEIIAI RIERA ER EERIE 


COMBOI 
* 


x 
* 


$MACRO 


R1,R2,R3,R4,BIAS 


CALCULATE PARTIAL TERMS FOR R3,R4,13 AND 14 


:R3:-:BIAS: ,14 
:R4:-:BIAS: ,14 


:R3:-:BIAS: ,1 
:R4:-:BIAS: ,15 
:R4:-:BIAS:,1 


:R3:+1-:BIAS:,14 
:R4:+1-:BIAS: ,14 
:R3:+1-:BIAS:,1 
:R4:+1-:BIAS: ,15 
:R4:+1-:BIAS:,1 


Acc := (1/4) (R3) 

ACC := (1/4) (R3+R4) 

R3.:= (1/2) (R3+R4) 

ACC := (1/4) (R3+R4)-(1/2) (R4) 
R4 := (1/2) (R3-R4) 

acc := (1/4)(I3) 

acc := (1/4) (13+I4) 

13. := (1/2) (13+T4) 

ACC := (1/4) (I3+14)-(1/2) (14) 
14 := (1/2)(13-14) 


* 


CALCULATE 


CALCULATE 


PARTIAL TERMS FOR R2,R4,I2 AND I4 


:R1:-:BIAS: ,14 ACC 
:R2:-:BIAS: ,14 acc 
:R1:-:BIAS:,1 Rl 
:R2:-:BIAS:,15 ACC 
:R4:+1-:BIAS:,15 ACC 
:R2:-:BIAS: R2 
:R4:+1~-:BIAS: ACC 
:R4:-:BIAS: I4 
:R4:-:BIAS: R4 


:R1:+1-:BIAS: ,14 ACC 
:R2:+1-:BIAS:,14 ACC 
:R1:+1-:BIAS:,1 Il 

:R2:+1-:BIAS:,15 ACC 
:R4:+1-:BIAS: ,15 AcC 


:R2:+1-:BIAS: I2 
:R4:+1-:BIAS: ACC 
:R4:+1-:BIAS: 14 


er ee ee es ee eo ee ne 
tnd du ent out ut & t tot wt 


(1/4) (R1) 

(1/4) (R1+R2) 

(1/2) (R1+R2) 

(1/4) (R1+R2)- (1/2) (R2) 
(1/4) { (R1-R2)+(13-14) ] 
(1/4) { (R1-R2)+(13-14) ] 
(1/4) { (R1-R2)-(13-14) ] 
R4 = (1/2) (R3-R4) 
(1/4) [ (R1-R2)- (13-14) J 
(1/4) (11) 

(1/4) (11+12) 

(1/2) (11+12) 

(1/4) (11+I2)-(1/2) (12) 
(1/4) [ (11-12) -(R3-R4) ] 
(1/4) { (11-12) -(R3-R4) ] 
(1/4) { (11-12)+(R3-R4) ] 
(1/4) { (11-12) +(R3-R4) ] 


(1/4) (R1+R2). 

(1/4) [(R1+R2)+(R3+R4) } 
(1/4) {(R1+R2)+(R3+R4) J 
(1/4) [(R1+R2) - (R3+R4) ] 


PARTIAL TERMS FOR R1,R3,I1 AND I3 
:R1:-:BIAS:,15 Acc 
:R3:-:BIAS:,15 ACC 
:R1:-:BIAS: R1 

:R3:-:BIAS: ACC 
:R3:-:BIAS: R3 


:R1:+1-:BIAS:,15 
:R3:+1-:BIAS:,15 ACC 


:R1:+1-:BIAS: Il 
:R3:+1-:BIAS: ACC 
:R3:+1-:BIAS: I3 


oo 


>» 
° 
a 

iano pnuid th nn 


(1/4) [(R1+R2)- (R3+R4) } 
(1/4) (I1+I2) 

(1/4) ((11+12)+(13+14) J 
(1/4) ((11+12)+(13+14) J 
(1/4) ((11+12)-(13+14) ] 
(1/4) ((11+12)-(13+14) ] 


KKKKKKKKKEKEKKKEKEKEREEEEKKREKEREKKEKKREEREKKEKKEKKKKEKKEKKKKAKK 


+ ee ee OO OOH HHO 


MACRO 6: 


P=(PR+j3QI) 


Q=(QR+3QI) 


-j(2(pi)/N)k 


W =e 


N 


A 1-BIT OVERFLOW. 


CoS(0) AND SIN(O). 
HAVE BEEN TAKEN CARE OF IN THE CODE. 


A GENERAL RADIX-2 DIT FFT 


P+QaW=(PR+Re (QawW] )+j (PI+Im[(QxW] ) 


\/ 
/\ 


P-QaW=(PR-Re (QxwW] )+j (PI+Im[Q*W]) 


=CO0S(0)-3jSIN(O) 


=WRtjWI 


"BUTTERFLY ' 


ALL OUTPUT SAMPLES ARE SCALED DOWN BY 2 TO ACCOMMODATE 
HOWEVER, NO OVERFLOWS WILL OCCUR 

FOR FRACTIONAL INPUTS OF THE FORM X : 
TOTAL OF 25 INSTRUCTIONS ARE USED SUCH THAT EXECUTION 
TIME IS EQUAL TO 27 MACHINE CYCLES. 
REQUIRES W TO BE THE ABSOLUTE VALUE (MAGNITUDE) OF 

THE SIGNS OF THESE TRIG FUNCTIONS 


“l<=XK<1. A 


THIS MACRO 


Pe ee eo 
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+e ee 


* 


MACRO ENTRY CONDITION: ARP MUST POINT AT AR1 
MACRO EXIT CONDITION : ARP MUST POINT AT AR1 


MEMORY ADDRESS BIAS : BIAS FOR DATA MEMORY PAGE 4 


+e ee + HF 


RRKKKRKREKEKRIRAEIIIIKAEAA KIARA III KRII IIA IISA IA IAIAIA IIIS IAN 
BTRFLI $MACRO PR,QR,WR,WI,BIAS 
* 


x 
* 


* 


* 


* 


* 


INITIALISE AUXILIARY REGISTERS 


LRLK AR1,:QR:+:BIAS:+1 AR1 POINTS TO QI 
LRLK AR2, :PR:+:BIAS: AR2 POINTS TO PR 


CALCULATE QRAWR + QIAWI AND STORE RESULT IN QI 


LT x T-REGISTER := QI 

MPYK WI: P-REGISTER := (1/16) (QI*WI) 
LTP x ACC := (1/16) (QI&WI); T-REGISTER=QR 
MPYK :WR: P-REGISTER := (1/16) (QRAWR) 

APAC ACC := (1/16) (QRAWR+QI*WI ) 

SFR ACC := (1/32) (QRAWRtQI WI) 

SACH x+,4 QR := (1/2) (QRAWR+QI «WI ) 


CALCULATE QI*«WR - QR*WI AND STORE RESULT IN QR 


MPYK -:WI: P-REGISTER := (1/16) (-QR&WI) 

LTP x ACC := (1/16)(-QR«WI); T-REGISTER=Q1 
MPYK :WR: P-REGISTER := (1/16) (QIAWR) 

APAC ACC := (1/16) (QI*WR-QR&WI) 

SFR ACC := (1/32) (QI*WR-QRAWI) 

SACH x- ,4,AR2 QI := (1/2) (QIAWR-QR«WI) 


CALCULATE. Re[P+jQ] & Re[P-jQ] STORE RESULTS in PR & QR 


LAC *,14,AR1 ACC := (1/4)PR 

ADD *,15,AR2  ACC_-:= (1/4) [PR+(QRAWR+QI*WI) ] 
SACH *x+,1,AR1 PR := (1/2) [PR+(QRAWRtQI «WI ) ] 
SUBH x ACC := (1/4) [PR-(QR&AWR+QIAWI) } 
SACH a+,1,AR2 QR := (1/2) [PR-(QR&WR+OI&WI) ] 


CALCULATE Im[{P+jQ] & Im[P-jQ] STORE RESULTS in PI & QI 


LAC *,14,AR1 ACC := (1/4)PI 

ADD *,15,AR2 ACC: := (1/4) [PI+(QI&AWR-QRe«WI) J 
SACH #-,1,ARL PI := (1/2) (PI+(QIAWR-QR«WI) ] 
SUBH * ACC := (1/4) (PI-(QI*WR-QR«WI) } 
SACH a-,1,ARL QI := (1/2) (PI-(QIAWR-QR#WI) ] 


RERKEKKEREKKEREKEREKREKKREREERERERRERREREREREREREREREREKKKEKRAKK 


& 


* 
* 
* 
* 


* 
MACRO 7: RADIX-2 INPUT BIT REVERSAL 


ENTRY CONDITION: ARP MUST POINT AT AR1 


+e & 


EXIT CONDITION: ARP MUST POINT AT AR1 


A TOTAL OF 10 INSTRUCTIONS ARE USED SUCH THAT 
EXECUTION TIME IS EQUAL TO 12 MACHINE CYCLES. 


+e + He 
+e ee 


RKRKKRKRERKAKKERERERAKIKEE AKAIKE IA IRI IRIE ERAREERERERE 
* 

BITRVI $MACRO PR,QR,BIAS 

* 


* INITIALISE AUXILIARY REGISTERS 
* 
LRLK AR1,:PR:+:BIAS: AR1 POINTS TO PR 
LRLK AR2,:QR:+:BIAS: AR2 POINTS TO QR 
* 
ZALH *, AR2 
ADDS *, AR1 
SACL *x+,0,AR2 
SACH x+,0,AR1 
ZALH x, AR2 
ADDS *, AR1 
SACL x- ,0,AR2 
SACH *x-,0,AR1 
SEND 


APPENDIX C 
A 256-POINT, RADIX-2 DIT FFT IMPLEMENTATION 


4. Implementation of Fast Fourier Transform Algorithms with the TMS32020 107 


801 
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APPENDIX C 


IDT 'FFT256' 
KAKKKRKKKERERKKEKREK RRAKIKE RRR EAKRIARAAIRAERERIERRERRE RE REK 


A 256-POINT RADIX-2 DIT COMPLEX FFT FOR THE TMS32020 


THE FOLLOWING FILE RAD2FFT.MAC CONSISTS OF ALL THE 
MACROS LISTED IN APPENDIX B 


+ ee FF OF OF 
+ + FF FF OF 


KAKKKKKKAKAKRARER RAKE KAKI RR IARKEIRERERAERRAERE RAK KKK 
* 

COPY RAD2FFT.MAC 
* 
KAKKKKAKKKAKKKAKKRKAKAKRE RAK RARER IKK ERIE RRA RAIA RRERAAK KK 


* x 
* DATA MEMORY MAP FOR PAGES 4, 5, 6 AND 7 (BLOCKS BO,B1) x 
* * 


KKKKAKKKKKKKAKKKKEKKEKEKAAKKEERKKKEKKAEREKEKRKARKREREKEKRKKKKKKKKK 


DORG ct) 

* 

* DATA MEMORY PAGE 4 (STARTING ADDRESS 512 OR >200) 
* 

X000 DATA 0,0 
x001 DATA 0,0 
X002 DATA 0,0 
X003 DATA 0,0 
X004 DATA 0,0 
x005 DATA 0,0 
X006 DATA 0,0 
X007 DATA 0,0 
X008 DATA 0,0 
X009 DATA 0,0 
X010 DATA 0,0 
X011 DATA 0,0 
X012 DATA 0,0 
X013 DATA 0,0 
x014 DATA 0,0 
XO15 DATA 0,0 
X016 DATA 0,0 
X017 DATA 0,0 
x018 DATA 0,0 
X019 DATA 0,0 
x020 DATA 0,0 
x021 DATA 0,0 
X022 DATA 0,0 
X023 DATA 0,0 
X024 DATA 0,0 
X025 DATA 0,0 
K026 DATA 0,0 
X027 DATA 0,0 
x028 DATA 0,0 
x029 DATA 0,0 
X030 DATA 0,0 
X031 DATA 0,0 
X032 DATA 0,0 
X033 DATA 0,0 


. 


X088 
K089 
X090 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


DATA 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


i i i, i i, i i i, ee iy 


oooooocoo°ococeooocoeo0occdceceo0oo00cco0o0oo0o0o se 


s sos 


ooooooooo0oco°coeocoveoooo0cooooo0oo0oo0°coo0°cjo;o 


MEMORY PAGE 5 (STARTING ADDRESS 640 OR >280) 


i ee nd 


Or ry ~ 


qoQoeooo0o0o0cococ”cooooooo0oo0o0o0coe0o0oo0o0o0o00cn oo0c oe 
qeooooo0oo0ocecedcoo0ooo0ocececeoooceoeooecna 


. 
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x091 
X092 
X093 
Xx094 
x095 
X096 
X097 
X098 
Xx099 
X100 
Xx101 
X102 
X103 
X104 
X105 
X106 
X107 
X108 
X109 
X110 
X111 
K112 
X113 
X114 
X115 
X116 
X117 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


DATA 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


x os 


- ss Ss Ss 8s 8 


ocooooooo~o~eceaoeceeoaocooocooococoocoo0cocooocco°oo:sd 


. . 8s &s & 8s SO 


coooao~oooaocoea~ce~a~eaeaaaaoaeceeocoecocoeocoocooocooccocooo0oce°oeco eo 


| 


MEMORY PAGE 6 (STARTING ADDRESS 768 OR >300) 


ecCoaooodoooo0oo0o0o0o0o0c0oo0ooqoo00°ndcso 


er ee ee ee 


ecooooooooqooco0c0c0cec°e°eoedcse 


. 


oooo0oooeoo0ooeoedcdeooooececocecedceoeoe0ccececco0o0ocecoocooocoo0 00 8 
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MEMORY PAGE 7 (STARTING ADDRESS 896 OR >380) 


oooooooo°oo°ocoec°cjeo 
ooooooooqoo°o°o°o°o 
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X205 DATA 0,0 * 
X206 DATA 0,0 IOI IS I II Idd a a kth iccktck sek thik 
X207 DATA 0,0 * x 
X208 DATA 0,0 * 13-BIT TWIDDLE FACTORS FOR 256-POINT COMPLEX FFT * 
X209 DATA 0,0 x * 
210 DATA 0,0 KIRK IIIA AIR IRI IIR RIA RIA IR IRI IRI III II IIIT IIA I IIA IAA AI AI AI 
X211 DATA 0,0 x 
X212 DATA 0,0 c000 EQU 4095 
X213 DATA 0,0 cool EQU 4094 
X214 DATA 0,0 c002 EQU 4091 
X215 DATA 0,0 c003 EQU 4085 
X216 DATA 0,0 C004 EQU 4076 
X217 DATA 0,0 coos EQU 4065 
X218 DATA 0,0 C006 EQU 4052 
X219 DATA 0,0 C007 EQU 4036 
X220 DATA 0,0 coos EQU 4017 
X221 DATA 0,0 co09 EQU 3996 
X222 DATA 0,0 co10 EQU 3973 
X223 DATA 0,0 C011 EQU 3948 
X224 DATA 0,0 co12 EQU 3920 
X225 DATA 0,0 co13 EQU 3889 
X226 DATA 0,0 C014 EQU 3857 
X227 DATA 0,0 co1s EQU 3822 
X228 DATA 0,0 C016 EQU 3784 
X229 DATA 0,0 C017 EQU 3745 
X230 DATA 0,0 co1s EQU 3703 
X231 DATA 0,0 col19 EQU 3659 
X232 DATA 0,0 c020 EQU 3612 
X233 DATA 0,0 co21 EQU 3564 
X234 DATA 0,0 co22 EQU 3513 
X235 DATA 0,0 c023 EQU 3461 
X236 DATA 0,0 C024 EQU 3406 
X237 DATA 0,0 co25 EQU 3349 
X238 DATA 0,0 C026 EQU 3290 
X239 DATA 0,0 C027 EQU 3229 
X240 DATA 0,0 co28 EQU 3166 
X241 DATA 0,0 c029 EQU 3102 
X242 DATA 0,0 C030 EQU 3035 
X243 DATA 0,0 co31 EQU 2967 
X244 DATA 0,0 C032 EQU 2896 
X245 DATA 0,0 C033 EQU 2824 
X246 DATA 0,0 C034 EQU 2751 
X247 DATA 0,0 C035 EQU 2675 
X248 DATA 0,0 C036 EQU 2598 
X249 DATA 0,0 C037 EQU 2520 
X250 DATA 0,0 C038 EQU 2440 
X251 DATA 0,0 C039 EQU 2359 
X252 DATA 0,0 c040 EQU 2276 
X253 DATA 0,0 c041 EQU 2191 
X254 DATA 0,0 C042 EQU 2106 
X255 DATA 0,0 C043 EQU 2019 
ke c044 EQU 1931 
FOGG III III III I IOI II IIIT OI AIA A Aa a A A A AR co45 EQU 1842 
r : C046 EQU 1751 
& DATA LOCATION IN BLOCK B2 FOR W=COS(45) OR SIN(45) x c047 EQU 1660 
x k c048 EQU 1567 
KKK IKKRAKIRKRAKEKRERERIRERIARIRER ERR REI R AI IRI RIAA IAAI IAI K RIK c049 EQU 1474 
c cos EQU 1380 
DORG 96 cos1 EQU 1285 


W DATA 0 C052 EQU 1189 


ani 


OZOZTESWLL 24) TIM sWIUOSTY WIOJsuBI] JOLINO, Ise Jo UoTRUSUAIdW] “py 


C053 
C054 
co55 
C056 
C057 
co58 
cos9 
c060 
c061 
c062 
c063 


co065 
c066 
C067 
c068 


co99 
C100 


C102 
C103 
C104 
C105 
C106 
C107 
c108 
C109 
C110 
Cl1lil 
C112 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


1092 


-3102 
-3166 
-3229 
-3290 
-3349 
-3406 
-3461 
-3513 
-3564 
-3612 
-3659 
-3703 
-3745 
-3784 


$030 
$031 


$033 
S034 
S035 
$036 
$037 
S038 
S039 
S040 
$041 
S042 
S043 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


cll 
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S044 
$045 
S046 
S047 
$048 
$049 
$050 
$051 
$052 
$053 
S054 
$055 
S056 
$057 
S058 
$059 
S060 
$061 
$062 
$063 
S064 
$065 
S066 
$067 
$068 
S069 
$070 
$071 
$072 
$073 
$074 
S075 
$076 
$077 
S078 
$079 
$080 
$081 
$082 
$083 
S084 
S085 
S086 
S087 
$088 
$089 
$090 
sogl 
$092 
$093 
S094 
$095 
S096 
$097 
$098 
$099 
$100 
$101 
$102 
$103 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


3612 
3659 
3703 
3745 
3784 
3822 
3857 
3889 
3920 
3948 
3973 
3996 
4017 
4036 
4052 
4065 
4076 
4085 
4091 
4094 
4095 
4094 
4091 
4085 
4076 
4065 
4052 
4036 
4017 
3996 
3973 
3948 
3920 
3889 
3857 
3822 
3784 
3745 
3703 
3659 
3612 
3564 
3513 
3461 
3406 
3349 
3290 
3229 
3166 
3102 
3035 
2967 
2896 
2824 
2751 
2675 
2598 
2520 
2440 
2359 


k 
* 
* 
FFT256 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


AORG 
B 


201 
101 


0 
INIT 


SYSTEM INITIALIZATION 


AORG 


16-BIT TWIDDLE FACTOR FOR SPECIAL MACROS 


DATA 


SPM 

CNFD 
ROVM 
SSXM 
LARP 
LRLK 
LALK 
TBLR 


FFT CODE WITH BIT-REVERSED INPUT SAMPLES 


BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 


>20 


>SA82 


0 


ARO 
ARO,W 
WVAL 
*,AR1 


X001 ,X128,512 
X002,X064,512 
X003 ,X192,512 
X004 ,K032,512 
K005 ,X160,512 
K006 ,X096,512 
X007 ,X224,512 
X008 ,X016,512 
X009 ,X144,512 
K010 ,X080 ,512 
K011,X208,512 
X012,X048,512 
X013 ,X176,512 
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ell 


BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 


X014,X112,512 
X015,X240,512 
X017 ,X136,512 
X018 ,X072,512 
X019 ,X200,512 
X020, X040,512 
X021,X168,512 
X022 ,X104,512 
X023 ,X232,512 
X025 ,X152,512 
X026 ,X088,512 
X027 ,X216,512 
X028 ,X056,512 
X029 ,X184,512 
X030 ,X120,512 
X031 ,X248,512 
X033 ,X132,512 
X034,X068,512 
X035 ,X196,512 
X037 ,X164,512 
X038 ,X100,512 
X039,X228,512 
X041 ,X148,512 
X042 ,X084,512 
X043 ,X212,512 
X044 ,,X052,512 
X045 ,X180,512 
X046 ,X116,512 
X047 ,X244,512 
X049 ,X140,512 
X050 ,X076,512 
X051 ,X204,512 
X053,X172,512 
X054,X108,512 
X055 ,X236,512 
X057 ,X156,512 
X058 , X092,512 
X059,X220,512 
X061,X188,512 
X062 ,X124,512 
X063 ,X252,512 
X065 ,X130,512 
X067 ,X194,512 
X069 ,X162,512 
X070 ,X098,512 
X071,X226,512 
X073 ,X146,512 
X074 ,X082,512 
X075 ,X210,512 
X077 ,K178,512 
X078 ,X114,512 
K079 ,X242,512 
X081 ,X138,512 
X083 ,X202,512 
K085 ,X170,512 
X086 ,X106,512 
X087 ,X234,512 
X089 ,X154,512 
X091 ,X218,512 
X093 ,X186,512 


* * 


BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 


X094,X122,512 
X095 ,X250,512 
X097 ,X134,512 
X099 ,X198,512 
X101,X166 ,512 
X103 ,X230 , 512 
X105,X150,512 
X107,X214,512 
K109 ,X182,512 
X110,X118,512 
X111,X246 ,512 
K113,x142,512 
X115 ,X206 ,512 
X117,X174,512 
119,238,512 
X121,X158,512 
X123,X222,512 
X125,X190 , 512 
X127 ,X254,512 
X131,X193,512 
X133,X161,512 
X135,X225,512 
X137,X145 512 
X139,X209, 512 
X141,X177, 512 
X143 ,X241,512 
147,201,512 
X149,X169,512 
X151,X233,512 
X155 ,X217,542 
X157,X185 ,512 
X159,X249, 512 
163, X197,512 
167 ,X229, 512 
X171,X213,512 
X173,X181,512 
X175 ,X245,512 
X179,X205 ,512 
X183 ,X237,512 
X187 ,X221 ,512 
X191,X253,512 
X199, 227, 512 
X203 ,X211,512 
207 ,K243 , 512 
K215,X235,512 
X223, 251,512 
X239, 247,512 


FFT CODE FOR STAGES 1 AND 2 


LDPK 

COMBOT 
COMBOI 
COMBOI 
COMBOI 
COMBOT 
COMBOI 
COMBOT 
COMBOI 
COMBOTI 


4 

X000 , X001 , X002,,X003 ,0 
X004, X005 , X006 ,X007,,0 
X008 ,X009,X010,X011,0 
X012,X013,X014,X015,0 
X016,X017 ,X018,X019,0 
X020 ,X021 ,X022,X023,0 
X024,X025 ,X026,X027,0 
K028 ,X029 ,X030, X031,0 
X032 ,X033 ,X034,X035,0 


vil 
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COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
LDPK 

COMBOI 
COMBOI 
COMBOI 
COMBOT 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOT 
COMBOTI 
COMBOI 
COMBOT 
COMBOI 
COMBOT 
LDPK 

COMBOT 
COMBOI 
COMBOI 
COMBOT 
COMBOI 
COMBOT 
COMBOTI 
COMBOI 
COMBOT 
COMBOI 
COMBOT 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
LDPK 

COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 


- COMBOI 


COMBOI 
COMBOI 
COMBOI 
COMBOT 
COMBOI 
COMBOI 
COMBOI 
COMBOT 
COMBOT 


X036 , X037 ,X038,X039,0 
X040 , X041 , X042 ,X043 ,0 
X044 ,X045 , X046 ,X047,0 
X048 ,X049 ,X050 ,X051,0 
X052,X053 ,X054,X055,0 
X056 ,X057 ,X058 ,X059 ,0 
X060 , X061 , X062,,X063 ,0 

5 

X064, X065 , X066 ,X067,128 
X068 ,X069 ,X070 ,X071,128 
X072 ,X073 ,X074,X075 ,128 
X076 ,X077 ,X078 ,X079 ,128 
X080 ,X081,X082 ,x083 ,128 
X084 ,X085 , X086 ,X087 ,128 
X088 ,X089 , X090 ,X091,128 
X092 ,X093 ,X094,x095 ,128 
X096 ,X097 , X098 ,X099 ,128 
X100,X101,X102,xX103,128 
X104,X105 ,X106 ,X107,128 
X108 ,X109,X110,X111,128 
X112,X113,K114,X115,128 
X116 ,K117 ,X118,X119,128 
X120,X121,X122 ,X123,128 
X124,X125,X126,X127,128 
6 
X128,X129,X130,X131,256 
X132,X133 ,X134,X135 , 256 
X136 ,X137,X138,X139 , 256 
X140,X141,X142 ,X143, 256 
X144 ,X145 ,X146 ,X147 ,256 
K148 ,X149,X150,X151,256 
X152,X153,X154,X155 , 256 
X156,X157,X158,X159, 256 
X160 ,X161,X162,X163 ,256 
X164 ,X165,X166,X167 , 256 
X168,X169,X170,X171, 256 
X172,X173,X%174,X175,256 
X176 ,X177 ,X178 ,X179 ,256 
X180,X181 ,X182,X183 , 256 
X184,K185 ,X186 ,X187, 256 
X188 ,X189,X190,X191,256 
7 

X192,X193 ,X194,X195 , 384 
X196 ,X197 ,X198,X199, 384 
X200 ,X201 ,X202,X203 , 384 
X204,X205 ,X206 ,X207 , 384 
X208,X209,X210,X211, 384 
X212,X213,X214,X215, 384 
X216 ,X217 ,X218,X219 , 384 
X220 ,X221 ,X222 ,X223 , 384 
X224,X225 ,X226,X227 , 384 
X228 ,X229 ,X230, 23) , 384 
X232,X233 ,X234,X235 ,384 
X236 ,X237 , K238 ,X239 , 384 
X240 ,X241 ,X242 ,X243 , 384 
X244 ,X245 ,X246 ,X247 , 384 
X248 ,X249 ,X250,X251 , 384 
X252,K253 ,X254,X255,384 


FFT CODE FOR STAGE 3 


ZEROTI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROT 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY41 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 


X000 , X004, 512 
X001,X005,512 
X002, X006,512 
X003 , X007, 512 
X008, X012,512 
x009, X013,512 
XO10 ,X014, 512 
X011,X015,512 
X016 ,X020,512 
X017,X021,512 
X018,X022,512 
K019, X023,512 
X024, X028,512 
X025 ,X029, 512 
X026 ,X030,512 
X027 ,X031,512 
X032,X036,512 
X033 ,X037,512 
X034,X038,512 
X035 ,X039,512 
X040 , X044,512 
X041 ,X045,512 
X042, 046,512 
X043 ,X047 ,512 
X048 ,X052,512 
X049,X053,512 
X050,X054, 512 
X051,X055,512 
X056 ,X060,512 
X057,X061,512 
X058,X062,512 
X059 , K063,512 
X064,X068, 512 
X065 ,X069, 512 
X066 ,X070, 512 
X067 ,X071, 512 
X072,X076,512 
X073 ,X077, 512 
X074,X078,512 
K075 ,X079 ,512 
X080 ,X084, 512 
X081,X085, 512 
X082 , X086 , 512 
X083 ,X087, 512 
X088 ,X092,512 
X089 , X093, 512 
X090 ,X094, 512 
X091 ,X095 512 
X096 ,X100,512 
X097 ,X101,512 
X098 ,X102,512 
X099 , X103,512 
X104 ,X108,512 
X105 ,X109, 512 
X106 ,X110,512 
X107 ,X111,512 
X112,X116,512 
X113,X117,512 
X114,X118,512 
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Stt 


P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY41 
ZEROI 
PBY4I1 
PBY2I 
P3BY41 
ZEROI 
PBY4I1 
PBY2I 
P3BY41 
ZEROI 
PBY41 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROL 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 


X115,X119,512 
X120,X124,512 
X121,X125,512 
X122,X126,512 
X123 ,X127 .512 
X128,X132,512 
X129,X133,512 
X130 ,X134,512 
X131,X135,512 
X136,X140,512 
X137,X141,512 
X138,X142,512 
X139,X143,512 
X144,X148,512 
X145 ,X149,512 
X146 ,X150,512 
X147,X151,512 
X152,X156,512 
X153,X157,512 
X154,X158,512 
X155,X159,512 
X160,X164,512 
X161,X165,512 
X162,X166,512 
X163,X167,512 
X168 ,X172,512 
X169,X173,512 
X170 ,X174,512 
X171,X175,512 
X176 ,X180,512 
X177,X181,512 
X178 ,X182,512 
X179 ,X183 ,512 
X184,X188,512 
X185 ,X189,512 
X186 ,X190,512 
X187 ,X191,512 
X192,X196,512 
X193 ,X197,512 
X194,X198,512 
X195 ,X199,512 
X200 ,X204,512 
X201,X205,512 
X202 ,X206,512 
X203 ,X207 ,512 
X208 ,X212,512 
X209,X213,512 
X210,X214,512 
X211,X215,512 
X216 ,X220,512 


‘%217 ,X221 ,512 


X218,X222,512 
X219,X223,512 
X224 ,X228 ,512 
K225 ,X229,512 
X226 ,X230,512 
X227 ,X231,512 
X232 ,X236,512 
X233 ,X237 ,512 
X234 ,X238,512 


* 


P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 


X235 ,X239,512 
X240 ,X244,512 
X241 ,X245 ,512 
X242 ,X246,512 
X243 ,X247,512 
X248 ,X252,512 
X249,X253,512 
X250 ,X254,512 
X251,X255,512 


FFT CODE FOR STAGE 4 


ZEROI 
BTRFLI 
PBY4I 
BTRFLI 


' PBY2I 


BTRFLI 
P3BY4I 
BTRFLI 
ZEROT 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBYZI . 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY41 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 


X000 ,X008,512 
X001 ,X009,C016,S016,512 
X002 ,X010,512 
X003 ,X011,C048 ,S048 ,512 
X004,X012,512 
X005 ,X013,C080 ,S080, 512 
X006 ,X014,512 
X007, X015,C112,S5112,512 
X016 ,X024,512 
X017 ,X025,C016,S016,512 
X018,X026 ,512 
X019 ,X027 ,C048,S048 ,512 
X020 ,X028,512 
X021 ,X029, C080 ,S080,512 
X022 ,X030,512 
X023 ,X031,C112,S8112,512 
X032,X040,512 
X033 ,X041,C016,S016,512 
X034 ,X042,512 
X035 ,X043 ,C048 ,S048, 512 
X036 ,X044,512 
X037 ,X045 ,C080,S080, 512 
X038 ,X046 ,512 
X039 ,X047,C112,S8112,512 
X048 ,X056,512 
X049 ,X057,C016,S016,512 
X050 ,X058, 512 
X051,X059, C048 ,S048,512 
X052,X060,512 
X053,X061 ,C080 ,S080 ,512 
X054,X062,512 
X055 ,X063,C112,8112,512 
X064 ,X072,512 
X065 ,X073,C016,S016,512 
X066 ,X074,512 
X067 ,X075,C048 ,S048 ,512 
X068 ,X076,512 
X069 ,X077 , C080 ,S080 , 512 
X070 ,X078,512 
X071 ,X079,C112,8112,512 
X080 ,X088, 512 
X081 ,X089,C016,S016,512 
X082 ,X090,512 
X083 ,X091 ,C048 ,S048,512 
X084 ,X092,512 
X085 ,X093 ,C080 ,S080,512 
X086 ,X094,512 
X087 ,X095,C112,S8112,512 


9IT 
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ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY41 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BIRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 


X096,X104,512 

X097 ,X105,C016,5016,512 
X098 ,X106,512 

X099 ,X107 ,C048,°148,512 
X100,X108,512 
X101,X109,C080,S080,512 
X102,X110,512 
X103,X111,€112,$112,512 
K112,X120,512 
X113,X121,C016,S016,512 
X114,X%122,512 
X115,X123,C048,S048,512 
X116,X124,512 
X117,X125,C080,S080,512 
X118,X126,512 
X119,X127,C112,$8112,512 
X128 ,X136,512 

X129 ,X137,C016,S016,512 
X130,X138,512 
X131,X%139,C048 ,S048,512 
X132,X%140,512 

X133 ,X141,C080,S080,512 
X134,X142,512 

X135 ,K143 ,C112,8112,512 
X144,X152,512 

X145 ,X153,C016,S016,512 
X146 ,K154,512 

X147 ,X155,C048,S048 ,512 
X148 ,K156,512 

X149 ,X157,C080,S080,512 
X150 ,X158,512 
X151,X159,C112,S8112,512 
X160,X168,512 
X161,X169,C016,S016,512 
X162,X170,512 

X163 ,X171,C048,S048,512 
X164,X172,512 

X165 ,X173 ,C080 ,S080 ,512 
X166,X174,512 

X167 ,X175,C112,5112,512 
X176,X184,512 
X177,X185,C016,S016,512 
X178,X186,512 

X179 ,X187 ,C048,S048 ,512 
X180 ,X188,512 
X181,X189,C080,S080,512 
X182,X190,512 

X183 ,K191,C112,S112,512 
X192,X200,512 
X193,X201,C016,S016 ,512 
X194,X202,512 

X195 ,X203,C048,S048 ,512 
X196 ,X204,512 

X197 ,X205 ,C080 ,S080 ,512 
X198 ,X206,512 

X199 ,X207 ,C112,S8112,512 
X208 ,X216,512 

X209 ,X217,C016,S016,512 
X210,X218,512 
X211,X219,C048,S048 ,512 


PBY2I 
BTIRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I | 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 


X212,X220,512 
X213,X221,C080,S080 ,512 
X214,X222,512 
K215,X223,C112,8112,512 
X224,X232,512 
X225 ,X233,C016,S016,512 
X226 ,X234,512 
X227 ,X235 ,C048,S048 ,512 
X228 ,X236,512 
X229 ,X237 ,C080,S080 ,512 
X230 ,X238,512 
X231,X239,C112,8112,512 
K240 ,X248 ,512 
X241 ,X249,C016,S016,512 
X242 ,X250,512 
X243 ,X251,C048,S048,512 
X244 ,X252 ,512 
X245 ,X253 ,C080,S080 ,512 
X246 ,X254,512 
X247 ,X255,,C112,8112,512 


FFT CODE FOR STAGE 5 


ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BIRFLI 
BTRFLI 
BTRFLI 
PBY4I 


X000 ,X016 ,512 

X001 ,X017,C008,S008, 512 
X002 ,X018,C016,S016,512 
X003 ,X019,C024,5024,512 
X004,X020,512 

X005 ,X021 ,C040,S040, 512 
X006 ,X022 ,C048 ,S048 ,512 
X007 ,X023 ,C056,5056 ,512 
X008 , X024,512 

X009 ,X025 ,C072,5072,512 
X010 ,X026 ,C080,S080,512 
X011,X027 ,C088,S088,512 
X012,X028,512 

X013,X029,C104,S104,512 
X014,xX030,C112,8112,512 
X015,X031,C120,S120,512 
K032 ,X048,512 

X033 ,X049 ,C008,S008 ,512 
X034 ,X050,C016,S016 ,512 
X035 ,X051,C024,S8024,512 
X036 ,X052,512 

X037 ,X053 ,C040,S040 ,512 
X038 ,X054 ,C048,S048 ,512 
X039,xX055 ,C056,S056,512 
X040 ,X056,512 

X041 ,X057 ,C072,,8072,512 
X042 ,X058,C080,S080 ,512 
X043 ,X059 ,C088,S088,512 
X044 ,X060 ,512 

K045 ,X061,C104,5104,512 
X046 ,X062,C112,5112,512 
X047 ,X063,C120,S8120,512 
X064 ,X080 ,512 

X065 ,X081 ,C008,S008 ,512 
K066 ,K082,C016,S016,512 
X067 , X083 , C024 ,S024,512 
X068 ,X084,512 
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Lit 


BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BIRFLI 
BTRFLI 
P3BY41 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BIRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFELI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 


X069 ,, X085 ,C040,S040,512 
X070 ,X086 ,C048,5048 ,512 
X071 ,X087 ,COS6,S056,512 
X072 ,X088 ,512 

X073,X089 ,CO72,S072,512 
X074, X090,C080,S080, 512 
X075 ,X091 ,C088,S088 ,512 
X076 ,X092 ,512 

X077 ,X093 ,C104,S104,512 
X078 ,X094,C112,5112,512 
X0793,X095 ,C120,5120 ,512 
X096 ,X112,512 

X097 ,X113 ,C008,S5008,512 
X098 ,X114,C016,S016,512 
X099 ,K115,C024,S024,512 
X100,X116,512 

X101 ,X117 ,C040,S040 ,512 
X102,X118 ,C048,S048,512 
X103 ,X119,C056,S056 ,512 
X104,X120,512 

X105 ,X121,C072,8072,512 
X106 ,X122,C080,S080 ,512 
X107 ,X123 ,C088 ,S088 ,512 
X108 ,X124,512 

X109,X125,C104,S104,512 
X110,X126 ,C112,$112,512 
X111,X127,C120,8120,512 
X128 ,X144,512 

X129,X145 ,C008,S008, 512 
X130 ,X146,C016,S016,512 
X131,X147 ,C024,S024,512 
X132 ,X148 ,512 

X133,,X149 ,C040,S040,512 
X134,X150 ,C048,S048 ,512 
X135,X151,C056,S056 ,512 
X136 ,X152,512 

X137 ,X153 ,C072,S072,512 
X138 ,X154,C080,S080 ,512 
X139,X155 ,C088 ,S088 ,512 
X140 ,X156,512 

X141,X157,C104,S5104,512 
X142,,X158,C112,8112,512 
X143 ,X159,C120,5120,512 
X160,X176 ,512 

X161,X177 ,C008,S008 , 512 
X162 ,X178,C016,S016 ,512 
X163 ,X179 ,C024,S024,512 
X164,X180 ,512 

X165 ,X181 ,C040 ,S040 ,512 
X166 ,X182 ,C048 ,S048 ,512 
X167 ,X183 ,CO56,S056,512 
X168 ,X184,512 

X169 ,X185 ,C072,S072,512 
X170 ,X186 ,CO80 ,S080 ,512 
X171,X187 ,C088 ,S088 , 512 
X172,X188 ,512 

X173 ,X189 ,C104,S5104,512 
X174,X190 ,C112,8112,512 
X175 ,X191,€120,S120,512 
X192 ,X208 ,512 


* 


BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTIRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 


K193 ,X209,C008,S008 ,512 
X194,X210,C016,S016,512 
K195 ,X211,C024,S024,512 
X196 ,X212,512 

X197 ,X213,C040,S040,512 
X198,X214,C048,S048,512 
X199,X215,C056,S056,512 
X200 ,X216,512 

X201 ,X217 ,C072,S072,512 
X202 ,X218,C080,S080 , 512 
X203 ,X219, C088 ,S088,512 
X204,X220,512 

X205 ,X221 ,C104,S104,512 
X206 ,X222,C112,S5112,512 
X207 ,X223,C120,S120,512 
X224 ,X240,512 

X225 ,X241 ,C008,S008,512 
X226 ,X242,C016,S016,512 
X227 ,X243 ,C024,S024,512 
X228 ,X244,512 

X229 ,X245 ,C040 ,S040,512 
X230 ,X246 ,C048,S048,512 
X231,X247,C056,S056,512 
X232 ,X248,512 

X233 ,X249,C072,S072,512 
X234,X250,C080,S080,512 
X235 ,X251,C088,S088,512 
X236 ,X252,512 

X237 ,X253,C104,5104,512 
X238 ,X254,C112,$112,512 
X239 ,X255,C120,$120,512 


FFT CODE FOR STAGE 6 


ZEROI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY41 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
P3BY41 
BTRFLI 


X000 ,X032,512 

X001 ,X033 ,C004,5004,512 
X002 ,X034,C008 ,S008, 512 
X003 ,X035,C012,S012,512 
X004 ,X036 ,C016,S016,512 
X005 ,X037 ,C020,S020,512 
X006 ,X038,C024,S024,512 
X007 ,X039 ,C028,S028,512 
X008 ,X040,512 

X009 ,X041 ,C036 ,S036 ,512 
X010 ,X042 ,C040 ,S040,512 
X011,X043 ,C044,5044,512 
X012 ,X044,C048 ,S048,512 
X013 ,X045 ,C052,S052,512 
X014 ,X046 ,C056 ,S056,512 
X015 ,X047 ,C060 ,S060,512 
X016 ,X048 ,512 

X017 ,X049 ,C068 ,S068,512 
X018 ,X050 ,C072,S072,512 
X019 ,X051,C076,S076,512 
X020 ,X052,C080 ,S080, 512 
X021,X053 ,C084,S084,512 
X022 ,X054, C088 ,S088 ,512 
X023 ,X055 ,C092,S092,512 
X024,X056 ,512 

025 ,X057,C100,S100,512 


stl 
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BTRFLI 
BTRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTIRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BIRFLI 
BTRFLI 
BTRFLI 


X026 ,X058,C104,S104,512 
X027 ,X059 ,C108,S108,512 
X028 ,X060,C112,8112,512 
X029,,X061,C116,5116,512 
X030 ,X062,C120,5120,512 
X031 ,X063,C124,5124,512 
X064 ,X096 ,512 

X065 ,X097 ,C004,S004,512 
X066 ,X098,C008,S008 ,512 
X067 ,X099,,C012,S012,512 
X068 ,X100,C016,S016,512 
X069 ,X101,C020,S020,512 
X070 ,X102,C024,S024,512 
X071 ,X103 ,C028,S028,512 
X072 ,X104,512 

X073 ,X105 ,C036,S036 ,512 
X074,X106 ,C040 ,S040,512 
X075 ,X107 ,C044,5044,512 
X076 ,X108 ,C048,S048 ,512 
X077 ,X109,C052,S052,512 
X078 ,X110,C056,S056,512 
X079 ,X111,C060,S060,512 
X080 ,X112,512 

X081 ,X113,C068,S068,512 
X082 ,X114,C072,S072,512 
X083 ,X115,C076,S076,512 


' X084,X116,C080,S080,512 


X085 ,X117 ,C084,S084,512 
X086 ,X118, C088 ,S088 ,512 
X087 ,X119,C092,5092,512 
X088 ,K120,512 

X089 ,X121,C100,S100,512 
X090 , X122,C104,$104,512 
X091 ,X123,C108,S108,512 
X092 ,X124,C112,5112,512 
X093 ,X125,C116,$116,512 
X094,X126,C120,$120,512 
X095 ,X127,C124,5124,512 
X128 ,X160,512 

X129,X161,C004,S004,512 
X130,X162,C008 ,S008,512 
X131,X163,C012,S012,512 
X132,X164,C016,S016,512 
X133 ,X165,,C020,S020,512 
X134,X166,C024,S5024,512 
X135,X167,C028,S028,512 
X136 ,X168,512 

X137 ,X169, C036 ,S036,512 
X138,X170,C040 ,S040,512 
X139 ,X171,C044,S044,512 
X140 ,X172,C048,S048,512 
X141 ,K173,C052,S052,512 
X142 ,X174,C056,S056 ,512 
X143 ,K175,C060 ,S060,512 
X144,X176,512 

X145 ,X177, C068 ,S068,512 
X146 ,X178,C072,S072,512 
X147 ,K179,C076,S076,512 
X148 ,KX180,C080,S080,512 
K149, X181,C084,S084,512 


* 


BTRFLI 
BTRFLI 
P3BY41 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BIRFLI 
BTRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 


X150,X182,C088,S088 ,512 
X151,X183 ,C092,S092,512 
X152,X184,512 

X153,X185,C100,S100,512 
X154,X186 ,C104,$104,512 
X155,X187,C108,S108,512 
X156 ,X188,C112,5112,512 
X157,X189,C116,S116,512 
X158,X190,C120,S120,512 
X159,X191,C124,5124,512 
K192,X%224,512 

X193 ,X225,C004,5004,512 
X194 ,X226 ,C008,S008 ,512 
X195 ,X227,C012,S012,512 
X196 ,X228,C016,S016,512 
X197,X229,C020,S020,512 
X198 ,K230,C024,S024,512 
X199 ,X231,C028,S028,512 
X200 ,K232,512 

X201 ,K233 ,C036 ,S036,512 
X202 ,K234,C040,S040 , 512 
X203 ,X235,C044,5044,512 
X204 ,X236 ,C048,S048,512 
X205 ,X237,C052,S052,512 
X206 ,X238,C056,S056,512 
X207 ,X239 ,C060 ,S060 , 512 
X208 ,X240,512 

X209 ,K241 ,C068 ,S068 , 512 
X210 ,X242,C072,S072 ,512 
X211,X243,C076,S076,512 
X212,X244,C080,S080,512 
K213,X%245 ,C084,S084,512 
X214 ,X246 ,C088 ,S088,512 
X215 ,X247,C092,S092,512 
X216,X248,512 

X217 ,X249,C100,S100,512 
X218,X250,C104,5104,512 
X219,X251,C108,5108,512 
X220 ,X252,C112,5112,512 
X221,X253,C116,5116,512 
X222,X254,C120,5120,512 
X223 ,K255,C124,5124,512 


FFT CODE FOR STAGE 7 


ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 


X000 ,X064,512 

X001,X065,C002,S002,512 
X002 , X066,C004,5004,512 
X003 ,X067 ,C006 ,S006,512 
X004,X068,C008 ,S008 ,512 
X005 ,X069,C010,S010,512 
X006 ,X070,C012,S012,512 
X007 ,X071,C014,S5014,512 
X008 ,X072,C016,S016,512 
X009 ,X073,C018,S018,512 
X010 ,X074,C020,S020,512 
X011,X075,C022,S022,512 
X012,X076 ,C024,S024,512 
X013 ,X077 ,C026,S026,512 
X014,X078,C028,S028,512 
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6IT 


X015 ,X079,C030,S030,512 
X016 ,X080, 512 

X017 , X081 ,C034,5034,512 
X018 ,X082 ,C036 ,S036 ,512 
X019 ,X083 ,C038,S038,512 
X020 ,X084,C040,S040 , 512 
X021 , X085 ,C042 ,S042 ,512 
X022 ,X086 ,C044,S044,512 
X023 , X087 ,C046,,S046 ,512 
X024 ,X088 ,C048 ,S048 ,512 
X025 ,X089 ,C050,S050 ,512 
X026 ,X090 ,C0S2,S052,512 
X027 , X091 ,C054,S054,512 
X028 , X092,C056,S056,512 
X029 ,X093 ,C058,S058,512 
X030 , X094,C060,S060 , 512 
X031 ,X095 ,C062,S062 ,512 
X032 , X096 ,512 

X033 ,X097 , C066 ,S066 ,512 
X034,X098 ,C068,S068 ,512 
X035 ,X099,C070,S070,512 
X036 ,X100 ,C072,S072,512 
X037 ,X101 ,C074,S074,512 
X038 ,X102 ,C076,S076 ,512 
X039 ,X103 ,CO78,S078,512 
X040 ,X104,C080 ,S080 , 512 
X041 ,X105 ,C082 ,S082 ,512 
X042 ,X106 ,C084,S084,512 
X043 ,X107 ,C086 ,S086 ,512 
X044 ,X108 ,C088 ,S088 , 512 
X045 ,X109,C090 ,S090 ,512 
X046 ,X110,C092,S092,512 
X047 ,X111,C094,S094,512 
X048 ,X112,512 

X049 ,X113,C098,S098 ,512 
X050 ,X114,C100,S100,512 
X051 ,X115,C102,S102,512 
X052 ,X116,C104,S104,512 
X053 ,X117,C106,S106,512 
X054,X118,C108,S108 ,512 
X055 ,X119,C110,S110,512 
X056 ,X120,C112,S8112,512 
X057 ,X121,C114,S8114,512 
X058 ,X122,C116,S116,512 
X059 ,X123,C118,S$118,512 
X060 ,X124,C120,S120,512 
X061 ,X125,C122,S122,512 
X062 ,X126 ,C124,S124,512 
X063 ,X127 ,C126,S126,512 
X128 ,X192,512 

X129 ,X193 ,C002,S002,512 
X130 ,X194,C004,S004,512 
X131,X195,C006 ,S006 ,512 
X132 ,X196,C008,S008 ,512 
X133 ,X197,C010,S010,512 
X134,X198,C012,S012,512 
X135 ,X199,C014,S014,512 
X136 ,X200,C016,S016,512 
X137 ,X201 ,C018,S018 ,512 
X138 ,X202,C020,S020,512 


* 


BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
P3BY41I 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BIRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
BIRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 


X139 ,X203 ,C022 ,S022 ,512 
X140 ,X204,C024,S024,512 
X141,X205 ,C026 ,S026 ,512 
X142 ,X206 ,C028,S028,512 
X143 ,X207 ,C030 ,S030 ,512 
X144 ,X208 ,512 

X145 , X209 ,C034,S034,512 
X146 ,X210,C036 ,S036 ,512 
X147 ,X211 ,C038 ,S038 ,512 
X148 ,X212,C040 ,S040 ,512 
X149 ,X213 ,C042 ,S042 ,512 
X150 ,X214,C044,S044 ,512 
X151 ,X215 ,C046 ,S046 ,512 
X152,X216,C048 ,S048,512 
K153 ,X217,C050 ,S050, 512 
X154,X218,C052,S052,512 
X155 ,X219,C054,S054,512 
X156 ,X220 ,C056 ,S056 ,512 
X157 ,X221 ,C058,S058,512 
X158 ,X222 ,C060 ,S060 , 512 
X159 ,X223 ,C062 ,S062,512 
X160 ,X224,512 

X161,X225, C066 ,S066 , 512 
X162 ,X226 ,C068 ,S068, 512 
X163 ,X227 ,CO70 ,S070,512 
X164 ,X228,C072,S072,512 
X165 ,X229 ,C074 ,S074,512 
X166 ,X230,C076 ,S076,512 
X167 ,X231 ,C0O78,S078,512 
X168 ,X232 ,C080 ,S080 ,512 
X169 ,X233 ,C082 ,S082,512 
X170 ,X234,C084 ,S084,512 
X171,X235,C086 ,S086 , 512 
X172 ,X236 ,C088 ,S088 ,512 
X173 ,X237 ,C090 ,S090 ,512 
X174,,X238 ,C092 ,S092,512 
X175 ,X239,C094,S094,512 
X176 ,X240,512 

X177 ,X241 ,C098 ,S098 ,512 
X178 ,X242 ,C100 ,S100,512 
X179 ,X243 ,C102,S102,512 
X180 ,X244,C104,S104,512 
X181,X245,C106,S106,512 
X182 ,X246 ,C108,S108, 512 
X183 ,X247 ,C110,S110,512 
X184,X248 ,C112,S8112,512 
X185 ,X249,C114,8114,512 
X186 ,X250,C116,S116,512 
X187 ,X251,C118,S118,512 
X188 ,X252,C120,$120,512 
X189 ,X253 ,C122,$122,512 
X190 ,X254,C124,5124,512 
X191 ,X255,C126,S126,512 


FFT CODE FOR STAGE 8 


ZEROI 

BTRFLI 
BIRFLI 
BTRFLI 


X000 ,X128,512 

X001 ,X129,C001 ,S001 ,512 
X002 ,X130 ,C002 ,S002 ,512 
X003 ,X131,C003 ,S003,512 


ra | 
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BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BIRFLI 
BIRFLI 
BTRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 


X004,,X132,C004,S004,512 
X005 ,X133 ,C005 ,S005 ,512 
X006 ,X134,C006 ,S006,512 
X007 ,X135 ,C007 ,S007 ,512 
X008 ,X136 ,C008,S008, 512 
K009 ,X137 ,C009,S009,512 
X010,X138,C010,S010,512 
X011,X139,C011,$011,512 
X012,X140,C012,5012,512 
X013 ,X141,C013,$013,512 
X014,X142,C0014,S014,512 
X015 ,X143,C015,S015,512 
X016 ,X144,C016,S016,512 
X017,X145 ,C017,S017,512 
X018,X146,C018,S018,512 
X019 ,X147,C019,S019,512 
K020 ,X148,C020,S020,512 
X021 ,X149,C021,S021,512 
X022,,X150,C022,5022,512 
X023,X151,C023 ,S023,512 
X024 , X152,C024,S024,512 
X025 ,X153 ,C025,S025 ,512 
X026 ,X154,C026 ,S026,512 
X027 ,X155 ,C027,S027,512 
X028 ,, X156,C028,S028,512 
X029,, X157,C029,S029,512 
X030 ,X158,C030,S030,512 
X031,X159,C031,5031,512 
X032,X160,512 

X033, X161 ,C033,S033,512 
X034, X162 ,C034,5034,512 
X035 ,X163,C035,S035,512 
X036 ,X164,C036 ,S036,512 
X037 , X165 ,C037,S037,512 
X038 , X166 ,C038,S038,512 
X039 , X167 ,C039,S039,512 
X040 , X168, C040 ,S040,512 
X041 ,X169,C041,S041 ,512 
X042 ,,X170, C042 ,S042,512 
X043 ,X171 ,C043 ,S043,512 
X044 ,X172 ,C044,8044,512 
X045 ,X173,C045,S045 ,512 
X046 ,X174, C046 ,S046 ,512 
X047 , X175 ,C047 ,S047 ,512 
X048 ,X176 ,C048,S048 ,512 
X049, X177 ,C049 ,S049,512 
X050 ,X178,C050,S050,512 
X051 ,X179,C051,S051,512 
X052,X180 ,C052,S052,512 
X053 ,X181 ,C053,S053 ,512 
X054,X182 ,C054,5054,512 
X055 ,X183,C055,S055,512 
X056 ,X184,C056 ,S056,512 
X057 ,X185 ,C057,S057 ,512 
X058 ,X186 ,C058,S058,512 
X059 ,X187 ,C059,S059,512 
X060 ,X188, C060 ,S060,512 
X061 ,X189 ,C061,S061,512 
X062 ,X190, C062 ,S062 ,512 
X063 ,X191 ,C063,5063 ,512 


PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I1 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 


X064,X192,512 

X065,X193,C065,S065,512 
X066 ,X194,C066,S066,512 
X067 ,X19S ,C067,S067,512 
X068 ,X196,C068,S068,512 
X069 ,X197 ,C069,S069,512 
X070 ,X198,C070,S070,512 
X071,X199,C071,S071 ,512 
X072 ,X200,C072,5072,512 
X073 ,X201 ,C073,S073,512 
X074 ,X202,C074,S074,512 
X075 ,X203,C075,S075,512 
X076 ,X204,C076,5076,512 
X077 ,X205 ,C077,S077 ,512 
X078 ,X206 ,C078,S078,512 
X079 ,X207 ,C079,S079 ,512 
X080 ,X208, C080 ,S080,512 
X081 ,X209,C081,S081 ,512 
X082 ,X210, C082 ,S082,512 
X083 ,X211,C083 ,S083,512 
X084 ,X212 ,C084,S5084,512 
X085 ,X213 ,C085,5085,512 
X086 ,X214, C086 ,S086,512 
X087 ,X215 , C087 ,S087 ,512 
X088 ,X216, C088 ,S088 ,512 
X089 ,X217 ,C089 ,S089,512 
X090 ,X218,C090,S090,512 
X091,X219,C091,S091,512 
X092 ,X220 ,C092,S092,512 
X093 ,X221 ,C093 ,S093,512 
K094 ,X222 ,C094,S094,512 
X095 ,X223 ,C095,S095,512 
X096 ,X224,512 

X097 ,X225 ,C097 ,S097 ,512 
X098 ,X226 ,C098,S098,512 
X099 ,X227 ,C099 ,S099,512 
X100 ,X228,C100,S100,512 
X101,X229,C101,S101,512 
X102 ,X230,C102,8102,512 
X103 ,X231,C103,S103,512 
X104 ,X232,C104,S8104,512 
X105 ,X233,C105,S105,512 
X106 ,X234,C106,5106,512 
X107 ,X235,C107,S107,512 
X108 ,X236 ,C108,S5108,512 
X109 ,X237 ,C109,S109,512 
X110,X238,C110,5110,512 
X111,X239,C111,8111,512 
K112,X%240 ,C112,5112,512 
K113 ,X241 ,C113,5113,512 
X114,X242 ,C114,5114,512 
X115 ,X243,C115,S115,512 
X116 ,X244,C116,5116,512 
X117,X245,C117,$117,512 
X118 ,X246,C118,S118,512 
X119 ,X247,C119,5119,512 
X120 ,X248,C120,5120,512 
X121,X249,C121,5121,512 
X122 ,X250,C122,5122,512 
X123 ,X251,C123,5123,512 


BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
END 


X124,X252,C124,5124,512 
X125 ,X253,C125,S$125,512 
X126 ,X254,C126,S$126,512 
X127 ,X255,C127,5127,512 
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APPENDIX OD 


IDT ‘FT1024' 
HRA IIIA IKK IIIA III IAA IER II AIAISI IA AIR IIA AAI AAA ERISA 


A 1024-POINT RADIX-2 DIT COMPLEX FFT FOR THE TMS32020 


MACROS LISTED IN APPENDIX B 


+e ee ee FH 


x 
* 
* 
THE FOLLOWING FILE RAD2FFT.MAC CONSISTS OF ALL THE * 
* 
* 


RERKKKEKKEREKEKEKREREKKEKERKKKEKREEKRKEEKKEKEEKEREKEKKKKRKKEKREKKKKKK KK 
x 


COPY RAD2FFT .MAC 
* 
IIKAKKIIIIK IIH IIAI IIIA AAAI IIIIIIS IIA IAI ISIS AAI IIS AI AAI AIAAI IA 
* x 
x DATA MEMORY MAP FOR PAGES 4, 5, 6 AND 7 (BLOCKS BO,Bl) x 
* * 


RKEKEKKKKEEKKEKEKKEEKKEKEREREKEKERKKEKERKKEKKEKEREKREKEKKEKEKEKEEKK 


DORG 0 

* 

* DATA MEMORY PAGE 4 (STARTING ADDRESS 512 OR >200) 
x 

x000 DATA 0,0 
x001 DATA 0,0 
X002 DATA 0,0 
X003 DATA 0,0 
X004 DATA 0,0 
X005 DATA 0,0 
X006 DATA 0,0 
Xx007 DATA 0,0 
x008 DATA 0,0 
x009 DATA 0,0 
x010 DATA 0,0 
x011 DATA 0,0 
X012 DATA 0,0 
x013 DATA 0,0 
X014 DATA 0,0 
K015 DATA 0,0 
x016 DATA 0,0 
X017 DATA 0,0 
x018 DATA 0,0 
X019 DATA 0,0 
X020 DATA 0,0 
x021 DATA 0,0 
X022 DATA 0,0 
X023 DATA 0,0 
X024 DATA 0,0 
X025 DATA 0,0 
X026 DATA 0,0 
K027 DATA 0,0 
x028 DATA 0,0 
X029 DATA 0,0 
X030 DATA 0,0 
X031 DATA 0,0 
X032 DATA 0,0 
X033 DATA 0,0 


NOILVINANATdWI Lad Lid 7-XIdVua ‘LNIOd-vZ01 V 


d XIGNHddV 


(cal 
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DATA 0,0 x091 DATA 0,0 
DATA 0,0 X092 DATA 0,0 
DATA 0,0 X093 DATA 0,0 
DATA 0,0 X094 DATA 0,0 
DATA 0,0 x095 DATA 0,0 
DATA 0,0 X096 DATA 0,0 
DATA 0,0 X097 DATA 0,0 
DATA 0,0 x098 DATA 0,0 
DATA 0,0 X099 DATA 0,0 
DATA 0,0 X100 DATA 0,0 
DATA 0,0 X101 DATA 0,0 
DATA 0,0 X102 DATA 0,0 
DATA 0,0 X103 DATA 0,0 
DATA 0,0 104 DATA 0,0 
DATA 0,0 X105 DATA 0,0 
DATA 0,0 X106 DATA 0,0 
DATA 0,0 X107 DATA 0,0 
DATA 0,0 X108 DATA 0,0 
DATA 0,0 X109 DATA 0,0 
DATA 0,0 X110 DATA 0,0 
DATA 0,0 X111 DATA 0,0 
DATA 0,0 X112 DATA 0,0 
DATA 0,0 X113 DATA 0,0 
DATA 0,0 X114 DATA 0,0 
DATA 0,0 X115 DATA 0,0 
DATA 0,0 X116 DATA 0,0 
DATA 0,0 X117 DATA 0,0 
DATA 0,0 X118 DATA 0,0 
DATA 0,0 X119 DATA 0,0 
DATA 0,0 X120 DATA 0,0 
X121 DATA 0,0 
DATA MEMORY PAGE 5 (STARTING ADDRESS 640 OR >280) X122 DATA 0,0 
X123 DATA 0,0 
DATA 0,0 X124 DATA 0,0 
DATA 0,0 X125 DATA 0,0 
DATA 0,0 X126 DATA 0,0 
DATA 0,0 X127 DATA 0,0 
DATA 0,0 x 
DATA 0,0 x DATA MEMORY PAGE 6 (STARTING ADDRESS 768 OR >300) 
DATA 0,0 x 
DATA 0,0 X128 DATA 0,0 
DATA 0,0 x129 DATA 0,0 
DATA 0,0 X130 DATA 0,0 
DATA 0,0 X131 DATA 0,0 
DATA 0,0 X132 DATA 0,0 
DATA 0,0 X133 DATA 0,0 
DATA 0,0 X134 DATA 0,0 
DATA 0,0 X135 DATA 0,0 
DATA 0,0 X136 DATA 0,0 
DATA 0,0 X137 DATA 0,0 
DATA 0,0 X138 DATA 0,0 
DATA 0,0 X139 DATA 0,0 
DATA 0,0 X140 DATA 0,0 
DATA 0,0 X141 DATA 0,0 
DATA 0,0 X142 DATA 0,0 
DATA 0,0 X143 DATA 0,0 
DATA 0,0 X144 DATA 0,0 
DATA 0,0 X145 DATA 0,0 
DATA 0,0 X146 DATA 0,0 
DATA 0,0 X147 DATA 0,0 
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X148 DATA 0,0 X205 DATA 0,0 
X149 DATA 0,0 X206 DATA 0,0 
X150 DATA 0,0 X207 DATA 0,0 
X151 DATA 0,0 X208 DATA 0,0 
X152 DATA 0,0 X209 DATA 0,0 
X153 DATA 0,0 X210 DATA 0,0 
X154 DATA 0,0 X211 DATA 0,0 
X155 DATA 0,0 X212 DATA 0,0 
X156 DATA 0,0 X213 DATA 0,0 
X157 DATA 0,0 X214 DATA 0,0 
X158 DATA 0,0 X215 DATA 0,0 
X153 DATA 0,0 X216 DATA 0,0 
X160 DATA 0,0 X217 DATA 0,0 
X161 DATA 0,0 X218 DATA 0,0 
X162 DATA 0,0 X219 DATA 0,0 
X163 DATA 0,0 X220 DATA 0,0 
X164 DATA 0,0 X221 DATA 0,0 
X165 DATA 0,0 X222 DATA 0,0 
X166 DATA 0,0 X223 DATA 0,0 
X167 DATA 0,0 X224 DATA 0,0 
X168 DATA 0,0 X225 DATA 0,0 
X169 DATA 0,0 X226 DATA 0,0 
X170 DATA 0,0 X227 DATA 0,0 
X171 DATA 0,0 X228 DATA 0,0 
X172 DATA 0,0 X229 DATA 0,0 
X173 DATA 0,0 X230 DATA 0,0 
X174 DATA 0,0 X231 DATA 0,0 
X175 DATA 0,0 X232 DATA 0,0 
X176 DATA 0,0 X233 DATA 0,0 
X177 DATA 0,0 X234 DATA 0,0 
X178 DATA 0,0 X235 DATA 0,0 
X179 DATA 0,0 X236 DATA 0,0 
X180 DATA 0,0 X237 DATA 0,0 
X181 DATA 0,0 X238 DATA 0,0 
X182 DATA 0,0 X239 DATA 0,0 
X183 DATA 0,0 X240 DATA 0,0 
X184 DATA 0,0 X241 DATA 0,0 
X185 DATA 0,0 X242 DATA 0,0 
X186 DATA 0,0 X243 DATA 0,0 
X187 DATA 0,0 X%244 DATA 0,0 
X188 DATA 0,0 X245 DATA 0,0 
X189 DATA 0,0 X246 DATA 0,0 
X190 DATA 0,0 X247 DATA 0,0 
X191 DATA 0,0 X248 DATA 0,0 
* X249 DATA 0,0 
x DATA MEMORY PAGE 7 (STARTING ADDRESS 896 OR >380) X250 DATA 0,0 
* X251 DATA 0,0 
X192 DATA 0,0 X252 DATA 0,0 
X193 DATA 0,0 X253 DATA 0,0 
X194 DATA 0,0 X254 DATA 0,0 
X195 DATA 0,0 X255 DATA 0,0 
X196 DATA 0,0 * 

X197 DATA 0,0 HIKHKKIKIIKER IIIA IRIE RIKER ERIKA RERERERERE REE EEREKER KEK 
X198 DATA 0,0 * k 
X199 DATA 0,0 * DATA LOCATION IN BLOCK B2 FOR W=COS(45) OR SIN(45) * 
X200 DATA 0,0 * * 
X201 DATA 0,0 HIKHKIKIRIKK RIA KERR AKIRA IKERR ERE KERR REREREREREREERIEREERIK 
X202 DATA 0,0 * 

X203 DATA 0,0 DORG 96 
X204 DATA 0,0 W DATA 0 


vel 
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* 


KKKKKKKKKKKKKEKKKKRAKRKKKEKEEKIEKERKKEKEKERKEKKAKKKKKKKKKKKKKKKKK 


* 
* 
* 


KAKKKKKEKKKKAKKKEKKEKRARKERAKEKREKKEKEREKERKEKKKKKKKKKKK KK KKK KKK KK 


* 
c000 
cool 
co002 
C003 
c004 
coos 
C006 
C007 
c008 
c009 
co10 
coll 
c012 
c013 
co14 
cols 
co16 
c017 
cols 
cols 
c020 
c021 
c022 
c023 
c024 
c025 
c026 
C027 
C028 
c029 
C030 
C031 
C032 
C033 
C034 
C035 
C036 
C037 
C038 
C039 
c040 
c041 
C042 
C043 
C044 
c045 
C046 
C047 
C048 
c049 
co50 
co51 
C052 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


13-BIT TWIDDLE FACTORS FOR 256-POINT COMPLEX FFT 


4095 
4094 
4091 
4085 
4076 
4065 
4052 
4036 
4017 
3996 
3973 
3948 
3920 
3889 
3857 
3822 
3784 
3745 
3703 
3659 
3612 
3564 
3513 
3461 
3406 
3349 
3290 
3229 
3166 
3102 
3035 
2967 
2896 
2824 
2751 
2675 
2598 
2520 
2440 
2359 
2276 
2191 
2106 
2019 
1931 
1842 
1751 
1660 
1567 
1474 
1380 
1285 
1189 


c053 
C054 
co55 
C056 
C057 
co58 
cos9 
c060 
c061 
C062 
C063 
C064 
c065 
C066 
C067 
C068 
C069 
C070 
C071 
C072 
C073 
C074 
C075 
C076 
C077 
c078 
c079 
coso 
co8l 
C082 
c083 
C084 
c085s 
C086 
C087 
C088 
C089 
co90 
cog1 
C092 
c093 
C094 
cogs 
C096 
C097 
c098 
co99 
C100 
C101 
C102 
C103 
C104 
C105 
C106 
C107 
C108 
C109 
C110 
Ccl1lil 
C1l2 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
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SZl 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


1092 
1189 
1285 
1380 
1474 
1567 
1660 
1751 
1842 
1931 
2019 
2106 
2191 
2276 
2359 
2440 
2520 
2598 
2675 
2751 
2824 
2896 
2967 
3035 
3102 
3166 
3229 
3290 
3349 
3406 
3461 
3513 
3564 


S044 
$045 
S046 
$047 
$048 
S049 
$050 
S051 
S052 
S053 
S054 
S055 
S056 
$057 
$058 
S059 
S060 
S061 
$062 
S063 
S064 
S065 
S066 
S067 
$068 
S069 
$070 
$071 
S072 
$073 
S074 
$075 
S076 
$077 
S078 
S079 
$080 
S081 
S082 
S083 
S084 
S085 
S086 
S087 
S088 
S089 
S090 
$091 
S092 
$093 
5094 
S095 
S096 
$097 
S098 
$099 
$100 
$101 
$102 
$103 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


3612 
3659 
3703 
3745 
3784 
3822 
3857 
3889 
3920 
3948 
3973 
3996 
4017 
4036 
4052 
4065 
4076 
4085 
4091 
4094 
4095 
4094 
4091 
4085 
4076 
4065 
4052 
4036 
4017 
3996 
3973 
3948 
3920 
3889 
3857 
3822 
3784 
3745 
3703 
3659 
3612 
3564 
3513 
3461 
3406 
3349 
3290 
3229 
3166 
3102 
3035 
2967 
2896 
2824 
2751 
2675 
2598 
2520 
2440 
2359 
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* 


KRIEKKKEKKKKRKEKRKEKEKKEKKERKEKAKEKEREKEKEKRERERKKRKAKK KKK KAKA RK 


x 
x 
* 


KKKKKKKKKKKKKEKEKEKKEKREAKEKKEREKKKAKRAKKKKKKKKK KKK RAKE 


* 
KNL256 


EQU 


AORG 


DATA 


2276 
2191 
2106 
2019 
1931 
1842 
1751 
1660 
1567 
1474 
1380 
1285 
1189 
1092 


>20 


>SA82 


SYSTEM INITIALIZATION 


SPM 

CNFD 
ROVM 
SSXM 
LARP 


256-POINT FFT KERNEL - STAGES 1 AND 2 


LDPK 

COMBOI 
COMBOI 
COMBOT 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 


0 


ARO 
ARO,W 
WVAL 
*,AR1 
FT1024 


4 

X000 , X001, X002,X003 ,0 
X004 ,X005, X006 , X007 ,0 
X008 , X009,X010,X011,0 
X012,X013,X014,X015,0 
X016,X017,X018,X019,0 
X020 ,X021 , X022,X023,0 
X024,X025, X026 ,X027 ,0 
X028 ,X029, X030 ,X031,0 


COMBOI 
COMBOI 
COMBOI 
COMBOT 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
LDPK 

COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
LDPK 

COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOT 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
LDPK 

COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOTI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 
COMBOI 


X032,X033 ,X034,X035,0 
X036 , X037 , X038, X039,0 
X040 , X041 ,X042, X043 ,0 
X044, X045 , X046 , X047 ,0 
X048 , X049, X050,X051,0 
X052,X053,X054,xX055,0 
X056 , X057, X058,X059,0 
X060, X061,X062,X063 ,0 

5 

X064,, X065 , X066, X067 ,128 
X068 , X069 , X070, K071,128 
X072 ,X073 ,X074,K075,128 
X076 ,X077 ,X078,X079 ,128 
X080 , X081 ,X082, X083,128 
X084, X085 , X086 , X087 ,128 
X088 , X089 ,, X090, X091,128 
X092 ,X093 ,X094,X095,128 
X096 , X097 , X098 , X099,128 
X100,X101,X102,X103,128 
X104,X105,X106,X107,128 
X108,X109,X110,X111,128 
X112,X113,X114,X115,128 
X116,X117,X118,K119,128 
X120,X121,X122,X123,128 
X124,X%125 ,X126,X127,128 
6 

X128,X129 ,X130,X131, 256 
X132 ,X133 ,X134,X135,256 
X136 ,X137 ,X138,X139, 256 
X140,X141 ,X142,X143, 256 
X144,X145 ,X146,X147,256 
X148 ,X149,X150,X151, 256 
X152,%153,X154,XK155, 256 
X156 ,X157,X158,X159, 256 
X160,X161 ,X162,X163, 256 
X164,X165 ,X166,X167, 256 
X168,X169 ,X170,X171, 256 
X172,X173 ,X174,X175, 256 
X176 ,X177 ,X178,X179, 256 
X180 ,X181 ,X182,X183 ,256 
X184,X185 ,X186,X187, 256 
X188 ,X189,X190,X191, 256 
7 
X192,X193,X194,X195 , 384 
X196,X197 ,X198,X199, 384 
X200 ,X201 ,X202,X203,384 
X204 ,X205 ,X206,X207 , 384 
X208 , X209 ,X210,X211,384 
X212,X213,X214,K215 ,384 
X216 ,X217,X218,X219 , 384 
X220 ,X221 ,X222 ,X223 , 384 
X224 ,X225 ,X226 ,X227 , 384 
X228 ,X229 ,X230,X231 ,384 
X232 ,X233 ,X234,X235 , 384 
X236 , X237 ,X238,X239 , 384 
X240 ,X241 ,X242,X243 , 384 
X244 ,X245 ,X246 , X247 , 384 
X248 ,X249 ,X250,X251 , 384 
X252 ,X253 ,X254,X255 , 384 
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KKKKAKKKEKKKKKEKEKKRKEKREKKRKKEKEREKERKERKKKEKEKEKKKKAKKKK KKK 


bad 
x 
* 


256-POINT FFT KERNEL - STAGE 3 


x 
x 
x 


KAKKKKKKKKKKEEKKKRERAKKKEKEKKEKEKKKKKEKKERKKKEKKKEREKKEKREKREEKEKKKKKKK 


* 


ZEROI 
PBY4I 
PBY2I 
P3BY41 
ZEROI 
PBY4I1 
PBY2I 
P3BY4I1 
ZEROI 
PBY4I 
PBY2I 
P3BY41 
ZEROI 
PBY41 
PBY2TI 
P3BY41 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY41I 
ZEROI 
PBY4I 
PBY2I 
P3BY41 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 


X000 , X004,512 
X001,X005,512 
X002,X006,512 
X003 ,X007,512 
X008,X012,512 
X009 ,X013,512 
X010,X014,512 
X011,X015,512 
X016,X020,512 
X017,X021,512 
K018,X022,512 
X019,X023,512 
X024 ,X028,512 
X025 ,X029,512 
X026 ,X030,512 
X027,X031,512 
X032 ,X036,512 
X033 ,X037,512 
X034,X038,512 
X035 ,X039,512 
X040 , X044,512 
X041 ,X045,512 
X042 ,X046,512 
X043 ,X047 ,512 
X048 ,X052,512 
X049,X053,512 
X050 ,X054,512 
X051,X055,512 
X056 ,X060,512 
X057 ,X061,512 
X058 ,X062,512 
X059,X063,512 
X064, X068,512 
X065 ,X069,512 
X066 ,X070,512 
X067 ,X071,512 
X072 ,X076,512 
X073 ,X077 ,512 
X074 ,X078,512 
X075 ,X079,512 
X080 ,X084,512 
X081,X085 ,512 
X082 , X086 ,512 
X083 ,X087 ,512 
X088 ,X092,512 
X089 ,X093 ,512 
X090 ,X094,512 
X091,X095,512 
X096 ,X100,512 
X097 ,X101,512 
X098 ,X102,512 
X099,X103,512 
X104,X108,512 
X105,X109,512 


PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY41 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY41I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I1 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 


X106 ,X110,512 
X107,X111,512 
X112,X116,512 
X113,X117,512 
X114,X118,512 
X115,X119,512 
X120 ,X124,512 
X121,X125 ,512 
X122,X126 ,512 
X123,X127,512 
X128,X132,512 
X¥129,X133,512 
X130,X134,512 
X131,X135,512 
X136 ,X140,512 
X137,X141,512 
X138,X142,512 
X139,X143,512 
X144,X148, 512 
X145 ,X149,512 
X146 ,X150,512 
X147,X151,512 
%152,X156,512 
X153,X157,512 
X154,X158,512 
X155,X159,512 
X160 ,X164,512 
X161,X165,512 
X162,X166,512 
X163 ,X167,512 
X168,X172,512 
X169 ,X173,512 
X170,X174,512 
X171,X175,512 
X176,X180 512 
X177,X181 ,512 
X178,X182,512 
X179, X183,512 
X184,X188,512 
X185 ,X189,512 
X186,X190,512 
X187,X191,512 
X192,X196,512 
X193,X197,512 
X194,X198,512 
X195,X199,512 
X200 ,X204,512 
X201,X205,512 
X202,X206,512 
X203 ,X207, 512 
X208 ,X212,512 
X209,X213,512 
X210,X214,512 
X211,X215,512 
X216,X220,512 
X217,X221,512 
X218,X222,512 
X219,X223,512 
X224,X228,512 
X225,X229,512 
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PBY21 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 
ZEROI 
PBY4I 
PBY2I 
P3BY4I 


%226 ,X230 ,512 
X227 ,X231,512 
X%232,X236,512 
X%233 ,X237,512 
X234,X238,512 
X%235 ,X239,512 
X240 ,X244,512 
X241 ,X245,512 
X242 ,X246 ,512 
X243 ,X247 ,512 
X248,X252,512 
X249 ,X253,512 
X250,X254,512 
X251,X255 ,512 


RAEKKKKKKEKRKEKKKKKERERKRERKAEKKEREEKERKEKEREKEKKEREKRRKAKKKKKKK ARK KKK 


* 
* 
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256-POINT FFT KERNEL - STAGE 4 


* 
* 
*x 


KHKKKKKKEKEKKKRKKKKKKKEKKKKEKKKAKKEKREKKKERKKEKKKKKRERKEKRKKKKKKAKKKK 


* 


ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROT 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 


X000 ,X008 ,512 
X001,X009,C016,S016,512 
X002,X010,512 
X003,,X011 ,C048,5S048 ,512 
X004,X012,512 
X005 ,X013 ,C080,S080 ,512 
X006 ,X014,512 
X007 ,X015,C112,$112,512 
X016 ,X024,512 
X017,X025,C016,S016,512 
X018,X026,512 
X019,X027 ,C048,S048 ,512 
X020 ,X028,512 
X021,X029,C080,S080,512 
X022,X030,512 
X023,X031,C112,5112,512 
X032 ,X040 ,512 
X033,X041,C016,S016,512 
X034,X042,512 
X035 ,X043 ,C048 ,S048, 512 
X036 ,X044,512 
X037,X045 ,C080 ,S080, 512 
X038,X046 ,512 
X039 ,X047,C112,S112,512 
X048 ,X056 ,512 
X049,,X057,C016,S016,512 
X050,X058,512 
X051,X059,C048,S048,512 
X052,X060,512 
X053 ,X061,C080,S080, 512 
X054,X062,512 
X055,X063,C112,5112,512 
%064 ,X072 ,512 
X065 ,X073,C016,S016,512 
X066 ,X074,512 
X067 ,X075 ,C048 ,S048 ,512 
X068 ,X076,512 
X069 ,X077 ,C080,S080, 512 
%070,X078,512 


BTRFLI 
ZEROI 
BTRFLI 
PBY41 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BIRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 


X071 ,X079,C112,S112,512 
X080 , X088, 512 
X081 ,X089,C016,S016,512 
X082 ,X090,512 
X083 ,X091 ,C048,S048, 512 
X084 ,X092,512 
X085 , X093,C080,S080, 512 
X086 ,X094,512 
X087 ,X095 ,C112,$112,512 
X096 ,X104,512 
X097 ,X105,C016,S016,512 
X098 ,K106,512 
X099 ,X107 ,C048,S048 ,512 
X100,X108,512 
X101,X109,C080,S080, 512 
X102,X110,512° 
X103 ,X111,€112,8112,512 
X112,X120,512 
X113,X121,C016,S016,512 
X114,%122,512 
X115,X123,C048,S048 ,512 
X116,X124,512 
X117,X125,C080, S080, 512 
X118,%126,512 
X119,X127,C112,$112,512 
X128,X136,512 
X129,X137,C016,S016,512 
X¥130,X138,512 
X131,X139,C048 ,S048,512 
X132,X140,512 
X133,,X141,C080 , S080 ,512 
X134,X142,512 
X135,X143,C112,8112,512 
X144,X152,512 
X145,X153,C016 ,S016,512 
X146,X154,512 
X147,X155,C048 ,S048 ,512 
X148,X156,512 
X149,X157,C080,S080 ,512 
X150,X158,512 
X1$1,X%159,C112,8112,512 
X160,X168,512 
X161,X169,C016,S016,512 
X162,X176,512 
X163 ,X171,C048,S048 ,512 
X164,X172,512 
X165,X173,C080,S080,512 
X166,X174,512 
X167 ,X175,C112,S112,512 
X176,X184,512 
X177,%185 ,C016,S016,512 
X178,X186,512 
X179,X187,C048 ,S048 ,512 
X180 ,X188,512 
X181 ,X189, C080 ,S080, 512 
X182,X190,512 
X183 ,X191,C112,5112,512 
X192,X200,512 
X193 ,X201 ,C016,S016,512 
X194,X202,512 
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BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY41 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 
ZEROI 
BTRFLI 
PBY4I 
BTRFLI 
PBY2I 
BTRFLI 
P3BY4I 
BTRFLI 


X195 ,X203 , C048 ,S048,512 
K196 ,X204, 512 
X197,X205, C080 ,S080,512 
X198 ,X206, 512 
X199 ,X207,C112,$112,512 
X208 ,X216,512 
X209 ,X217,C016,S016,512 
X210,X218,512 
X211,X%219, C048 ,S048,512 
X212,X%220,512 
X213,X221,C080,S080,512 
X214,X%222,512 
X215 ,X223,C112,8112,512 
X224,X232,512 
X225 ,X233 ,C016,S016,512 
X226 ,X234,512 
X227 ,X235 ,C048,S048,512 
X228 ,X236, 512 
X229 ,X237 , C080 ,S080,512 
X230 ,X238,512 
X231,X239,C112,5112,512 
X240 ,X248 ,512 
X241,X249,C016,5016,512 
X242,X250,512 
X243 ,X251 , C048 ,S048,512 
X244,X252,512 
X245 ,X253 ,C080,S080,512 
X246 ,X254,512 
X247 ,X255,C112,5112,512 


KKKKKKKEKKEEKKEKKEKKEKREKEKEEREKEKEREREEEKRRKKRAREREKKEKKKKKKKK 


x 
k 
k 


256-POINT FFT KERNEL - STAGE 5 


* 
x 
x 


RAKKKKKKKEKKKEKKEKEKEKKKREKEKKEKRKERKEKKAKERKEREERREREREEEKKKKKKKKKKK 


k 


ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I1 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BIRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 


X000 ,X016, 512 

X001, X017,C008,S008,512 
X002,X018,C016,S016,512 
X003 ,X019,C024,5024,512 
X004 ,X020,512 

X005 ,X021, C040 ,S040,512 
X006 ,X022 ,C048,S048,512 
X007 ,X023 ,C056,S056,512 
X008 ,X024, 512 

X009 , X025 ,C072,S072,512 
X010, X026, C080 ,S080 ,512 
X011 ,X027, C088 ,S088 ,512 
X012,X028,512 

X013,X029,C104,S104,512 
X014,X030,C112,$112,512 
X015, X031,C120,8120,512 
X032,X048,512 

X033 ,X049 ,C008,S008,512 
X034,X050,C016,S016,512 
X035,X051,C024,S024,512 
X036,X052,512 

X037 ,X053 ,C040,S040,512 
X038 ,X054,C048,S048 ,512 
X039,X055,C056,S056,512 


PBY2I 

BTRFLI 
BTRFLI 
BIRFLI 
P3BY4I1 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
PBY21I 

BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BIRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BIRFLI 


X040 , X056 ,512 

X041 ,X057,C072,S072,512 
X042 ,X058 ,C080,S080 , 512 
X043 ,X059,C088,S088 ,512 
X044,X060 ,512 

X045 ,X061,C104,S104,512 
X046 ,X062,C112,5112,512 
X047 ,K063 ,C120,S120,512 
X064,X080 ,512 

X065 ,X081 ,C008 ,S008,512 
X066 ,X082 ,C016,S016,512 
X067 ,X083 ,C024,S024,512 
X068 ,X084,512 

X069 ,X085 ,C040 ,S040,512 
X070 , X086 , C048 ,S048 ,512 
X071 ,X087 ,C056,S056,512 
X072,X088,512 

X073 ,X089 ,C072,S072,512 
X074,X090 ,C080,S080,512 
X075 ,X091,C088,S088,512 
X076 ,X092,512 

X077 , X093 ,C104,S104,512 
X078 ,X094,C112,S$112,512 
X079 ,X095,C120,8120,512 
X096 ,X112,512 

X097 ,X113 ,C008,S008,512 
X098,X114,C0016,S016,512 
X099,X115,C024,S024,512 
X100,X116,512 

X101,X117 ,C040,S040 ,512 
X102,%118,C048 ,S0#8 ,512 
X103 ,KX119,C056,S056 ,512 
X104,X120,512 

X105,X121 ,C072,S072,512 
X106 ,X122 ,C080,S080 ,512 
X107 ,X123 ,C088,S088 ,512 
X108 ,X124,512 

X109,X125,C104,S104,512 
X110,X126 ,C112,$112,512 
X111,X127,C120,$120,512 
X128,X144,512 

X129,X145 ,C008,S008,512 
X130,X146 ,C016,S016,512 
X131,X147 ,C024,S024,512 
X132,X148,512 

X133 ,X149 ,C040,S040,512 
X134,X150,C048,S048,512 
X135,X151,C056,S056,512 
X136,X152,512 

X137 ,X153 ,C072,S072 ,512 
X138 ,X154,C080 ,S080 ,512 
X139,X155 ,C088,S088,512 
X140 ,K156,512 

X141 ,X157,C104,S104,512 
X142,X158,C112,$112,512 
X143 ,X159,C120,8120,512 
X160 ,X176,512 

X161,%177 ,C008,S008 ,512 
X162 ,K178,C016,S016,512 
X163 ,X179 ,C024,S024,512 


oc! 
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PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTIRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 


X164,X180,512 

X165,X181 ,C040 ,S040, 512 
X166 ,X182,C048 ,S048,512 
X167,X183 ,CO056 ,S056 ,512 
X168,X184,512 

X169,X185 ,C072,S072,512 
X170,X186 ,C080 ,S080 , 512 
X171, X187 ,C088 ,S088 , 512 
X172,X188,512 

X173,X189,C104,S104 , 512 
X174,X190,C112,5112,512 
X175,X191,C120,S120,512 
X192,X208,512 

X193 ,X209 ,C008 ,S008 ,512 
X194,X%210,C016,S016 , 512 
X195,X211,C024,S024,512 
X196 ,X212,512 

X197 ,X213,C040,S040,512 
X198 ,X214,C€048,S048,512 
X199,X215,C056,S056 ,512 
X200 ,X216,512 

X201 ,X217 ,C072,S072 ,512 
X202,X218, C080 ,S080 , 512 
X203 ,X219, C088 ,S088 ,512 
X204 ,X220,512 

X205 ,X221,€104,S104,512 
X206 ,X222,C112,5112,512 
X207 ,X223,C120,8120,512 
X224 ,X240,512 

X225 ,X241 ,C008 ,S008 ,512 
X226 ,X242,C016,S016,512 
X227 ,X243 ,C024,,S024 512 
X228 ,X244,512 

X229 ,X245 ,C040 ,S040 ,512 
X230 ,X246,C048 ,S048 ,512 
X231,X247 ,C056,S056 ,512 
X232,X248,512 

X233 ,X249 ,C072 ,S072 ,512 
X%234,%250,C080 , S080 , 512 
X235 ,X251 ,C088,S088,512 
X236 ,X252,512 

X237 ,X253,C104,8104,512 
X%238 ,X254,C112,8112,512 
X239,X255,C120,S120,512 
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256-POINT FFT KERNEL - STAGE 6 


* 
* 
* 


REKKKKEKKEREREEKKRKKREKEREKEREERERERERERERARKEKEAKREREEKKKKAKRKEKK 


* 


ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 


X000 ,X032,512 

X001 ,X033 ,C004,5004 ,512 
X002 ,X034,C008 ,S008 , 512 
X003 ,X035 ,C012,S012,512 
X004 ,X036 ,C016,S016,512 
X005 , X037,, C020 ,S020 ,512 
%006 ,X038 ,C024,S024,512 
X007 ,X039 ,C028 ,S028,512 
x008 ,X040 ,512 


BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 


X009 , X041 ,CO36 ,S036 , 512 
X010, X042, C040 ,S040,512 
X011,X043,C044,S044, 512 
X012,X044, C048 , $048, 512 
X013,X045,C052,$052,512 
X014, X046,C056,S056,512 
X015,X047,C060,S060,512 
X016 ,X048,512 

X017,X049 , C068 $068,512 
X018,X050,C072,S072, 512 
X019,X051,C076, $076,512 
X020 ,X052,C080, S080, 512 
X021 ,X053, C084, S084, 512 
X022,X054, C088, S088, 512 
X023,X055,C092,$092,512 
X024,X056,512 

X025 ,X057,C100 $100,512 
X026 ,X058,C104,5104,512 
X027,X059,C108,S108, 512 
X028,X060,C112,$112,512 
X029,X061,C116,5116,512 
X030,X062,C120,5120,512 
X031,X063,C124,$124,512 
X064 , X096 , 512 

X065 ,X097 ,C004,S004, 512 
X066 , X098, C008 , S008, 512 
X067 ,X099,C012,S012, 512 
X068 ,X100,C016, $016,512 
X069 ,X101,C020,$020, 512 
X070 ,X102,C024,S024,512 
X071 ,X103,C028,S028, 512 
X072,X104,512 

X073 ,X105 ,C036 ,$036 ,512 
X074,X106 , C040, S040, 512 
X075 ,X107, C044 ,S044, 512 
X076 ,X108, C048, $048, 512 
X077 ,X109,C052,$052, 512 
X078,X110, C056, $056,512 
X079,X111,C060,S060,512 
x080, X112,512 

X081 ,X113 ,C068 ,S068,512 
X082 ,X114,C072,S072,512 
X083 ,X115,C076,S076,512 
X084,X116 , C080, $080, 512 
X085 ,X117, C084, $084, 512 
X086 ,X118, C088, S088, 512 
X087 ,X119, C092, $092,512 
X088 ,X120, 512 

X089 ,X121 ,C100 $100,512 
X090 ,X122,C104,$104, 512 
X091 ,X123,C108,$108, 512 
X092,X124,C112,$112,512 
X093,X125,C116 $116,512 
X094,X126 ,C120,$120,512 
X095 ,X127,C124,S124,512 
X128, X160,512 

X129 ,X161,C004 ,S004, 512 
X130,X162, C008 ,S008,512 
X131,X163,C012,$012,512 
X132,X164,C016,S016,512 
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Tel 


BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
P3BY41 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 


X133,X165,C020,S020,512 
X134,X166,C024,S5024,512 
X135,X167,C028,SC28,512 
X136,X168,512 

X137,X169,C036,S036,512 
X138,X170 ,C040,S040,512 
X139,X171,C044,S044 ,512 
X140 ,X172,€948,S048,512 
X141,X173,C052,S052,512 
X142,X174,C056,S056,512 
X143 ,X175,C060,S060,512 
X144,X176,512 

X145 ,X177 ,C068,S068,512 
X146 ,X178,C072,S072,512 
X147,X179 ,C076,S076,512 
X148, X180,C080,S080,512 
X149,X181 ,C084,S084,512 
X150,X182,C088,S088,512 
X151,X183 ,C092,S092,512 
X152,X184,512 

X153,X185,C100,S100,512 
X154,X186,C104,S104,512 
X155, X187,C108,S108,512 
X156,X188,C112,S5112,512 
X157 ,X189,C116,S116,512 
X158,X190,C120,S120,512 
X159,X191,C124,5124,512 
X192 ,X224,512 

X193,X225,C004,S004,512 
X194,X226 ,C008,S008,512 
X195 ,X227,C012,S012,512 
X196,X228,C016,S016,512 
X197 ,X229,C020,S020,512 
X198, X230,C024,S024,512 
X199 ,X231,C028,S028,512 
X200 , X232,512 

X201 ,X233 ,C036 ,S036,512 
X202 , X234,C040 ,S040,512 
X%203 , X235 ,C044,5044,512 
X204 , X236 ,C048 ,S048 ,512 
X205, X237,C052,S8052,512 
X206 , X238, C056 ,S056,512 
X207 ,X239 ,C060,S060,512 
X208 , X240,512 

X209 ,X241,C068,S5068,512 
X210, X242 ,C072,S072,512 
X211,X243 ,C076,S076,512 
X212,X244,C080,S080,512 
X213,X245 ,C084,S084,512 
X214,X246 ,C088,S088,512 
X215 ,X247 ,C092,S092,512 
X216 ,X248,512 

X217 ,X249 ,C100,S100,512 
X218,X250,C104,S8104,512 
X219,X251,C108,S108,512 
X220 ,X252,C€112,5112,512 
X221 ,,X253 ,C116,S8116,512 
X222,,X254,C120,8120,512 
X223 ,X255,C124,S124,512 
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256-POINT FFT KERNEL - STAGE 7 


* 
* 
* 


RKKKKKKKKKKKKKEKEKEKEREKKEEKERREKREKERAKEKKREKKEKKRKKEKEKEEKKKKKK 


x 


ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTIRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTIRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 


X000 , X064, 512 

X001,X065 ,C002,S002,512 
X002, X066, C004, S004, 512 
X003,X067, C006, S006, 512 
X004, X068, C008 ,S008,512 
X005 ,X069,C010,$010,512 
X006 ,X070 ,C012,$012,512 
X007 , X071,C014,S014,512 
X008 ,X072,C016,S016,512 
X009 ,X073,C018,S018,512 
X010 ,X074,C020,S020,512 
X011,X075,C022,S022,512 
X012,X076,C024,$024,512 
X013,X077,C026,S026,512 
X014,X078,C028,S028,512 
X015,X079,C030,5030,512 
X016,X080,512 

X017,X081 ,C034,5034,512 
X018,X082,C036,S036,512 
X019, X083,C038,S038,512 
X020, X084,C040,S040, 512 
X021 ,X085 ,C042,5042,512 
X022, X086 ,C044,5044,512 
X023,X087 ,C046,S046,512 
X024,X088,C048,S048,512 
X025 ,X089,C050,S050,512 
X026 ; X090 ,C052,5052,512 
X027, X091,C054,$054,512 
X028,X092,C056,5056,512 
X029, X093,C058,S058,512 
X030, X094,C060,S060,512 
X031,X095,C062,S062,512 
X032,X096,512 

X033,X097 , C066,S066,512 
X034, X098, C068, S068, 512 
X035,X099,C070,S070,512 
X036 ,X100,C072,5072,512 
X037,X101,C074,5074,512 
X038,X102,C076,S076,512 
X039,X103,C078,S078,512 
X040, X104, C080 ,S080 ,512 
X041 ,X105,C082,S082,512 
X042,X106 ,C084,S084,512 
X043 ,X107, C086 ,S086 ,512 
X044,X108, C088 ,S088 , 512 
X045 ,X109,C090,S090,512 
X046 ,X110,C092,S092,512 
X047 ,X111,C094,S094,512 
X048 ,X112,512 

X049 ,X113,C098 ,S098,512 
X050 ,X114,C100,$100,512 
X051,X115,C102,$102,512 
X052,X116,C104,$104,512 
X053,X117,C106,5106,512 


cel 
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BIRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
ZEROI 

BTRFLI 
BTRFLI 
BTIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY21I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BTRFLI 


X054,X118,C108,S108,512 
X055 ,X119,C110,5110,512 
X056,X120,C112,S8112,£2 


X057,X121,C114,8114,512- 


X058 ,X122,C€116,S116,512 
X059,X123,C118,5118,512 
X060 ,X124,C120,8120,512 
X061,X125,C122,8122,512 
X062,X126,C124,S8124,512 
X063 ,X127,C126,S126,512 
X128,X192,512 

X129,X193 ,C002,S002,512 
X130,X194,C004,S004, 512 
X131,X195,C006,S006,512 
X132,X196 ,C008,S008 ,512 
X133,X197,C010,S010,512 
X134,X%198,C012,S012,512 
X135,X199,C014,S014,512 
X136,X200,C016,S016,512 
X137,X201,C018,S018,512 
X138 ,X202 ,C020,S020, 512 
X139 ,X203 ,C022,S022,512 
X140 , X204,C024,S024,512 
X141,X205 ,C026 ,S026,512 
X142 ,X206,C028,S028,512 
X143 ,X207 ,C030,S030,512 
X144,X208,512 

X145 ,X209 ,C034,S034,512 
X146 ,X210, C036 ,S036,512 
X147,X211,C038,S038,512 
X148 ,X212,C040,S040,512 
X149 ,X213,C042 ,S042 ,512 
X150 ,X214,C044,S044,512 
X151,X215,C046 ,S046,512 
X152,X216,C048 ,S048,512 
X153,X%217,C050,S050,512 
X154,X218,C052,S052,512 
X155,%219,C054,S054,512 
X156,X220,C056,S056,512 
X157,X221 ,C0$8,S058,512 
X158,X222,C060,S060,512 
X159,X223 ,C062,S062,512 
X160 , X224,512 

X161,X225,C066 ,S066,512 
X162,X226,C068,S068,512 
X163 ,X227,C070,S070,512 
X164,X228,C072,S072,512 
X165 ,X229 ,C074,8074,512 
X166 ,X230 ,CO076 ,S076 ,512 
X167 ,X231,C078,5078,512 
X168,%232,C080,S080,512 
X169 ,X233 ,C082,S082,512 
X170 ,X234,C084,S084,512 
X171,X235 ,C086 ,S086 ,512 
X172,%236 ,C088 ,S088,512 
X173 ,X237 ,C090 ,S090,512 
X174 ,X%238,C092,S092 ,512 
X175 ,X239 ,C094,S094,512 
X176 ,X240,512 

X177 ,X241 ,C098,S098 ,512 


* 


BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BIRFLI 


X178 , X242,C100,S100,512 
X179 ,X243 ,C102,S102,512 
X180 ,X244 ,C104,5104,512 
X181,X245 ,C106,S106 ,512 
X182,X246 ,C108,S108,512 
X183 ,X247,C110,5110,512 
X184,X248 ,C112,5112,512 
X185 ,X249,C114,5114,512 
X186 ,X250,C116,S116,512 
X187,X251,C118,S118,512 
X188 ,X252,C120,S120,512 
X189 ,X253,C122,8122,512 
X190 ,X254,C124,8124,512 
X191,X255,C126,S5126,512 
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256-POINT FFT KERNEL - STAGE 8 


* 
* 
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KAAKKAKKAAKKKKERKKKKKAREKKEKAKKKKREKKKEKKKKAERERERAREKERERAEKKKEKAKR 


x 


ZEROI 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY4I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 


X000 ,X128,512 

X001 ,X129,C001,S001 ,512 
X002 ,X130,C002,5002,512 
X003 ,X131,C003,S003, 512 
X004 ,X132 ,C004,5004,512 
X005 ,X133,C005,S005,512 
X006 ,X134, C006 ,S006 , 512 
X007 ,X135 , C007 ,S007, 512 
X008 ,X136 , C008, S008, 512 
X009 ,X137,C009,S009, 512 
X010,X138,C010,S010,512 
X011,X139,C011,S5011 ,512 
X012,X140,C012,5012,512 
X013 ,X141,C013,S013,512 
X014,X142,C014,S8014,512 
X015,X143,C015,S015,512 
X016 ,X144, C016 ,S016,512 
X017 ,X145 ,C017,S017,512 
X018,X146,C018,S018,512 
X019,X147,C019,S019,512 
X020 ,X148,C020,S020,512 
X021,X149 ,C021,S021,512 
X022,X150,C022,S8022,512 
X023 ,X151 ,C023,S023,512 
X024,X152 ,C024,S024,512 
X025,X153,C025,S8025,512 
X026 ,X154, C026 ,S026 ,512 
X027 ,X155 ,C027 ,S027 ,512 
X028 ,X156 ,C028,S028,512 
X029 ,X157 ,C029,S029 ,512 
X030,X158,C030,S030,512 
X031,X159,C031,S031,512 
X032,X160,512 

X033 ,X161,C033 ,S033 ,512 
X034,X162,C034,S5034,512 
X035 ,X163,C035,5035,512 
X036 ,X164, C036 ,S036 ,512 
X037,X165 , C037 ,S037 ,512 
X038,X166 ,C038,5038,512 
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cel 


BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
PBY2I 

BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
P3BY4I 
BIRFLI 
BTRFLI 


X039,X167,C039,S039,512 
X040 ,X168,C040 ,S040,512 
X041,X169,C041 ,S041,512 
X042 ,X170,C042 ,S042,512 
X043 ,X171,C043 ,S043 ,512 
X044,X172,C044,S044,512 
X045 ,X173 ,C045,S045,512 
X046 ,X174,C046 ,S046 ,512 
X047 ,X175 ,C047 ,S047 ,512 
X048 ,X176 ,C048,S048 ,512 
K049 ,X177 ,C049,S049, 512 
X050,X178,C050,S050,512 
K051,X179,C051,S051,512 
X052,X180 ,C052,S052,512 
X053,X181,C053,S053,512 
X054,X182,C054,S054,512 
X055,,X183 ,C055,S055,512 
X056 ,X184,C056 ,S056,512 
X057,X185 ,C057,S057,512 
X058,X186 ,C058,S058,512 
X059,X187,C059,S059,512 
X060 ,X188,C060,S060,512 
X061,X189,C061,S061,512 
X062,X190 ,C062,S062,512 
X063 ,X191,C063 ,S063,512 
X064,X192,512 

X065 ,X193,C065,S065,512 
X066 ,X194, C066 ,S066, 512 
X067 ,X195,C067 , S067, 512 
X068,X196 ,C068,S068, 512 
X069 ,,X197 ,C069,S069 ,512 
X070,X198,C070,S070,512 
X071,X199,C071,S071 ,512 
X072,X200,C072,S072,512 
X073 ,X2Q1,C073 ,S073 ,512 
X074 ,X202,C074,S074,512 
X075 ,X203 ,C075,S075,512 
X076 ,X204,C076,S076,512 
X077 ,X205 ,C077,S077, 512 
X078 ,X206 ,CO78,S078,512 
X079 ,X207 ,C079,S079, 512 
X080 ,X208,C080 ,S080,512 
X081 ,X209, C081 ,S081 ,512 
X082 ,X210,C082,S082,512 
X083 ,X211,C083,S083,512 
X084,X212 ,C084,S084,512 
X085 ,X213 ,C085 ,S085,512 
X086 ,X214,C086 , S086 , 512 
X087 ,X215 ,C087 ,S087,512 
X088 ,X216,C088 ,S088 ,512 
X089 ,X217 ,C089 ,S089 ,512 
X090 ,X218,C090,S090,512 
X091 ,X219,C091,S091,512 
X092,,X220,C092,S092,512 
X093 ,X221 ,C093 ,S093 ,512 
X094 ,X222 ,C094,5094,512 
X095 ,X223,C095,S095,512 
X096 ,X224,512 

X097 ,X225 ,C097 ,S097 ,512 
X098 ,X226 ,C098,S098 ,512 


BTRFLI 
BTIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BIRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
BTRFLI 
RET 
* 


X099 ,X227 ,C099 ,S099,512 
X100 ,X228,C100,S100,512 
X101,X229,C101,S8101,512 
K102 ,X230,C102,$102,512 
X103 ,X231,C103,S103,512 
X104 ,X232,C104,8104,512 
X105 ,X233,C105,S105,512 
X106 ,X234,C106 ,S106,512 
X107 ,X235,C107,S107,512 
X108 ,X236,C108,S108,512 
X109 ,X237,C109,S109,512 
X110,X238,C110,S8110,512 
X111,X%239,C111,8111,512 
X112,X240,C112,$112,512 
X113 ,X241 ,C113,S113,512 
X114,X%242,C114,8114,512 
X115,X243,C115,$115,512 
K116 ,X244,C116,5116,512 
X117,X245 ,C117,S8117,512 
X118,X246,C118,S118,512 
X119,X247,C119,S119,512 
X120 ,X248,C120,S8120,512 
X121,X249,C121,S121,512 
X122,X250,C122,S122,512 
X123 ,X251,C123,$123,512 
X124,X%252,C124,8124,512 
X125,X253,C125,8125,512 
X126 ,X254,C126,S126,512 
X127 ,X255 ,C127,S127,512 


KREKKKKKKKKAKKKEKKKAKKKEEKEKKEKRKEKEKREKEERKEKKKKKEKEKREKEKKEKRKARKEKK 


+ eH eH eH HH OH 


1024-POINT FFT CODE WITH BIT-REVERSED INPUT SAMPLES 


ALL INPUT REAL AND IMAGINARY DATA POINTS ARE ASSUMED 
TO BE IN CONSECUTIVE LOCATIONS (A TOTAL OF 2048 ) IN 
EXTERNAL DATA MEMORY STARTING FROM LOCATION 1024 IN 
PAGE 8. OUT OF THE 1024 COMPLEX POINTS, THERE ARE 
ALTOGETHER 496 PAIRS OF INPUT DATA WHICH NEED TO BE 
SCRAMBLED AS SHOWN BELOW. 


+ + Fe + HF Fe HF F 


KAKKKEKEKKEKKEKKKEKEKKEKKKEKKKKKEKREKRKARKKKEKKEKEKKKKEKKKKKAKKKKAAKKKK 


* 

FT1024 —- BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 


2,1024,1024 
4,512,1024 
6,1536,1024 
8,256,1024 
10,1280,1024 
12,768,1024 
14,1792,1024 
16,128,1024 
18,1152,1024 
20 ,640,1024 
22,1664,1024 
24,384,1024 
26,1408,1024 
28,896 ,1024 
30,1920,1024 
32,64,1024 


vel 
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BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 


34,1088,1024 
36,576,1024 
38,1600,1024 
40,320,1024 
42,1344,1024 
44,832 ,1024 
46 ,1856,1024 
48,192,1024 
50,1216,1024 
52,704,1024 
54,1728 ,1024 
56,448 ,1024 
58,1472,1024 
60,960 ,1024 
62,1984,1024 
66,1056 ,1024 
68 ,544,1024 
70,1568 ,1024 
72, 288,1024 
74,1312,1024 
76,800 ,1024 
78,1824,1024 
80, 160,1024 
82,1184,1024 
84 ,672,1024 
86, 1696,1024 
88, 416,1024 
90,1440,1024 
92,928,1024 
94,1952,1024 
98,1120,1024 
100 ,608,1024 
102 ,1632 ,1024 
104 ,352,1024 
106 ,1376,1024 
108 ,864, i024 
110 , 1888 ,1024 
112,224,1024 
114,1248,1024 
116 ,736,1024 
118 ,1760,1024 
120 ,480 , 1024 
122,1504,1024 
124 ,992,1024 
126, 2016 ,1024 
130, 1040 ,1024 
132 ,528,1024 
134 ,1552 ,1024 
136 ,272,1024 
138 ,1296 ,1024 
140 , 784, 1024 
142, 1808, 1024 
146 ,1168,1024 
148 ,656, 1024 
150, 1680 ,1024 
152,400, 1024 
154,1424,1024 
156,912,1024 
158,1936,1024 
162,1104,1024 


BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 


164,592 ,1024 
166,1616,1024 
168,336 ,1024 
170,1360,1024 
172,848, 1024 
174,1872,1024 
176,208,1024 
178,1232,1024 
180 ,720,1024 
182 ,1744,1024 
184 ,464,1024 
186 ,1488,1024 
188 ,976,1024 
190, 2000 ,1024 
194,1072,1024 
196 ,560 ,1024 
198,1584,1024 
200 ,304,1024 
202 ,1328,1024 
204,816,1024 
206 ,1840 ,1024 
210,1200,1024 
212,688,1024 
214,1712,1024 
216,432 ,1024 
218,1456,1024 
220 ,944,1024 
222,1968,1024 
226,1136,1024 
228 ,624,1024 
230, 1648 ,1024 
232, 368,1024 
234,1392,1024 
236 , 880 , 1024 
238 ,1904,1024 
242 ,1264,1024 
244 ,752,1024 
246 ,1776,1024 
248 , 496 ,1024 
250,1520,1024 
252,1008 , 1024 
254, 2032,1024 
258 ,1032,1024 
260 ,520,1024 
262 ,1544,1024 
266 ,1288 ,1024 
268,776 ,1024 
270 ,1800 ,1024 
274,1160,1024 
276 ,648 ,1024 
278 ,1672,1024 
280 ,392,1024 
282 ,1416,1024 
284 ,904,1024 
286 ,1928,1024 
290,1096,1024 
292 ,584,1024 
294,1608,1024 
296 ,328,1024 
298,1352,1024 
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Sel 


BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 


300 ,840,1024 
302,1864,1024 
306 ,1224,1024 
308 ,712,1024 
310,1736,1024 
312,456,1024 
314,1480,1024 
316 ,968,1024 
318,1992,1024 
322 ,1064,1024 
324,552,1024 
326 ,1576,1024 
330,1320,1024 
332,808, 1024 
334, 1832,1024 
338,1192,1024 
340 ,680, 1024 
342,1704,1024 
344,424,1024 


346, 1448,1024 


348 , 936 ,1024 
350,1960,1024 
354,1128,1024 
356 ,616,1024 
358, 1640,1024 
362 ,1384,1024 
364,872 ,1024 
366, 1896, 1024 
370,1256,1024 
372,744,1024 
374,1768,1024 
376 , 488, 1024 
378 ,1512,1024 
380 ,1000,1024 
382 ,2024,1024 
386 , 1048 ,1024 
388 ,536,1024 
390, 1560 ,1024 
394,1304,1024 
396 ,792,1024 
398 ,1816,1024 
402,1176,1024 
404 ,664,1024 
406 ,1688,1024 
410, 1432,1024 
412 ,920,1024 
414,1944,1024 
418 ,1112,1024 
420 ,600,1024 
422, 1624,1024 
426 ,1368,1024 
428 ,856 ,1024 
430, 1880,1024 
434,1240,1024 
436 ,728,1024 
438 ,1752,1024 
440 ,472,1024 
442 ,1496 ,1024 
444,984 ,1024 
446 , 2008 ,1024 


BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 


450 ,1080,1024 
452 ,568,1024 

454,1592,1024 
458 ,1336,1024 
460 ,824,1024 

462 ;1848,1024 
466 ,1208,1024 
468 ,696 ,1024 

470,1720,1024 
474 ,1464,1024 
476 ,952,1024 

478,1976,1024 
482,1144,1024 
484 ,632,1024 

486 ,1656,1024 
490 ,1400,1024 
492 ,888,1024 

494 ,1912,1024 
498 ,1272,1024 
500 ,760,1024 

502,1784,1024 
506 ,1528,1024 
508 ,1016,1024 
510, 2040 ,1024 
514,1028,1024 
518,1540,1024 
522 ,1284,1024 
524,772,1024 

526 ,1796,1024 
530,1156,1024 
532 ,644,1024 

534,1668,1024 
538,1412,1024 
540 ,900,1024 

542 ,1924,1024 
546 ,1092,1024 
548 ,580,1024 

550,1604,1024 
554,1348,1024 
556 ,836,1024 

558,1860,1024 
562,1220,1024 
564,708,1024 

566 ,1732,1024 
570,1476,1024 
572 ,964,1024 

574,1988,1024 
578,1060,1024 
582 ,1572,1024 
586 ,1316,1024 
588 ,804,1024 

590 ,1828,1024 
594,1188,1024 
596 ,676,1024 

598,1700,1024 
602,1444,1024 
604 ,932,1024 

606 ,1956 ,1024 
610,1124,1024 
614,1636,1024 


9€I 
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BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 


618,1380, 1024 
620, 868 , 1024 

622,1892,1024 
626 ,1252,1024 
628 ,740,1024 

630 ,1764,1024 
634,1508,1024 
636 ,996 ,1024 

638, 2020,1024 
642 ,1044,1024 
646 ,1556,1024 
650,1300,1024 
652,788,1024 

654,1812,1024 
658,1172,1024 
662,1684, 1024 
666 ,1428, 1024 
668 ,916,1024 

670,1940,1024 
674,1108,1024 
678,1620,1024 
682 ,1364,1024 
684,852 ,1024 

686 ,1876,1024 
690 ,1236,1024 
692,724,1024 

694,1748 ,1024 
698 ,1492,1024 
700 , 980, 1024 

702,2004,1024 
706,1076,1024 
710,1588,1024 
714,1332,1024 
716,820 ,1024 

718,1844,1024 
722 ,1204,1024 
726,1716,1024 
730,1460,1024 
732 ,948,1024 

734 ,1972,1024 
738,1140,1024 
742 ,1652,1024 
746 ,1396,1024 
748 , 884 ,1024 

750,1908,1024 
754,1268,1024 
758,1780,1024 
762,1524,1024 
764,1012,1024 
766, 2036 ,1024 
770, 1036 ,1024 
774,1548,1024 
778 ,1292,1024 
782 ,1804,1024 
786 ,1164,1024 
790,1676,1024 
794 ,1420 ,1024 
796 ,908,1024 

798 ,1932 ,1024 
802,1100,1024 


BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 


806 ,1612,1024 
810 ,1356,1024 
812 , 844’, 1024 
814,1868,1024 
818,1228,1024 
822 ,1740,1024 
826 ,1484,1024 
828 ,972 ,1024 
830 ,1996,1024 
834,1068,1024 
838, 1580,1024 
842 ,1324,1024 
846 ,1836,1024 
850,1196,1024 
854,1708,1024 
858,1452,1024 
860 , 940 ,1024 
862 ,1964,1024 
866 ,1132,1024 
870, 1644,1024 
874,1388,1024 
878,1900,1024 
882 ,1260,1024 
886 ,1772,1024 
890 ,1516,1024 
892,1004,1024 
894, 2028,1024 
898,1052,1024 
902,1564,1024 
906 ,1308,1024 
910,1820,1024 
914,1180,1024 
918,1692,1024 
922,1436,1024 
926 ,1948,1024 
930,1116,1024 
934,1628,1024 
938 ,1372 ,1024 
942,1884,1024 
946 ,1244 ,1024 
950 ,1756,1024 
954,1500,1024 
956 ,988,1024 
958, 2012,1024 
962 ,1084,1024 
966 ,1596,1024 
970 ,1340,1024 
974 ,1852,1024 
978,1212,1024 
982 ,1724,1024 
986 ,1468,1024 
990,1980,1024 
994,1148,1024 
998,1660,1024 
1002 ,1404,1024 
1006 ,1916,1024 
1010 ,1276,1024 
1014,1788,1024 
1018 ,1532,1024 
1022, 2044, 1024 
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BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 


1030,1538,1024 
1034,1282,1024 
1038,1794,1024 
1042,1154,1024 
1046 , 1666, 1024 
1050,1410,1024 
1054,1922,1024 
1058,1090,1024 
1062,1602,1024 
1066, 1346, 1024 
1070 ,1858,1024 
1074,1218,1024 
1078,1730,1024 
1082,1474,1024 
1086 , 1986, 1024 
1094,1570,1024 
1098 ,1314,1024 
1102,1826,1024 
1106,1186,1024 
1110,1698,1024 
1114,1442,1024 
1118,1954,1024 
1126 ,1634,1024 
1130,1378,1024 
1134,1890,1024 
1138,1250,1024 
1142,1762,1024 
1146, 1506, 1024 
1150, 2018,1024 
1158,1554,1024 
1162,1298,1024 
1166 ,1810,1024 
1174,1682,1024 
1178, 1426 ,1024 
1182,1938,1024 
1190,1618,1024 
1194, 1362,1024 
1198,1874,1024 
1202,1234,1024 
1206 ,1746,1024 
1210,1490,1024 
1214, 2002,1024 
1222,1586,1024 
1226 ,1330,1024 
1230, 1842,1024 
1238,1714,1024 
1242,1458,1024 
1246 ,1970,1024 
1254,1650,1024 
1258, 1394, 1024 
1262,1906,1024 
1270 ,1778,1024 
1274,1522,1024 
1278 , 2034, 1024 
1286 ,1546,1024 
1294, 1802,1024 
1302,1674,1024 
1306 ,1418,1024 
1310, 1930,1024 
1318,1610,1024 


BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 
BITRVI 


1322,1354,1024 
1326 ,1866,1024 
1334,1738,1024 
1338,1482,1024 
1342,1994,1024 
1350,1578,1024 
1358,1834,1024 
1366 ,1706,1024 
1370,1450,1024 
1374,1962,1024 
1382 ,1642,1024 
1390 ,1898,1024 
1398,1770,1024 
1402,1514,1024 
1406 ,2026,1024 
1414,1562,1024 
1422,1818,1024 
1430 ,1690,1024 
1438 ,1946,1024 
1446 ,1626,1024 
1454,1882,1024 
1462,1754,1024 
1466 ,1498,1024 
1470, 2010,1024 
1478,1594,1024 
1486 ,1850,1024 
1494 ,1722,1024 
1502,1978,1024 
1510,1658,1024 
1518,1914,1024 
1526,1786 ,1024 
1534,2042,1024 
1550,1798,1024 
1558,1670,1024 
1566,1926,1024 
1574,1606,1024 
1582,1862,1024 
1590 ,1734,1024 
1598,1990,1024 
1614,1830,1024 
1622,1702,1024 
1630,1958,1024 
1646 ,1894,1024 
1654,1766,1024 
1662 ,2022,1024 
1678 ,1814,1024 
1694 ,1942,1024 
1710,1878,1024 
1718,1750,1024 
1726 ,2006,1024 
1742 ,1846,1024 
1758,1974,1024 
1774,1910,1024 
1790 , 2038 ,1024 
1822 ,1934,1024 
1838,1870,1024 
1854,1998,1024 
1886 ,1966,1024 
1918,2030,1024 
1982,2014,1024 


8c 
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KEKKKKEKEKKEREREREKRRKKKKKERKAEEEKKKKKKAKKKK KKK KKK 


+ + + Fe He FH OF 


THE FIRST 8 STAGES OF THE 10-STAGE 1024-POINT COMPLEX 
FFT WILL BE PERFORMED AS 4 SEPARATE 256-POINT COMPLEX 
FFT'S USING THE ON-CHIP DATA BLOCKS BO AND Bl. THE 
KERNEL CODE BELOW WILL THEREFORE BE CALLED 4 TIMES TO 
ACCOMPLISH THIS WHILE EXTERNAL DATA MEMORY WILL HAVE 
MOVE ON-CHIP USING THE BLKD INSTRUCTION. 


+ ee ee HF HF 


RAKKKKKAEKREKREKRERKAKKKEKEKKKEKRKEREAEEERREREEKKKKAKAKRKKKRKKKKKKKKKKK 


x 


LARP AR1 
LRLK AR1,512 
RPTK 255 
BLKD 1024, x+ 
RPTK 255 
BLKD 1280 , x+ 
CALL KNL256 
LRLK AR1,1024 
RPTK 255 
BLKD 512,*+ 
RPTK 255 
BLKD 768 ,x+ 
* 
LRLK AR1,512 
RPTK 255 
BLKD 1536, «+ 
RPTK 255 
BLKD 1792, x+ 
CALL KNL256 
LRLK AR1,1536 
RPTK 255 
BLKD 512, x+ 
RPTK 255 
BLKD 768, x+ 
* 
LRLK AR1,512 
RPTK 255 
BLKD 2048 , x+ 
RPTK 255 
BLKD 2304 , x+ 
CALL KNL256 
LRLK AR1, 2048 
RPTK 255 
BLKD 512, x*+ 
RPTK 255 
BLKD 768 ,x+ 
* 
LRLK AR1,512 
RPTK 255 
BLKD 2560 , a+ 
RPTK 255 
BLKD 2816, x+ 
CALL KNL256 
LRLK AR1 , 2560 
RPTK 255 
BLKD 512, «+ 
RPTK 255 


BLKD 768 , x+ 


BLOCK MOVE FIRST GROUP OF 256 
COMPLEX POINTS (1024-1535) FROM 
EXTERNAL RAM TO ON-CHIP RAM 


EXECUTE 256-POINT KERNEL FFT 
AND RETURN RESULTS TO 
EXTERNAL RAM (1024-1535) 


BLOCK MOVE SECOND GROUP OF 256 
COMPLEX POINTS (1536-2047) FROM 
EXTERNAL RAM TO ON-CHIP RAM 


EXECUTE 256-POINT KERNEL FFT 
AND RETURN RESULTS TO 
EXTERNAL RAM (1536-2047) 


BLOCK MOVE THIRD GROUP OF 256 
COMPLEX POINTS (2048-2559) FROM 
EXTERNAL RAM TO ON-CHIP RAM 


EXECUTE 256-POINT KERNEL FFT 
AND RETURN RESULTS TO 
EXTERNAL RAM (2048-2559) 


BLOCK MOVE FOURTH GROUP OF 256 
COMPLEX POINTS (2560-3071) FROM 
EXTERNAL RAM TO ON-CHIP RAM 


EXECUTE 256-POINT KERNEL FFT 
AND RETURN RESULTS TO 
EXTERNAL RAM (2560-3071) 


* 


AAKKKAKKKEKKKKKAKERERKEKEREREKEREKERKEKKEREEEKRKKKKKKKKKKKKKKK 


RAM. 


* ee ee HF FO OH 


EXTERNAL DATA MEMORY. 


PERFORM STAGE 9 OF THE 1024-POINT FFT -- TWIDDLE 
FACTOR VALUES ARE TABLE-READ FROM EXTERNAL PROGRAM 
MEMORY TO ON-CHIP RAM AND THE GENERAL 'BUTTERFLY' 
SUBROUTINE IS EXECUTED 512 TIMES WITH ALL DATA IN 

A SPEED IMPROVEMENT CAN BE 
ACHIEVED BY MOVING FFT DATA ON-CHIP INSTEAD OF TWIDDLE 
FACTOR VALUES FOR EXECUTION. 
FACTOR VALUES WILL HAVE TO BE STORED IN EXTERNAL DATA 


HOWEVER, THE TWIDDLE 


+ Fe He + HF FH FH HF OF 


KAKKKKKKKKEKKAKKKKKEKKKEKKEKERKEKKKAKREKREKREREKEEKAKKKKKK KKK KKK 


* 
STAGE9 


LOOP 


LOOP1 


LRLK 
LRLK 
LARP 
LALK 
TBLR 
ADLK 
TBLR 
ADLK 
BANZ 


LRLK 
LRLK 
LRLK 
LRLK 
LARP 
CALL 
LARP 
BANZ 


LRLK 
LRLK 
LRLK 
LARP 
CALL 
LARP 
BANZ 


LOOP , x- , ARO 


ARO ,512 
AR1,512 
AR2,1024+512 
AR3, 255 

AR2 

BTRFLY 

AR3 
LOOP1 , x- , AR2 


AR1,512 

AR2 , 2048+512 
AR3,255 

AR2 

BTRFLY 

AR3 
LOOP2 , *~ , AR2 


INITIALISE TWIDDLE FACTORS 
SET UP TWIDDLE TABLE SIZE 
USE ARO TO POINT AT TABLE 
SET UP TWIDDLE TABLE ADDRESS 
AND STORE IN INTERNAL RAM 


AND STORE IN INTERNAL RAM 


INITIALISE STEP SIZE 

AR1 POINTS AT TWIDDLE FACTORS 
AR2 POINTS AT REAL DATA 
INITIALISE LOOP COUNT 


PERFORM LOOPED FFT FOR STAGE 9 


AR1 POINTS AT TWIDDLE FACTORS 
AR2 POINTS AT REAL DATA 
INITIALISE LOOP COUNT 


PERFORM LOOPED FFT FOR STAGE 9 


KRAEKKKKKKKAEKKEKKEREREKEKRKKEKKAEKERKEKEKERERERERKEEKEREKKKKKKA KKK AK 


RAM. 


+ Fe Fe eH FH HH He HF 


EXTERNAL DATA MEMORY. 


PERFORM STAGE 10 OF THE 1024-POINT FFT -- TWIDDLE 
FACTOR VALUES ARE TABLE-READ FROM EXTERNAL PROGRAM 
MEMORY TO ON-CHIP RAM AND THE GENERAL 'BUTTERFLY' 
SUBROUTINE IS EXECUTED 512 TIMES WITH ALL DATA IN 

A SPEER IMPROVEMENT CAN BE 
ACHIEVED BY MOVING FFT DATA ON-CHIP INSTEAD OF TWIDDLE 
FACTOR VALUES FOR EXECUTION. 
FACTOR VALUES WILL HAVE TO BE STORED IN EXTERNAL DATA 


HOWEVER, THE TWIDDLE 


+e ee eH FH HF He 


RKEKKKKKEKKEKKEKEKEKKKEEKEKRRERERKEREKEEKKEEKEREREKREKREREREREKRKKKKER 


* 
STGE10 


LARP 
LRLK 
RPTK 
BLKP 


WOOO , x+ 


INITIALISE TWIDDLE FACTORS 
AND STORE IN INTERNAL RAM. 
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6£1 


RPTK 255 * 


BLKP W128, x+ PAC ACC= (1/2) (QI*COSX) 
_ MPY x*+,AR2 P-REGISTER = (1/2).(QR*SINX) 
LARP ARO SPAC ACC= (1/2) (QI*COSX-QR*SINX) 
LRLK ARO ,1024 INITIALISE STEP SIZE SACH x*0- QI = (1/2) (QI*xCOSX-QR«ASINX) 
LRLK AR1,512 AR1 POINTS AT TWIDDLE FACTORS * 
LRLK AR2, 2048 AR2 POINTS AT REAL DATA * CALCULATE Re[Pm+1] & Re(Qm+1] STORE RESULTS in PR & OR 
LRLK AR3, 255 INITIALISE LOOP COUNT * 
LARP AR2 MAR x- 
LOOP3 CALL BTRFLY PERFORM LOOPED FFT FOR STAGE 10 LAC x*0+,14 ACC= (1/4)PR 
LARP AR3 ADD «0-,15 ACC= (1/4) [PR+(QRACOSX+QI*SINX) ] 
BANZ LOOP3 , x- , AR2 SACH x«0+,1 PR = (1/2) [PR+(QRxCOSK+QI*SINX) ] 
‘ SUBH * ACC= (1/4) [PR-(QR&COSK+QI*SINX) } 
LARP ARO SACH «*0-,1 QR = (1/2) [PR-(QR*COSX+QI*SINX) ] 
LRLK ARO ,512 INITIALISE TWIDDLE FACTORS x 
RPTK 255 AND STORE IN INTERNAL RAM. * CALCULATE Im[Pm+1] & Im[{Qm+1] STORE RESULTS in PI & QI 
BLKP W256 , x+ * 
RPTK 255 MAR x+ 
BLKP W384 ,x+ LAC x*0+,14 ACC= (1/4)PI 
- ADD x0-,15 ACC= (1/4) [PI+(QI*COSX-QR*SINX) ] 
LARP ARO SACH *0+,1 PI = (1/2) [PI+(QI*COSKX-QR*SINX) ] 
LRLK ARO , 1024 INITIALISE STEP SIZE SUBH * ACC= (1/4) [PI-(QI*xCOSX-QRASINX) ] 
LRLK AR1,512 AR1 POINTS AT TWIDDLE FACTORS SACH «+,1 QI = (1/2) [PI-(QI*COSX-QR«*SINX) ] 
LRLK AR2,2048+512 AR2 POINTS AT REAL DATA RET 
LRLK AR3 ,255 INITIALISE LOOP COUNT * 
LARP AR2 ; HAKKRRKAKKAKRERKRAERERIAAREAR ERIKA AIIAI RIAA IRIAKA AAA A I AAA 
LOOP4 CALL BTRFLY PERFORM LOOPED FFT FOR STAGE 10 * x 
LARP AR3 * 16-BIT TWIDDLE FACTORS FOR 1024-POINT COMPLEX FFT * 
BANZ LOOP4, x- ,AR2 * * 
B DONE HIKARI AIARAIAK KARRI IAAI IAAI AAI AAI AAI RIAA AI AAI AIA 
* x 
PTEeTTtT Litt COLL tT Ltt ttt tttitotatetetstotatatetottatatotstatetotatetototatatetatatetatetetatel wooo DATA 32767 ,0 
* * wool DATA 32766,201 
k RADIX-2 GENERAL BUTTERFLY SUBROUTINE * woo2 DATA 32765 ,402 
* x woo3 DATA 32762 ,603 
* * W004 DATA 32758 , 804 
* P=(PR+jQI) P+QaW=(PR+Re (QaW] )+j5 (PI+Im(QxW ]} ) x woos DATA 32752,1005 
* \/ * wo06 DATA 32746 ,1206 
x /\ x wo07 DATA 32738,1407 
* Q=(QRt+jQI) P-QaW=(PR-Re [(QaW] )+j (PI+Im[Qz2W ]} ) * woos DATA 32728 ,1608 
* * woo9 DATA 32718,1809 
* -j(2(pi)/N)k * wO010 DATA 32706 , 2009 
* W =e =COS(X)-jSIN(X) * woll DATA 32693 ,2210 
* N * W012 DATA 32679 , 2410 
* =WRtjWI * wo13 DATA 32664,2611 
* * wO14 DATA 32647 ,2811 
HRI RII IIIS IAIA IIIA IIIA IIIA IIIS IIIA IIIS IISA AAI IAAIAIA AAA AIK wo1s DATA 32629 , 3012 
* wol6 DATA 32610, 3212 
* CALCULATE QR*COS(X) + QI*SIN(X) AND STORE RESULT IN QR WO17 DATA 32590 , 3412 
k W018 DATA 32568 , 3612 
BTRFLY LT x«+,AR1 LOAD T-REGISTER WITH QR W019 DATA 32545 ,3811 
MPY x«+,AR2 P-REGISTER = (1/2) (QRxCOSX) W020 DATA 32521 ,4011 
LTP x-,AR1 ACC= (1/2)(QR*COSX); LOAD TR WITH QI w021 DATA 32496 ,4210 
MPY x«- P-REGISTER = (1/2) (QI*SINX) W022 DATA 32470 , 4410 
APAC ACC= (1/2) (QR*COSX+QI*SINX) W023 DATA 32442 ,4609 
MPY x«+,AR2 P-REGISTER = (1/2) (QI*COSX) W024 DATA 32413 , 4808 
LT * LOAD T-REGISTER WITH QR W025 DATA 32383 ,5007 
SACH «+,0,AR1 QR = (1/2) (QR*COSX+QI*SINX) W026 DATA 32352,5205 
* W027 DATA 32319 , 5404 
x CALCULATE QI*COS(X) - QR*SIN(X) AND STORE RESULT IN QI W028 DATA 32285 , 5602 


Ov! 
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w029 
W030 
W031 
W032 
W033 
W034 
W035 
W036 
W037 
W038 
W039 
w040 
W041 
W042 
W043 
W044 
W045 
W046 
W047 
W048 
W049 
W050 
wosl 
W052 
W053 
W054 
wos5 
WO56 
WO57 
wO58 
W059 
wO060 
wo61 
W062 
W063 
W064 
wO65 
W066 
W067 
w068 
W069 
w070 
wWO71 
W072 
W073 
W074 
wO75 
W076 
W077 
wO78 
wOo79 
wo80 
w08l 
WO82 
W083 
W084 
woss 
W086 
w087 
wO88 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


32250, 5800 
32214, 5998 
32177 ,6195 
32138 , 6393 
32098 ,6590 
32057 ,6786 
32015 ,6983 
31972 ,7179 
31927 ,7375 
31881 ,7571 
31834 ,7767 
31786 , 7962 
31736, 8157 
31686 ,8351 
31634, 8546 
31581, 8739 
31527 , 8933 
31471 ,9126 
31415 ,9319 
31357 ,9512 
31298 ,9704 
31238 , 9896 
31176,10088 
31114,10279 
31050 ,10469 
30985 ,10660 
30919, 10850 
30852 ,11039 
30784 ,11228 
30715,11417 
30644,11605 
30572 ,11793 
30499 ,11980 
30425 ,12167 
30350 ,12354 
30274,12540 
30196 ,12725 
30117,12910 
30038, 13095 
29957 ,13279 
29875 ,13462 
29791 ,13645 
29707 ,13828 
29622,14010 
29535 ,14191 
29448 , 14372 
29359,14553 
29269, 14733 
29178,14912 
29086, 15091 
28993 ,15269 
28899 , 15447 
28803 , 15624 
28707 , 15800 
28609, 15976 
28511,16151 
28411 ,16326 
28310 ,16500 
28209 , 16673 
28106 , 16846 


wos9 
wO090 
wool 
W092 
wO093 
W094 
wog5 
w096 
W097 
w098 
w099 
W100 
w101 
W102 
W103 
W104 
W105 
W106 
W107 
W108 
w109 
W110 
W111 
W112 
W113 
W114 
W115 
W116 
W117 
W118 
w119 
W120 
W121 
W122 
W123 
W124 
W125 
W126 
W127 
W128 
W129 
W130 
W131 
W132 
W133 
W134 
W135 
W136 
W137 
W138 
W139 
W140 
W141 
W142 
W143 
W144 
W145 
W146 
W147 
W148 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


28002,17018 
27897, 17190 
27791 ,17360 
27684 ,17531 
27576 ,17700 
27467 ,17869 
27357 , 18037 
27245 ,18205 
27133, 18372 
27020, 18538 
26906 , 18703 
26790 , 18868 
26674, 19032 
26557,19195 
26439 , 19358 
26319, 19520 
26199,19681 
26078 ,19841 
25956, 20001 
25832, 20160 
25708, 20318 
25583, 20475 
25457 , 20632 
25330 , 20788 
25202 , 20943 
25073 ,21097 
24943 , 21250 
24812,21403 
24680 , 21555 
24548 , 21706 
24414, 21856 
24279 , 22005 
24144, 22154 
24007 , 22302 
23870 , 22449 
23732 ,22595 
23593, 22740 
23453 , 22884 
23312 ,23028 
23170, 23170 
23028 , 23312 
22884 , 23453 
22740 , 23593 
22595 , 23732 
22449 , 23870 
22302 ,24007 
22154, 24144 
22005 , 24279 
21856 , 24414 
21706 , 24548 
21555, 24680 
21403 , 24812 
21250 , 24943 
21097 , 25073 
20943 , 25202 
20788 , 25330 
20632, 25457 
20475 ,25583 
20318, 25708 
20160 , 25832 
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lvl 


W149 
W150 
W151 
W152 
W153 
W154 
W155 
W156 
W157 
W158 
W159 
W160 
wW16l 
W162 
W163 
W164 
W165 
W166 
W167 
W168 
W169 
W170 
W171 
W172 
W173 
W174 
W175 
W176 
W177 
W178 
W179 
W180 
W181 
W182 
W183 
W184 
W185 
W186 
W187 
W188 
W189 
W190 
W191 
W192 
W193 
W194 
W195 
W196 
W197 
W198 
W199 
W200 
W201 
W202 
W203 
W204 
W205 
W206 
W207 
W208 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


20001, 25956 
19841 , 26078 
19681 , 26199 
19520 , 26319 
19358 , 26439 
19195, 26557 
19032 , 26674 
18868 , 26790 
18703 , 26906 
18538 , 27020 
18372 ,27133 
18205 ,27245 
18037 , 27357 
17869 , 27467 
17700 , 27576 
17531 ,27684 
17360,27791 
17190 ,27897 
17018, 28002 
16846 , 28106 
16673 ,28209 
16500, 28311 
16326, 28411 
16151,28511 
15976 , 28609 
15800 , 28707 
15624 , 28803 
15447 , 28899 
15269, 28993 
15091, 29086 
14912 ,29178 
14733 ,29269 
14553, 29359 
14372 ,29448 
14191 ,29535 
14010, 29622 
13828 , 29707 
13645,29791 
13462 ,29875 
13279,29957 
13095 , 30038 
12910 ,30117 
12725 ,30196 
12540 ,30274 
12354 ,30350 
12167,30425 
11980 , 30499 
11793 ,30572 
11605 ,30644 
11417 ,30715 
11228 ,30784 
11039, 30852 
10850 , 30920 
10660 , 30986 
10469 , 31050 
10279,31114 
10087 ,31176 
9896 ,31238 

9704 ,31298 

9512,31357 


W209 
W210 
W211 
W212 
W213 
W214 
W215 
W216 
W217 
W218 
W219 
W220 
W221 
W222 
W223 
W224 
W225 
W226 
W227 
W228 
W229 
W230 
W231 
W232 
W233 
W234 
W235 
W236 
W237 
W238 
W239 
W240 
W241 
W242 
W243 
W244 
W245 
W246 
W247 
W248 
W249 
W250 
W251 
W252 
W253 
W254 
W255 
W256 
W257 
W258 
W259 
W260 
W261 
W262 
W263 
W264 
W265 
W266 
W267 
W268 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
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DATA 
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DATA 
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DATA 
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DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


9319,31415 
9126 ,31471 
8933 ,31527 
8739, 31581 
8546 31634 
8351, 31686 
8157, 31736 
7962 ,31786 
7767 , 31834 
7571 ,31881 
7375 , 31927 
7179 , 31972 
6983 ,32015 
6786 , 32057 
6590, 32098 
6393 ,32138 
6195, 32177 
5998 32214 
5800 , 32250 
5602, 32285 
5404 ,32319 
5205 , 32352 
5007 , 32383 
4808 , 32413 
4609 , 32442 
4410 ,32470 
4211, 32496 
4011,32521 
3811, 32545 
3612, 32568 
3412, 32590 
3212, 32610 
3012, 32629 
2811, 32647 
2611, 32664 
2410 ,32679 
2210, 32693 
2009 , 32706 
1809 ,32718 
1608 , 32728 
1407 , 32738 
1206 ,32746 
1005 ,32752 
804 , 32758 
603 , 32762 
402 ,32765 
201 ,32766 
0,32767 

-201, 32766 

-402 32765 

-603 , 32762 

-804 32758 

-1005 ,32752 

-1206 ,32746 

-1407 ,32738 

-1608 ,32728 

-1809 , 32718 

-2009 , 32706 

-2210 ,32693 

-2410 ,32679 
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W269 
W270 
W271 
W272 
W273 
W274 
W275 
W276 
W277 
W278 
W279 
W280 
W281 
W282 
W283 
W284 
W285 
W286 
W287 
W288 
W289 
W290 
W291 
W292 
W293 
W294 
W295 
W296 
W297 
W298 
W239 
W300 
W301 
W302 
W303 
W304 
W305 
W306 
W307 
W308 
W309 
W310 
W311 
W312 
W313 
W314 
W315 
W316 
W317 
W318 
W319 
W320 
W321 
W322 
W323 
W324 
W325 
W326 
W327 
W328 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
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DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
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DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


-2611,32663 
-2811, 32647 
-3012,32629 
-3212,32610 
-3412,32590 
-3612,32568 
-3811,32545 
-4011 , 32521 
-4210 , 32496 
-4410 , 32470 
-4609 , 32442 
~4808 , 32413 
-5007 , 32383 
-5205 ,32352 
-5404 , 32319 
-5602 ,32285 
~5800 , 32250 
-5998 ,32214 
-6195, 32177 
-6393 , 32138 
-6590 , 32098 
-6786 ,32057 
-6983 ,32015 
-7179 ,31972 
~7375 ,31927 
-7571,31881 
-7767 , 31834 
-7962 ,31786 
-8157, 31736 
-8351 ,31686 
-8546 , 31634 
-8739,31581 
-8933,31527 
-9126,31471 
-9319,31415 
-9512,31357 
-9704,31298 
-9896 ,31238 
-10087 , 31176 
-10279,31114 
-10469, 31050 
-10660 , 30986 
-10850, 30920 
-11039, 30852 
-11228, 30784 
-11417,30715 
-11605 , 30644 
-11793 ,30572 
-11980 , 30499 
-12167 ,30425 
-12354,30350 
-12540 ,30274 
-12725 , 30196 
-12910,30117 
-13095 , 30037 
-13279, 29957 
-13462 ,29875 
-13645 ,29791 
-13828 , 29707 
-14010, 29622 


W329 
W330 
W331 
W332 
W333 
W334 
W335 
W336 
W337 
W338 
W339 
W340 
W341 
W342 
W343 
W344 
W345 
W346 
W347 
W348 
W349 
W350 
W351 
W352 
W353 
W354 
W355 
W356 
W357 
W358 
W359 
W360 
W361 
W362 
W363 
W364 
W365 
W366 
W367 
W368 
W369 
W370 
W371 
W372 
W373 
W374 
W375 
W376 
W377 
W378 
W379 
W380 
W381 
W382 
W383 
W384 
W385 
W386 
W387 
W388 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


-14191,29535 
-14372,29448 
-14553, 29359 
-14733, 29269 
-~14912,29178 
-15091 ,29086 
-15269, 28993 
~15447, 28899 
-15624, 28803 
-15800, 28707 
-15976, 28609 
-16151,28511 
-16326, 28411 
-16500 , 28310 
~16673 , 28209 
-16846 , 28106 
-17018, 28002 
-17190 , 27897 
-17360,27791 
-17531, 27684 
-17700 , 27576 
-17869 , 27467 
-18037 , 27357 
-18205, 27245 
-18372,27133 
-18538, 27020 
-18703 , 26906 
-18868 , 26790 
-19032,26674 
-19195 ,26557 
~19358 , 26439 
-19520, 26319 
-19681 , 26199 
-19841 , 26078 
-20001 ,25956 
-20160 , 25832 
~20318,25708 
-20475 ,25583 
-20632 ,25457 
-20788 , 25330 
-20943 , 25202 
-21097 ,25073 
-21250 , 24943 
-21403 , 24812 
-21555 ,24680 
-21706 ,24548 
~21856 , 24414 
-22005 ,24279 
~22154,24144 
- 22302 ,24007 
-22449 ,23870 
-22595 ,23732 
-22740 ,23593 
-22884 ,23453 
~23028,23312 
-23170,23170 
-23312,23028 
- 23453 , 22884 
-23593 ,22740 
-23732 ,22595 
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W442 


W445 
W446 
W447 
W448 


-23870 , 22449 
-24007 , 22302 
-24144 ,22154 
-24279 , 22005 
-24414, 21856 
~ 24548 , 21706 
-~24680 , 21555 
~24812 , 21403 
-24943 , 21250 
- 25073 , 21097 
-25202 , 20943 
-25330 , 20788 
-25457 , 20632 
~25583 , 20475 
~25708 , 20318 
-25832 , 20160 
-25956 , 20001 
-26078 ,19841 
-26199 , 19681 
-26319 ,19520 
~26439 ,19358 
-26557,19195 
~26674, 19032 
-26790 , 18868 
~26906 ,18703 
-27020 , 18538 
-27133 , 18372 
-27245 ,18205 
-27357 , 18037 
-27467 , 17869 
-27576,17700 
~27684 , 17530 
-27791 , 17360 
-27897 ,17190 
-28002 ,17018 
~28106 , 16846 
~28209 ,16673 
~28311,16500 
-28411, 16326 
-28511,16151 
-28609 , 15976 
-28707 , 15800 
~28803 , 15624 
~28899 , 15447 
-28993 ,15269 
-29086 ,15091 
-29178,14912 
~29269, 14733 
-29359,14553 
- 29448 , 14372 
-29535,14191 
-29622,14010 
- 29707 ,13828 
29791 ,13645 
-~29875 ,13462 
-29957 ,13279 
-~30038 ,13095 
-30117,12910 
-30196 ,12725 
~30274,12540 


-30350 , 12354 
-30425 ,12167 
~30499 ,11980 
-30572,11793 
-30644 ,11605 
-30715,11417 
-30784,11228 
-30852,11039 
-30920, 10850 
-30985 , 10660 
-31050, 10469 
-31114,10279 
-31176 , 10087 
-31238 , 9896 
-31298 ,9704 
-31357,9512 
-31415,9319 
~31471 ,9126 
-31527 , 8933 
-31581 ,8739 
-31634, 8546 
-31686 , 8351 
-31736, 8157 
~31786 , 7962 
~31834 , 7767 
-31881 ,7571 
-31927 , 7375 
-31972,7179 
-32015,6983 
-32057 ,6786 
-32098 , 6589 
-32138 ,6393 
~32177,6195 
-32214,5997 
-32250 , 5800 
-32285 , 5602 
-32319, 5404 
-32352,5205 
-32383 ,5007 
~32413 , 4808 
-32442 ,4609 
-32470 , 4410 
-32496 ,4210 
-32521 ,4011 
-32545 , 3811 
-32568, 3612 
-32590 , 3412 
-32610, 3212 
-32629 , 3012 
-32647 , 2811 
-32664, 2611 
~32679 , 2410 
~32693 ,2210 
-32706 , 2009 
~32718,1809 
-32728 , 1608 
-32738,1407 
-32746 ,1206 
~32752,1005 
-32758 ,804 


-32762 ,603 
32765 ,402 
-32766 , 201 
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APPENDIX E 


IDT 'S4FFT256' 


COOLEY-TUKEY RADIX-4, DIF FFT PROGRAM - 256-POINT STRAIGHT-LINE. 


FOUR STAGES OF THREE TYPES RADIX-4 BUTTERFLIES - 

ZERO, SPECIAL, AND NORMAL - IMPLEMENTED WITH MACROS. 

COMPLEX INPUT DATA LOCATED ON PAGES 4-7 OF DATA MEMORY. 
RESULTS ARE LEFT IN DATA RAM. 

USES 13-BIT COEFFICIENTS FROM MPYK INSTRUCTIONS. 

INTERMEDIATE VALUES ARE SCALED BY 1/4 AT EACH STAGE TO PREVENT 
OVERFLOW. 


+ ee ee HH HH eH OH Oe OH 


RRKKRKEKKKKRKRIAKKEK RIK IIRIK RK IIKIIK RIAA AAAI AERA ISI IIIA AARERARAERRRAAR KR 
* 

x DATA MEMORY ALLOCATION. 

* 

RAKKKKAKEKRRIAREKERK IIIA RIKI IKR IAI KIKI IKARIA IKI III IIIA RARER KK 
N EQU 256 * FFT LENGTH 


Tl EQU 96 * TEMPORARY LOCATIONS ADDRESSED 
T2  EQU 97 * BY AUXILIARY REGISTERS. 
* 

>) EQU 512 

Xl EQU 514 

X2 EQU 516 

x3 EQU 518 

X4 EQU 520 

x5 EQU 522 

X6 EQU 524 

X7 EQU 526 

X8 EQU 528 

x9 EQU 530 


NOLLVINGNA Td Lid JIG ¢-XIdVu “INIOd-9S7 V 


ad XIGNdddV 
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EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


x90 

x91 

X92 

X93 

X94 

X95 

X96 

X97 

x98 

x99 

X100 
X101 
X102 
X103 
X104 
X105 
X106 
X107 
X108 
X109 
X110 
X111 
X112 
X113 
X114 
X115 
X116 
X117 
X118 
X119 
X120 
X121 
X122 
X123 
X124 
X125 
X126 
X127 
X128 
X129 
X130 
X131 
X132 
X133 
X134 
X135 
X136 
X137 
X138 
X139 
X140 
X141 
X142 
X143 
X144 
X145 
X146 
X147 
X148 
X149 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
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X150 
X151 
X152 
X153 
X154 
X155 
X156 
X157 
X158 
X159 
X160 
X161 
X162 
X163 
X164 
X165 
X166 
X%167 
X168 
x169 
X170 
X171 
X172 
X173 
X174 
X175 
K176 
X177 
X178 
179 
X180 
X181 
X182 
X183 
X184 
x185 
X186 
X187 
X188 
x189 
x199 
x191 
X192 
X193 
X194 
X195 
X196 
X197 
X198 
X199 
X%200 
X201 
X202 
X203 
X204 
X205 
X%206 
X207 
X208 
X209 


X255 
* 


* TABLE WITH COSINES 


EQU 
EQU 
EQU 


1020 
1022 


4095 
4094 
4090 
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EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


-699 
-798 

-896 

-994 

-1091 
-1188 
-1284 
-1379 
-1473 
-1566 
-1658 
-1750 
-1840 
-1929 
-2017 
-2104 
-2190 
-2274 
-2357 
-2438 
-2518 
-2597 
-2674 
-2749 
-2823 
-2895 
-2965 
-3033 
-3100 
-3164 
-3227 
-3288 
-3347 
-3404 
-3459 
-3511 
-3562 
-3610 
-3657 
-3701 
-3743 
-3782 
-3820 
~3855 
-3887 
-3918 
-3946 
-3971 
-3995 
-4015 
-4034 
-4050 
-4063 
~4074 
-4083 
-4089 
-4093 
-4094 
-4093 
~4089 
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C191 
C192 
C193 
C194 
C195 
C196 
C197 
C198 
c199 
C200 
C201 
C202 
C203 
C204 
C205 
C206 
C207 
C208 
C209 
C210 
C211 
C212 
C213 
C214 
C215 
C216 
C217 
C218 
C219 
C220 
C221 
C222 
C223 
C224 
C225 
C226 
C227 
C228 
C229 
C230 
C231 
C232 
C233 
C234 
C235 
C236 
C237 
C238 
C239 
C240 
C241 
C242 
C243 
C244 
C245 
C246 
C247 
C248 
C249 
C250 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


Os! 
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c281 
C252 
C253 
C254 
C255 
k 


* TABLE WITH SINES 


EQU 
EQU 
EQU 
EQU 
EQU 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


4064 
4075 
4084 
4090 
4094 
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$112 
$113 
$114 
$115 
$116 
$117 
$118 
$119 
$120 
$121 
$122 
$123 
$124 
$125 
$126 
$127 
$128 
$129 
$130 
$131 
$132 
$133 
$134 
$135 
$136 
$137 
$138 
$139 
$140 
$141 
$142 
$143 
$144 
$145 
$146 
$147 
$148 
$149 
$150 
$151 
$152 
$153 
$154 
$155 
$156 
$157 
$158 
$159 
$160 
$161 
$162 
$163 
$164 
$165 
$166 
$167 
$168 
$169 
$170 
$171 


1567 
1474 


1285 
1189 
1092 


-699 

-798 

-896 

-994 

-1091 
-1188 
-1284 
-1379 
-1473 
-1566 
-1658 
-1750 
~1840 
-1929 
-2017 
-2104 
-2190 
-2274 
-2357 
-2438 
-2518 
-2597 
-2674 
-2749 
-2823 
-2895 
-2965 
-3033 
~3100 
-3164 
-3227 
-3288 
-3347 
-3404 
~3459 
-3511 
-3562 


$172 
$173 
$174 
$175 
$5176 
$177 
$178 
$179 
$180 
$181 
$182 
$183 
S184 
$185 
S186 
$187 
$188 
$189 
$190 
$191 
$192 
$193 
$194 
$195 
$196 
$197 
$198 
$199 
$200 
$201 
$202 
$203 
S204 
$205 
S206 
$207 
$208 
$209 
$210 
S211 
$212 
$213 
$214 
$215 
$216 
$217 
$218 
$219 
$220 
$221 
S222 
$223 
$224 
$225 
$226 
S227 
$228 
$229 
$230 
$231 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


EQU ~ 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


cSI 
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$232 EQU -2274 SACH Tl x Tl = (1/4)(XI - XI2) 
$233 EQU -2190 ADD x*,15,AR1 * Rl (ACC) = (1/4)(XI + X12) 
$234 EQU -2104 SUB *,14,AR2 x (1/4)XI1 
$235 EQU -2017 SACH *0-,0,AR1 * XI2 = R1 - (1/4)XI1 
$236 EQU -1929 ADD x+,15,AR2 *« (1/2)XI1 
$237 EQU -1840 SACH *+,0,AR1 x XI = R1 + (1/4)XI1 
$238 EQU -1750 * 
$239 EQU -1658 LAC 0+ x YI1 
$240 EQU -1566 ADD x0- x YI3 
$241 EQU -1473 SACL x0+ * YI1 = ¥I1 + YI3 
$242 EQU -1379 SUB x,1 x YI3 
$243 EQU -1284 SACL x0-,0,AR2 x YI3 = YI1 - YI3 
$244 EQU -1188 LAC *0+,14 x (1/4)YI 
$245 EQU -1091 SUB *,14 x (1/4)YI2 
S246 EQU -994 SACH T2 * T2 = (1/4)(¥YI - YI2) 
$247 EQU -896 ADD x,15,AR1 x S1 (ACC) = (1/4)(YI + YI2) 
$248 EQU -798 SUB *,14,AR2 x (1/4)YI1 
$249 EQU -699 SACH *0-,0,AR1 x YI2 = Sl - (1/4)YI1 
$250 EQU -600 ADD x*0+,15,AR2 x (1/2)YI1 
$251 EQU -S00 SACH *+,0,AR1 x YI = $1 + (1/4)¥I1, POINT TO NEXT XI 
S252 EQU -400 x 
$253 EQU -300 ZALH Tl 
$254 EQU -200 ADD «*0-,14 x (1/4)¥I3 
$255 EQU -99 MAR x- x POINT TO XI1l 
SACH x+ * XI1 = Tl + (1/4)Y¥I3 
MAR x0O+ *x POINT TO YI3 
PAGE SUB *-,15 x (1/2)¥I3 
® SACH Tl x Tl = Tl - (1/4)YI3 
k ZALH T2 
x COOLEY-TUKEY RADIX-4, DIF FFT PROGRAM - 256-POINT STRAIGHT-LINE. ADD *+,14 * (1/4)XI3 
* SACH *- x YI3 = T2 + (1/4)xXI3 
k FOUR STAGES OF THREE TYPES RADIX-4 BUTTERFLIES - SUB «,15 * (1/2)xI3 
x ZERO, SPCIAL, AND NORMAL - IMPLEMENTED WITH MACROS. LAR AR3,T1 * AR3 = T1 
k COMPLEX INPUT DATA LOCATED ON PAGES 4-7 OF DATA MEMORY. SAR AR3,x+ * XI3 = Tl 
x RESULTS ARE LEFT IN DATA RAM. MAR x0- * POINT TO YI1 
k USES 13-BIT COEFFICIENTS FROM MPYK INSTRUCTIONS. SACH x+ * YI1 = T2 - (1/4)XI3 , POINT TO NEXT XI1 
* INTERMEDIATE VALUES ARE SCALED BY 1/4 AT EACH STAGE TO PREVENT * 
* OVERFLOW. SEND 
* KRKKKKKKRKKEKKEKKKKEKKKKKEKKEKEREKERERKKEEEKEKKKERKEKKKKKKEKKKEEEKEKKAKKKKKKKKKK KKK 
RAEKKKKKKKKKKEKKKKKKKKKKKKKKK KKK KKK KR RRR KR KKK KKKKKK * 
k * NORMAL - STANDARD RADIX-4 BUTTERFLY. 
* MACROS TO PRODUCE STRAIGHT-LINE 256-POINT COMPLEX FFT. * 
x * X'S AND Y'S ARE INPUT AND OUTPUT LOCATIONS FOR BUTTERFLY. 
HAKKAKKKAKKRKERERE RIKI ERAREREAARERAERERARERAERIIAARRAKK AAR ARIK K < x IA'S SPECIFY TWIDDLE FACTOR LOCATIONS. 
k x ENTER WITH ARP = 1, AR] --> XI1, AR2 --> XI, ARO = [XI3] - [XI1] 
x ZERO FOR CASE W = 1 (THETA = 0). * EXIT WITH ARP = 1, AR1 --> NEXT XI1, AR2 --> NEXT XI 
*& ‘ * 
* X'S AND Y'S ARE INPUT AND OUTPUT LOCATIONS FOR BUTTERFLY. NORMAL $MACRO IA1,IA2,IA3 
* ENTER WITH ARP = 1, AR1 --> XI1, AR2 --> XI, ARO = [XI3]-[XI1] * 
* EXIT WITH ARP = 1, AR] --> NEXT XI1, AR2 --> NEXT XI LAC x*0+ * XI1 
*& ADD x0- * XI3 
ZERO SMACRO SACL *0+ * XI1 = XI1 + XI3 
x SUB *,1 x XI3 
LAC «0+ * XI1 SACL *0-,0,AR2 * XI3 = XI1 - XI3 
ADD «*0- * XI3 LAC x*0+,14 x (1/4)XI 
SACL x0+ * XI1 = XI1 + XI3 SUB x,14 x (1/4)XI2 
SUB x,1 * X13 SACH Tl *& Tl = (1/4)(XI - XI2) 
SACL x0-,0,AR2 * XI3 = XI1 - XI3 ADD *,15,AR1 * Rl (ACC) = (1/4)(XI + XI2) 
LAC *0+,14 * (1/4)XI SUB %*,14,AR2 * (1/4)XT1 
SUB *,14 x (1/4)XI2 SACH x0-,0,AR1 * XI2 = Rl - (1/4)xXI1 
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est 


*x+,15,AR2 
a+ ,0,AR1 


x0+ 

xO- 

*0+ 

*,1 
*0-,0,AR2 
x0+,14 
*,14 

T2 
*,15,AR1 
*,14,AR2 
*0-,0,AR1 
*0+,15,AR2 
*0+,0,AR1 


*0+,4,AR2 


C:IA2: 
zt 
S:IA2: 
k 
x-,4 
S:IA2: 


* 


* (1/2)XI1 
XI = R1 + (1/4)XI1 
YI 
YI3 
YI1 = YI1 + YI3 
YI3 
YI3 = YI1 - YI3 
(1/4)YI 
(1/4)¥I2 


* * e+e FF FH HH OF +e + e +e ee He FH +e Fe FF HH HF OF OF 


+e Fe ee eH OF OF 


* 


T2 = (1/4)(YI - YI2) 

S1 (ACC) = (1/4)(¥I + YI2) 
(1/4)¥I1 

YI2 = $1 - (1/4)¥I1 
(1/2)¥I1 


* YI = $1 + (1/4)YI1, POINT TO YI2 


(1/4) ¥13 

POINT TO XI1 

XI1 = Tl + (1/4)¥I3 
POINT TO YI3 
(1/2)¥13 

Tl = Tl - (1/4)¥I3 


(1/4) X13 

T2 = T2 + (1/4)XI3 
(1/2)X1I3, POINT TO YI3 
POINT TO YI1 

YI1 = T2 - (1/4)XI3 


YI1l 

Col *« YI1l 

xIl 

SIl * XI1l 

YI1 

YIL = YIixCOl - XI1*SI1 
SIl * YI1 

xIl 

Col « XI1l 

YI2 

KI1 = YI1*SI1 + XI1*COl, POINT TO XI3 
CO2 * YI2 

xI2 

ST2 * XI2 

YI2 

YI2 = YI2*CO2 - XI2xSI2 
SI2 * YI2 

xI2 

CO2 *« XI2 

Tl 


XI2 = YI2xSI2 + X1I2*CO2 


Co3 x Tl 
SI3 x T2 


XI3 = T1l«CO3 + T2x*SI3 


MPYK C:IA3: 
LTP Tl 
MPYK S:IA3: 
LTS x+,AR1 
SACH x«+,4 
MAR x0- 

* 
SEND 


KRKEEKKKEKKEKKEKEKKKEREEREKKEKEREKREKKKREKKEKEEEKKKEREKEKEREREREKKEKKKRKEKKEKEKKAKK 


SPCIAL FOR CASE THETA = pi/4 


+ + +t Fe e 


* 
SPCIAL $MACRO 
x 


SACL *-,0,AR1 
SUB x-,1 


LAC *0+,0,AR2 
SUB x 

SACL Tl 

ADD x,1l 

SACL *0+ 


LT «t+,AR1 
MPYK C32 
MAR xt 


+ + Fe FF 


+e ee + ee Fe +e & e F 


+e Fe 


+ * Fe HF eH HF Fe 


* 


C03 « T2 


SPAC, POINT TO NEXT XI, (LT = DUMMY OP) 


YI3 = T2xCO3 - T1x*SI3 
POINT TO NEXT XI1 


X'S AND Y'S ARE INPUT AND OUTPUT LOCATIONS FOR BUTTERFLY. 
ENTER WITH ARP = 1, AR1 --> XI1, AR2 --> XI, ARO = {XI3] - [XI1] 
EXIT WITH ARP = 1, AR1 --> NEXT XI1, AR2 --> NEXT XI 


(1/4)XI1 
(1/4)XI3 
XI1 = (1/4) (X11 
(1/2)xX13 
XI3 = (1/4)(XI1 


+ 


X13) 


XI3) 


(1/4)YI3 
(1/4)¥I1 
YI1 = (1/4)(YI1 + Y1I3) 
(1/2)¥13 

YI3 = (1/4)(YI1 


YI3), POINT TO YI1 


(1/4)XI 

(1/4)X1I2 

XI = (1/4)(XI + XI2) 
(1/2)XI2 

XI2 = (1/4)(XI - XI2) 


(1/4)¥12 

(1/4)¥I 

YI = (1/4)(¥I + YI2) 

(1/2)¥12 

YI2 = (1/4)(¥YI - YI2), POINT TO YI 


YI 

YIl 

YI = YI + YI1 

2 YI1l 

T2 = YI - YIl 
XI1, POINT TO XI3 
Tl = XI1 - XI 


XI = XI1 + XI, POINT TO XI2 


YI3 


YI3 = (XI2-YI3)«C32 


vst 
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LTA x0- *& XI3 

SACH *0+,4,AR2 «x XI1 = 
* 

MPYK C32 

LTP *,ARI * YI2 

MPYK C32 

APAC 

SACH «*0-,4 * XI3 = 

SPAC 

SPAC 

NEG 

MAR xt 

SACH «,4,AR2 * YI1 = 
* 

LAC Tl 

SACL «- *« YI2 = 

LAC T2 

SACL *0- * XI2 = 

MAR x+ 

MAR x«+,AR1l * POINT 
* 

LAC - * YI1 

SUB «+ * XI1 

SACL x- * YI1 = 

ADD x,l * 2 XIl 

SACL «0+ * XI1 = 

LAC «+ * XI3 

SUB x*- x YI3 

SACL x+ * XI3 = 

ADD x,1 *x 2 YI3 

NEG 

SACL x0- x YI3 = 

MAR xt * POINT 
ke 

SEND 


HAKKAR IAKRERERIREIREEERRERKEREIRAEREKEI KAR EAIAAA KER I KIIAEAAIAAAEAIAAAAA AAI 


& 


(XI2+YI3) «C32 


(YI2+X1I3)*C32 


(YI2-XI3)*C32 


Tl 
T2 


TO NEXT XI 
¥Il - XI1l 
YI1 + XI1 
XI3 - YI3 


-(YI3 + XI3) 
TO NEXT XIl 


* DIGREV MACRO TO DO EXCHANGE OF LOCATIONS FOR DIGIT REVERSAL. 


x 
DIGREV SMACRO I,J 
* 


LRLK AR1,X:I: 
LRLK AR2,X:J: 
* 
ZALH *,AR2 
ADDS «,ARl 
SACL «+,0,AR2 
SACH xt,0,AR1 
ZALH «,AR2 
ADDS x,AR1 
SACL x*,0,AR2 
; SACH «,0,AR1 
* 
SEND 


& 


* AR1 POINTS TO XI 
x AR1 POINTS TO XJ 


RAKKKEKEKEKAKEREREEKEREREREREKEREREKREERERERREREREREKEEEEREEREKEKEREKREREERKKRIK 


* MAIN ROUTINE TO CALL ABOVE MACROS WITH APPROPRIATE PARAMETERS. 


REKKKKKRKEKEKRERERKEERKREREKEKEKEKEREREKREKEKEEKREEEEREEEEEKKEREREKKKKKKKKKKK KKK 


FF256 


* 


AORG 0 
B32 


AORG 32 
LDPK 0 x STORE ALL TEMPORARY VARIABLES IN B2. 


INPUT DATA (256 REAL AND 256 IMAGINARY VALUES) 


LRLK AR1,X0 
RPTK 255 
IN x+, PAO 
RPTK 255 
IN x*+,PA0 


RKEKKKKKAKEKERERREKEKEREEKEEKEEEEEKEREREKEEEEKKKKEKREKEKKEKERKEEREEKEKKKKKKKKKKKKKK 


x 


* PASS 1 


& 


KREKKKEKKEEKREREKKKKERKKKEERKREKREREEEERERKEREEKEEKEKEEKREEKEREKEKERERKEKKKKKKKKKKK 


LRLK ARO, 256 x STEP VALUE BETWEEN BUTTERFLY "LEGS" 


ZERO * X0,X64,X128,X192,Y0, Y64,Y128,Y192 
NORMAL 1,2,3 * X1,X65,X129,X193 ,Y1,Y65,Y129,Y¥193 
NORMAL 2,4,6 * X2,X66,X130,X194,... 

NORMAL 3,6,9 

NORMAL 4,8,12 

NORMAL 5,10,15 


NORMAL 25,50,75 
NORMAL 26,52,78 
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SSI 


NORMAL 27,54,81 
NORMAL 28,56,84 
NORMAL 29,58,87 
NORMAL 30,60,90 
NORMAL 31,62,93 *& X31,X95,X159,X223 ,Y31,¥95,¥159,¥223 


SPCIAL 


* 


X32 ,X96,X160 ,X224,Y32,Y96,Y160,Y224 


* 


NORMAL 33,66,99 

NORMAL 34,68,102 
NORMAL 35,70,105 
NORMAL 36,72,108 
NORMAL 37,74,111 
NORMAL 38,76,114 
NORMAL 39,78,117 
NORMAL 40,80,120 
NORMAL 41,82,123 

NORMAL 42,84,126 

NORMAL 43,86,129 

NORMAL 44,88,132 

NORMAL 45,90,135 

NORMAL 46,92,138 
NORMAL 47,94,141 

NORMAL 48,96,144 
NORMAL 49,98,147 

NORMAL 50,100,150 
NORMAL 51,102,153 
NORMAL 52,104,156 
NORMAL 53,106,159 
NORMAL 54,108,162 
NORMAL 55,110,165 
NORMAL 56,112,168 
NORMAL 57,114,171 
NORMAL 58,116,174 
NORMAL 59,118,177 
NORMAL 60,120,180 
NORMAL 61,122,183 
NORMAL 62,124,186 
NORMAL 63,126,189 


X33 ,X97 ,X161,X225,Y33 ,Y97,Y161,Y225 
X34 ,X98,X162,X226,... 


+ 


* X63,X127,X191,X255,¥63,Y127,Y191,¥255 
* 
KKKRKKEKAKKK RIKKI ERR AKA RIK ERR ERIKA IIIA IKKE RIERA AIRE IRA AIRE 
x 
* PASS 2 
x 
RKRIKRAKKARK KK RARKRRKKERAKAKRKRERAEKK RAI AKIRA KKK KERRIER IR IIK IIIA RARER RIKI I 
x 
* FIRST SET OF BUTTERFLIES 
k 

LARK ARO ,64 STEP VALUE BETWEEN BUTTERFLY "LEGS" 
LRLK ARI ,X16 
LRLK AR2 , KO 
ZERO * X0,X16,X32,X48,Y0,Y16,Y32,Y48 


* 


NORMAL 4,8,12 

NORMAL 8,16,24 
NORMAL 12,24,36 
NORMAL 16,32,48 
NORMAL 20,40,60 
NORMAL 24,48,72 
NORMAL 28,56,84 * X7,X23,X39,X55,Y7,¥23,Y39,¥55 


* 


X1,X17,X33 ,X49,Y1,Y17,¥33 ,Y49 
X2,X%18,X34,X50,... 


* 


+ 


* 


be 


SPCIAL 


NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 


SECOND SET 


LRLK 
LRLK 
ZERO 


NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 


SPCIAL 


NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 


THIRD SET 


LRLK 
LRLK 
ZERO 


NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 


SPCIAL 


NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 


FOURTH SET 


36,72,108 
40,80,120 
44,88,132 
48 ,96,144 
52,104,156 
56,112,168 
60,120,180 


OF BUTTERFLIES 


AR1, X80 
AR2 , X64 


4,8,12 
8,16,24 
12,24,36 
16,32,48 
20 ,40,60 
24,48,72 
28,56, 84 


36,72,108 
40,80,120 
44 ,88,132 
48 ,96,144 
52,104,156 
56,112,168 
60,120,180 


OF BUTTERFLIES 


AR1,X144 
AR2 ,K128 


4,8,12 
8,16,24 
12,24,36 
16,32,48 
20 ,40,60 
24,48,72 
28,56, 84 


36,72,108 
40 ,80,120 
44,88 ,132 
48 ,96,144 
52,104,156 
56,112,168 
60,120,180 


OF BUTTERFLIES 


x 


* 


X8 ,X24,X40,X56,Y8,Y24,¥40 ,Y56 


X9,X25,X41,X57 ,¥9,¥25,Y41 ,Y57 
X10 ,X26 ,X42,X58,... 


X15 ,X31,X47,X63,Y15,Y31,¥47,Y63 


X64,X80 ,X96,X112,Y64,Y80,Y96,Y112 


X65 ,X81 ,X97,X113,Y65,Y81,¥97,¥113 
X66 ,X82,X98,K114,... 


X71 ,X87 ,X103,X119,Y71,Y87,Y103,Y119 
X72 ,X88 ,X104,X120,Y72,Y88,Y104,¥120 


X73 ,X89,X105,X121,Y73,Y89,Y105,Y121 
X74,X90,X106,X122,... 


X79,X95 ,X111,X127,¥79,Y95,¥111,Y127 


X128 ,X144,X160,X176,Y128,Y144,Y160,Y176 


X129 ,X145,X161 ,X177,Y129,¥145,Y161,¥177 
X130,X146,X162,X178,... 


X135,X151,X167,X183,Y135,Y151,¥167,¥183 
X136 ,X152,X168,X184,¥136 ,Y152,Y168,Y¥184 


X137 ,X153,X169 ,X185 ,¥137,¥153,Y169,Y185 
X138,X154,X170,X186,... 


X143 ,X159,X175,X191,¥143,Y159,Y175,Y191 


9SI 
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* 
x 


KKKKKEKKKEKKKKKRKRKEKKEKKREKKKRKKRKEKKKRKAKEKREREKARERERKEKERARKAKKEKKARERERERKREKEKKKKKKKK 


* 


LRLK 
LRLK 
ZERO 


NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 


SPCIAL 


NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 
NORMAL 


*x PASS 3 


* 


KKKKKKKAKKKKKEKKEKEKKKKKRERKEKKKKEKKKKKKKEKKERAKKKERREKKKEKKEKRAAKKKARKKKKKKKKKK KKK 


* 


* 


LARK 


LRLK 
LRLK 


ZERO 

NORMAL 
SPCIAL 
NORMAL 


LRLK 
LRLK 


ZERO 

NORMAL 
SPCIAL 
NORMAL 


LRLK 
LRLK 


ZERO 

NORMAL 
SPCIAL 
NORMAL 


LRLK 
LRLK 


ZERO 


AR1 , X208 
AR2 ,X192 


4,8,12 

8,16,24 
12,24, 36 
16,32,48 
20,40,60 
24,48,72 
28,56, 84 


36,72,108 
40 ,80,120 
44,88,132 
48 ,96,144 
52,104,156 
56,112,168 
60,120,180 


ARO,16 
AR1,X4 
AR2 ,X0 
16,32,48 
48 ,96,144 
AR1 , X20 
AR2 , X16 
16,32 ,48 
48 ,96,144 
AR1 , X36 
AR2 , X32 
16,32,48 
48 ,96,144 


AR1 , X52 
AR2 , X48 


* 


* 


+ + F + + Fe OF OF 


+ + + 


X192,X208 ,X224,X%240 , Y192 ,Y208 ,¥224 ,Y240 


X193 ,X209 ,X225 ,K241 ,¥193 ,Y209 ,Y225 ,Y241 
X194,X210,X226,X242,... 


X199,X215,X231 ,X247, 199, ¥215,¥231,Y247 
X200 ,X216 ,K232,X248, Y200,Y216,Y232 ,Y¥248 


X201 ,X217,X233 ,X249, Y201 ,Y217 ,¥233 ,Y249 
X202,X218,X234,X250,... 


X207 ,X223 ,X239,X255 , Y207 ,Y223,¥239,¥255 


STEP VALUE BETWEEN BUTTERFLY "LEGS" 


XO ,X4,X8,X12,Y0,Y4,Y8,Y12 
X1,X5,X9,X13,¥1,¥5,Y9,Y13 
X2,X6,X10,X14,¥2,Y6,Y10,Y14 
X3,X7,X11,X15,¥3,¥7,¥11,¥15 


X16 ,X20,X24,X28,Y16,¥20,Y24,Y28 
X17 ,X21 ,X25 ,X29,Y17 ,¥21,Y25 ,¥29 
X18 ,X22 ,X26,X30,Y18,¥22,Y26,Y30 
X19,X23 ,X27,X31,Y19,¥23,Y27,Y31 


X32 ,X36 ,X40 ,X44 ,Y32,Y36,Y40 ,Y44 
X33 ,X37 , X41,X45,Y33 ,Y¥37,Y41,Y45 
X34 ,X38,X42,X46 ,Y34,Y38,Y42,Y46 
X35 ,X39, X43 ,X47,Y35,¥39,Y43,Y47 


X48 , X52 ,X56,X60,Y48, ¥52,Y56, Y60 


NORMAL 
SPCIAL 
NORMAL 


LRLK 
LRLK 


ZERO 

NORMAL 
SPCIAL 
NORMAL 


LRLK 
LRLK 


ZERO 

NORMAL 
SPCIAL 
NORMAL 


LRLK 
LRLK 


ZERO 

NORMAL 
SPCIAL 
NORMAL 


LRLK 
LRLK 


ZERO 

NORMAL 
SPCIAL 
NORMAL 


LRLK 
LRLK 


ZERO 

NORMAL 
SPCIAL 
NORMAL 


LRLK 
LRLK 


ZERO 

NORMAL 
SPCIAL 
NORMAL 


LRLK 
LRLK 


ZERO 

NORMAL 
SPCIAL 
NORMAL 


16 ,32,48 
48 ,96,144 
AR1 , X68 
AR2 , X64 
16,32,48 
48 ,96,144 
AR1 , X84 
AR2 , X80 
16 ,32,48 
48 ,96,144 
AR1,X100 
AR2 , X96 
16,32,48 
48 ,96,144 
AR1,X116 
AR2,X112 
16 ,32,48 
48,96,144 
AR1 ,X132 
AR2 ,X128 
16,32,48 
48 ,96,144 
AR1 ,X148 
AR2 ,X144 
16,32 ,48 
48 ,96,144 
AR1 ,X164 
AR2 ,X160 
16,32,48 


48,96,144 


+ et OF + OF OF + FF + + +e + + + FF + OF * 


+ FF OF 


X49 ,X53 ,X57 ,X61,Y49,Y53,Y57,Y61 
X50,X54,X58,X62,Y50,Y¥54,Y58,Y62 
X51,X55,X59, X63, ¥Y51,¥55,¥59, Y63 


X64 ,X68 ,X72 , X76 ,Y64,Y68,Y72,Y76 
X65 ,X69,X73,X77,Y65,Y69,Y73,Y77 
X66 ,X70 ,X74, X78 ,Y66 ,Y70,Y74,Y78 
X67 ,X71,X75, X79, Y67,Y71,Y¥75,Y79 


X80 , X84, X88 ,X92,Y80,Y84,Y88,Y92 
X81 ,X85 ,X89 ,X93 ,Y81,Y85 ,Y89 ,Y93 
X82, X86 ,X90 ,X94 ,Y82,Y86 ,Y90 ,Y94 
X83 , X87 ,X91,X95, Y83 ,Y87,¥91,Y95 


X96 ,X100 ,X104,X108 ,Y96 ,Y100 ,Y104,Y108 
X97,X101,X105,X109,Y97,Y101,¥105,Y109 
X98 ,X102,K106,X110,¥98,Y102,Y106,¥110 
X99 ,X103,X107,X111,¥99,Y103,Y107,Y111 


X112,X116,K120 ,X124,Y112,¥116,Y120,Y124 
X113,X117,X121 ,X125,Y113,Y117,¥121 ,¥125 
X114,X118,X122 ,X126,Y114,Y118,Y122,Y126 
X115,X119,X123,X127,Y115,Y119,¥Y123,Y127 


X128 ,X132,X136 ,X140 ,Y128 , Y132,¥136,Y140 
X129,X133,X137 ,X141,¥129,Y133,¥Y137,Y141 
X130,X134,X%138 ,X142,Y130,Y134,Y138,¥142 
X131,X135,X139 ,X143 ,¥131,Y135,Y139,Y143 


X144,X148,X152,X156,Y144,Y148,Y152,Y156 
X145 ,X149 ,X153 ,X157 ,Y145 ,Y149,¥153 ,Y157 
X146 ,X150,X154,X158, Y146,Y150,Y154, Y158 
X147,X151,X155,X159,Y147,Y151,¥155,Y159 


K160 ,X164,X168 ,X172 ,Y160 ,Y164,Y168,Y172 
X161,X165,X169,X173,Y161,Y165,Y169,Y173 
%162,X166 ,K170,X174,¥162,Y166,¥Y170,Y174 
X163 ,X167,X171,X175,Y163,Y167,¥Y171,Y175 
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LSI 


LRLK 
LRLK 


ZERO 

NORMAL 
SPCIAL 
NORMAL 


LRLK 
LRLK 


ZERO 

NORMAL 
SPCIAL 
NORMAL 


LRLK 
LRLK 


ZERO 

NORMAL 
SPCIAL 
NORMAL 


LRLK 
LRLK 


ZERO 

NORMAL 
SPCIAL 
NORMAL 


LRLK 
LRLK 


ZERO 
NORMAL 
SPCIAL 
NORMAL 
x 
x 


KAKKEKKKKKKEKEKKKKERRERKEEKREKEKRKEKKEREKKREREREKEKRKRKKEEKKREKRKREREREKKAEKEKEREREKEKK 


* 


x PASS 4 
* 


RHKKKKKKKEKRAEKKEKEREKIEREREKEREREREKKKEKEREREKEKEKREEKRKEKEREREKKREAKKEKREEREREREKRKKKKK 


® 


* 


AR1,X180 
AR2 ,X176 
16,32,48 
48 ,96,144 
AR1,X196 
AR2 ,X192 
16,32,48 
48,96 ,144 
AR1,X212 
AR2,X208 
16,32,48 
48 ,96,144 
AR1,X228 
AR2 ,X224 
16,32,48 
48 ,96,144 
AR1 , X244 
AR2 , X240 
16,32,48 


48 ,96,144 


ARO,4 


AR1,X1 
AR2 , XO 


AR1,X5 
AR2 , X4 


AR1,X9 
AR2,X8& 


AR1,X13 
AR2 ,X12 


+e + + + ee + + % 


+ + F 


* 


x 


* 


X176,X180 ,X184,X188,Y176 ,Y180,Y184,Y188 
X177,X181,X185 ,X189,Y177,¥181,Y185,¥189 
X178,X182,X186,X190,Y178, ¥182,Y186,Y190 
X179,X183 ,X187,X191,Y179,¥183,¥187,Y191 


X192 ,X196 ,X200 ,X204,Y192,Y196,Y200,Y204 
X193 ,X197 ,X201 ,X205,Y193,¥197,Y201,Y205 
X194,X198 ,X202 ,X206,Y194,Y198 ,Y202,Y206 
X195 .X199,X203 ,X207,Y195,¥199,Y203 ,Y207 


X208 ,X212,K216,X220,¥208,¥212,¥216,Y220 
X209 ,X213,X217 ,X221,Y209,¥213,Y217,Y221 
%210,X214,X218,X222,Y210,¥214,¥218, Y222 
X211,X215,X219 ,X223,Y211,¥215,¥Y219,¥223 


X224 ,X228 ,X232 ,X236,Y224,Y228,Y232 ,¥236 
X225 ,X229 ,X233 ,X237 ,Y225 , Y229 ,Y233 , 237 
X226 ,X230 ,X234 ,X238 , Y226 ,Y230, Y234,Y238 
X227 ,X231 ,X235 ,X239,¥227 ,Y231,Y235 ,Y239 


X240 ,X244,X248 ,X252,Y240 ,Y244,Y248 ,Y252 
X241 ,X245 ,X249 ,X253 ,¥241 ,Y245 ,Y249 ,Y253 
X242 ,X246 ,X250 ,X254, Y242 , ¥246 , Y250 ,Y254 
X243 ,X247 ,X251,X255 , Y243 ,Y247 ,Y251,¥255 


STEP VALUE BETWEEN BUTTERFLY "LEGS" 


XO ,X1,X2,X3,Y0,Y1,¥2,¥3 


X4,X5,X6,X7,Y4,Y5,Y6,Y7 


x X8,X9,X10,X11,Y8,Y9,Y10,Y11 


AR1,X17 
AR2 ,X16 


AR1 , X21 
AR2 ,X20 


AR1 , X25 
AR2 ,X24 


AR1 , X29 
AR2 , X28 


AR1 , X33 
AR2 , X32 


AR1 , X37 
AR2 , X36 


AR1 , X41 
AR2 , X40 


AR1 , X45 
AR2 , X44 


AR1 , X49 
AR2 , X48 


AR1, X53 
AR2 ,X52 


ARI , X57 
AR2 , X56 


AR1 , X61 
AR2 , X60 
AR1, X65 


AR2 , X64 


AR1 , X69 
AR2 , X68 


AR1 , X73 
AR2 ,X72 


AR1 ,X77 
AR2 , X76 


AR1 , X81 
AR2 , X80 


AR1, X85 
AR2 , X84 


ARI , X89 
AR2 , X88 


AR1 , X93 


* 


* 


* 


* 


X12,X13,X14,X15,¥12, ¥13,¥24, ¥15 
X16,X17,X18,X19,¥16,¥17,¥18,¥19 
X20 ,X21,X22,X23,¥20,¥21,¥22,¥23 
X24 ,X25,X26 ,X27,Y24,¥25,¥26,¥27 
X28 ,K29 ,K30, X31, ¥28,¥29,¥30,¥31 
X32,X33, X34, X35, Y¥32,¥33,¥34,¥35 
X36 , X37, X38, X39, ¥36 , 37 ,¥38,¥39 
X40, X41,X42,K43,¥40,¥41,¥42,¥43 
X44 ,X45, X46, X47, Y44,¥45, ¥46,Y47 
X48 X49, X50,X51,Y48,¥49,Y50,¥51 
X52,X53,X54,X55,¥52,¥53,¥54,¥55 
X56 ,X57,X58,X59, 56, ¥57, ¥58,¥59 


X60, X61,X62,X63,Y60 , Y61 ,Y62,Y63 


X64, X65, X66, X67 , Y64, Y65 , Y66 , Y67 
X68 , X69 ,X70,X71,Y68,¥69,Y70,Y71 
X72 ,X73 ,X74,X75 ,Y72 ,Y73,Y74,Y75 
X76 ,X77 ,X78 ,X79,Y76 ,Y77 ,Y78 ,Y79 
X80 , X81 , X82, X83 ,Y80 , Y81 , Y82 , Y83 
X84 ,X85 , X86 ,X87 , Y84,Y85 , Y86 , Y87 


X88 ,X89 ,X90,X91,¥88,Y89 ,Y90,Y91 


8SI 
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AR2 , X92 


AR1 , X97 
AR2 , X96 


AR1 ,X101 
AR2 ,X100 


AR1,X105 
AR2 , X104 


AR1,X109 
AR2,X108 


AR1 ,X113 
AR2,X112 


AR1 ,X117 
AR2,X116 


AR1,X121 
AR2 ,X120 


AR1 ,X125 
AR2 ,X124 
AR1 ,X129 
AR2,X128 


AR1 ,X133 
AR2 ,X132 


AR1 ,X137 
AR2 ,X136 


AR1 ,X141 
AR2 ,X140 


AR1 ,X145 
AR2 ,X144 


AR1 , X149 
AR2 ,X148 


AR1 ,X153 
AR2 ,X152 


AR1 ,X157 
AR2 ,X156 


AR1 ,X161 
AR2 , X160 


AR1 ,X165 
AR2 ,X164 


AR1,X169 
AR2 ,X168 


* 


* 


* 


X92 ,X93,X94,X95,Y92 ,¥93,Y94,Y95 


X96 ,X97,X98 ,X99, Y96 ,Y97 ,Y98,Y99 


X100,X101,X102,X103 ,Y100,Y101 ,Y102,¥103 


X104,X105,X106 ,X107 , Y104,Y105 ,¥106 ,Y107 


X108 ,X109,X110,X111,¥108,Y109,Y110,Y111 


X112,X113,X114,X115,¥112,Y113,¥114,¥115 


X116,X117,X118,X119,Y116,Y117,¥118,¥119 


X120,X121,X122 ,X123 ,Y120,¥121 ,Y122,Y123 


X124,X125 ,X126 ,X127 ,Y124,¥125 ,Y126 ,¥127 


X128 ,X129,X130,X131,¥128,¥129,Y130,Y131 


X132,X133 ,X134,X135 ,Y132,Y133 ,Y134,¥135 


X136 ,X137,X138,X139, ¥136, Y137,Y138,Y139 


X140 ,X141 ,X142,X143 ,Y140,Y141 ,Y142,¥143 


X144,X145 ,X146,X147 ,Y144,Y145 ,Y146,Y147 


X148 ,X149 ,X150,X151 ,Y148,Y149 ,Y150,¥151 


X152,X153,X154,X155 ,Y152,¥153,Y154,¥155 


X156,X157,X158,X159 ,Y156,¥157 , ¥158,Y159 


X160 ,X161 ,X162,X163 ,Y160,Y161 ,Y162,Y163 


X164,X165 ,X166,X167 ,Y164,Y165 , Y166,Y167 


X168 ,X169,X170,X171 ,Y168,Y169 ,¥170,Y171 


LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 


LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 
ZERO 
LRLK 
LRLK 


AR1 ,X173 
AR2,X172 


AR1 ,X177 
AR2 ,X176 


AR1,X181 
AR2,X180 


AR1,X185 
AR2 ,X184 


AR1,X189 
AR2 ,X188 
AR1 ,X193 
AR2 ,X192 


AR1 ,X197 
AR2 ,X196 


AR1 ,X201 
AR2 ,X200 


AR1 , X205 
AR2 , X204 


AR1 ,X209 
AR2 ,X208 


AR1 , X213 
AR2 ,X212 


AR1 ,X217 
AR2 ,X216 


AR1 ,X221 
AR2 ,X220 


AR1 ,X225 
AR2 ,X224 


AR1 ,X229 
AR2 ,X228 


AR1 , X233 
AR2 , X232 


AR1 , X237 
AR2 ,X236 


AR1 ,X241 
AR2 ,X240 


AR1 , X245 
AR2 , X244 


AR1 ,X249 
AR2 , X248 


* 


+ 


* 


* 


* 


X172,X173 ,X174,X175 , Y172,Y173,¥174,Y175 


X176 ,X177 ,X178 ,X179 ,Y176 ,Y177 ,Y178 ,¥179 


X180 ,X181,X182,X183,Y180,Y181,Y182,¥183 


X184,X185 ,X186 ,X187 ,Y184,¥185,Y186 ,Y187 


X188 ,X189,X190,X191,Y188,Y189,Y190,Y191 


X192,X193 ,X194,X195,Y192,Y193 ,¥194,Y195 


X196 ,X197 ,X198,X199,¥196,¥197,Y198,Y199 


X200,X201 ,X202,X203 ,Y200,Y201 ,¥202,¥203 


X204,X205 ,X206 ,X207,¥204,Y205 , Y206 ,¥207 


X208 ,X209 ,X210,X211,¥208,Y209 ,Y210 ,Y211 


X212,X213 ,X214,X215,Y212,Y213,¥214,¥215 


X216,X217 ,X218,X219,Y¥216,Y217,Y218,¥219 


X220,X221 ,X222,X223 ,Y220,Y221 ,Y222 ,¥223 


X224,X225 ,X226 ,X227 ,Y224,Y225 ,Y226 ,¥227 


X228 ,X229 ,X230,X231,¥228,¥229,¥230,Y231 


X232,X%233 ,X234,X235,Y232 ,¥233 ,Y234,Y235 


X236 ,X237 ,X238 ,X239 ,Y236 , Y237 ,Y238 ,¥239 


X240 ,X241 ,K242 ,X243 ,Y240,Y241 ,Y242 ,Y243 


X244,X245 ,K246 ,X247 ,Y244 ,Y245 ,Y246 , ¥247 
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6SI 


& 
k 


ZERO 
LRLK 
LRLK 
ZERO 


AR1 ,X253 
AR2 ,X252 


& KX248 ,X249,X250,X251,Y248,¥249 ,Y250, Y251 


* X252,X253,X254,X255 ,Y252 ,Y253 ,¥254,¥255 


x DIGIT REVERSE COUNTER FOR RADIX-4 FFT COMPUTATION. 


* 


LARP 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 


1 


1,64 
2,128 
3,192 
4,16 
5,80 
6,144 
7,208 
8,32 
9,96 
10,160 
11,224 
12,48 
13,112 
14,176 
15,240 
17,68 
18,132 
19,196 
21,84 
22,148 
23,212 
24,36 
25,100 
26,164 
27,228 
28,52 
29,116 
30,180 
31,244 
33,72 
34,136 
35,200 
37,88 
38,152 
39,216 
41,104 
42,168 
43,232 
44,56 
45,120 
46,184 
47,248 
49,76 
50,140 
51,204 
53,92 
54,156 
55,220 
57,108 
58,172 
59,236 


DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 


DIGREV — 


DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 
DIGREV 


175,250 
179,206 


DIGREV 183,222 
DIGREV 187,238 
DIGREV 191,254 
DIGREV 199,211 
DIGREV 203,227 
DIGREV 207,243 
DIGREV 219,231 
DIGREV 223,247 
DIGREV 239,251 


OUTPUT FFT DATA 


* 


LARP 1 
LRLK AR1,X0 
RPTK 255 
OUT x«+,PAl 
RPTK 255 
OUT x«+,PAl 

* 

x FFT COMPLETE. 

* 

WHOA B WHOA 
END 


4. Implementation of Fast Fourier Transform Algorithms with the TMS32020 


APPENDIX F 
A 128-POINT, RADIX-2 DIF FFT IMPLEMENTATION (LOOPED CODE) 
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TIDY “FFT2° 
He 
# COCLEY-TUKEY 128-POinT, RADIX-2, DIF FFT PROGRAM FOR THE TMS2202 
pia 
+ SINGLE FFT BUTTERFLY. 
ia REAL INFUT DATA, STORED IN FAGE 4 OF BLOCK Bo. 
* FFT COMPUTATICN IS DONE IN FAGES 4, 7 OF BLOCK Bi (COMPLEX NUMBERS). 
* USES TABLE LOCKUP (FROM FROGRAM MEMORY) OF THE TWIDDLE FACTORS. 
% INTERMEDIATE VALUES ARE SCALED BY .5S AT EACH STAGE SC AS TO FREVENT 
+ THE POSSIBILITY OF OVERFLOW. 
He NO} EXTERNAL RAM 15 USED. 
+ THE MAGNITUDE OF THE FFT IS COMPUTED AND NORMALIZED 30 THAT ITS MAXIMUM 
+ VALUE HAS A NONZERO BIT AFTER THE BINARY FOINT. 
* 
DE He Se HE HE TE HE FE FE IE TE HE HE ETE SE HE SE HE HE HE HE SE HE He te SE He FE HE HE SE SE HE HE SE SE TE at ESE HE HE SE TE TE He SE HE FE HE HE HE HE HE tb He HE HE HE te TE IE SE dt SE Hee SE SE aE te at BE HE te SE HE HE 
+ 
*# N IS THE SIZE OF THE TRANSFORM. N = SM. 
+ 
N EQ 125 
M EDU 7 
+t 
XIN EQU Siz * LOCATION GF REAL INFUT DATA 
XOUT Eats £40 # LOCATIGN MF REAL DUTPUT DATA 
XFFT EGU 745 *# LOCATION GF COMPLEX DATA FOR THE FFT 
+ 
%* BLOCK B2 DATA MEMORY ALLOCATION (DF = © WILL ALWAYS FUOINT TO B2). 
* 
XT EWU 36 # TEMPORARY ~— REAL FART 
YT EQU 97 * TEMPORARY - IMAGINARY FART 
I EQ 9S * 157 INDEX 
IA EGU oy * INDEX TO TWIDDLE FACTORS 
Ie EGU 100 #* INCREMENT Ta IA 
HGLDN EGU LOL # KONTAINS VALUE N 
BUARTN EGU 102 # CONTAINS VALLIE N/4 
Ni Eau 103 #* INCREMENT To I. 
Na EGU 104 * SEFARATIGN OF I AND L 
J EQU 105 # LOOP COUNTER 
kK EQU 104 * BIT REVERSAL INDEX COlLINTER 
ONE EDU 107 * CONTAINS VALUE 1 
TADIIR EGuU 106 # CONTAINS INFUT 
SINTBL EGU 109 * SINE TABLE FOINTER 
SIN EGU 110 # SINE LOCATICN 
cos EG 111 * COSINE LOCATION 
MAX EGU 112 *# MAX VALUE GF THE SOR MAGNITUDE OF FFT 
+ 
J 
* BEGIN FROGRAM MEMORY SECTION. 
+ 
Cee ee ee ee ee ee ee ee ee ee ee ee ee ee ee ee ee 
% 
AGRG © 
RSVECT B 33 
AURG 32 
INIT LOFK © * ALWAYS POINT TO Be FOR TEMF STORAGE 
TNFD 
SOVM 
SSXM #% 32010 ARITHMETIC 
SPM 1 *# SHIFT FRODUCT LEFT BY 1 
LACK 1 
SACL ONE 
SACL IE * INITIALIZE IE = 1 
LALK N 
SACL HOLIIN #* HOLDIIN = N 


SACL NZ 
LAC HOLDN, 14 
SACH GUARTN 
LRLE ARS, XFFT 
SAR ARS, IALDR 
LARF = 
LARK AR1,M-1 
+ 
#* READ IN 12& REAL POINTS 
* 
LRLK ARZ, XIN 
RPTE, 127 
IN #+, FAO 
+ 
# INITIALIZE FFT RAM 
+ 
LRLE: ARZ, XFFT 
ZAC 
SACL MAX 
RPTK. 255 
SACL t+ 
# 


+ MOVE REAL TATA FROM INPUT 


LOCATIONS 


INITIALIZE NZ = N 


GQUARTN = 
ADDRESS 
STORE 


N/4 
OF COMPLEX INFUT BATA 
ON FAGE 0 


AR1 CONTAINS E COUNTER 


TO COMPLEX FFT LOCATIONS 


LRLK AR=, XFFT 
LARK. ARO, 2 
RPTK 127 
BLED XIN, #O+ 
* 
* MOVE SGUARED MAGNITUDE OF PREVIOUS COMPUTATION Ta GUTPUT LOCATIONS 
+ 
LRLKE ARZ, XOUT 
RPTE 127 
BLED XFFT, #+ 
+ 
* FFT COMFPUTATICIN 
+ 
KLOOP LAC NZ,15 
SACH N1,1 % Ni = Nz 
ACH N2 # NZ = N2Z/2 
ZAC 
SACL IA * IA = 0 
SACL J * J = O 
LAR ARZ,NzZ * ARS CONTAINS J VALUE 
MAR #-,ARS # START AT N2-1 
* 
LALE: SINE 
SACL SINTBL * SINE TABLE BASE ADDRESS 
+ 
ILOOF Lac J,1 
SACL I * IT = J (DATA ORGANIZED AS REAL VALUE FOLLOWED 
+ BY IMAGINARY SC THAT ADDRESS I IS = TIMES J). 
+ 
ILOOF LAR ARO,I 
LAR AR=, LADLR # LOAD INFUT BASE ADDRESS 
MAR #0+ * ARS = I + IADDR 
LAR ARO,N1 % ADL N2#2 (N1 = NZ#2) 
LAC #0+,15 * LOAD (1/2)XI, POINT TO XL 
* (L =I + NZ) 
SUB #,15 % XT = (1/72)¢XI -— XL) 
SACH XT * STORE XT ON FAGE O 
ADDH #0- # XI = (1/2)(X1 + XL), POINT TO XI 
SACH #+ % STORE XI, FOINT TO YI 


e 
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e9l 


ke 


* 


+ DIGIT REVERSE COUNTER FOR RADIX-z FFT 


* 


LAC 
SuB 
SATH 
ADH 
SACH 


LAC 
TELR 
ADL 
TRLR 


LT 
MFY 
LTF 
MFY 
SFAC 
SACH 
MFY 
LTP 
MFY 
AFACT 
SACH 


#04+,15 
*#,15 
YT 

#O- 
HOF 


SINTBL 
SIN 
GUARTN 
cas 


cos 
YT 
SIN 
XT 


ADD INCREMENT FOR NEXT LOOP. 


Lac I 
ADD Ni,1 
SACL I 
SUB HOLDIN, 1 
BLZ ILOGP 
LAC SINTBL 
ADD Ie 
SACL SINTBL 
Lac J 
ANB CONE 
SACL J 
LARF 2 
BANZ JLOUP, #-, AR? 
Lac 16,1 
SACL I€ 
LARP 1 


BANZ KLOOP, #-, ARZ 


DRC2 ZAC 
SACL I 
SACL J 
LAR ARO, IADDR 
LARP 4 
LAR AR4, HOLDN 
MAR #-— 
MAR #-,ARZ 
DRLOOF SUB JJ 


* 


BGEZ NOSWAP 


* INDEX 


* IF I< J, 


LOAD (1/2)YI, 
YT = (1/2)¢Y1 
YI = 


STORE YI, 


READ IN SINE 


READ 


YT#COS——>F 
XT#SIN-->P 


YL = 
YT#SIN-->P 


XT#COUS-—->P 


(1/2)(YI + YL), 


IN COSINE 


POINT TO YL 
- YL) 


POINT TO YL 


XT#COS -— XT#SIN 


XL = XT#COS + YT#SIN 


I= 1+ Ni 


WHILE I < N 


COMPUTATION. 


* FOR I = 0 TQ 


SINTBL FOINTER BY IE 


N-Zz 


THEN SWAP 


POINT TO YI 


LAR AR2,.J 
MAR #0+, ARS 
LAR ARZ,I 


J = J + ITADDR 


# SWAP I AND L VALUES. 


NOSWAP 


INLOOP 


OUTLOP 


a KK 


CONT 
* 
% 
+ 


*x 


* 
LG@OP 


COMPUTE THE 


LRLK 
LARK 


1 ZAC 
MPYK 
SQRA 
SQRA 
APAC 
SACH 
SUBH 
BLEZ 
LAC 
SACL 
BANZ 


MAR 
ZALH 
apas 
SACH 
SACL 
ZALH 
ADDS 
SACH 
SACL 
LAC HOLDN 
SACL K 
LAC 
SuB 
BLZ 
ADD kK,1 
SACL J 
LAC I 
ADE ONE,t 
SACL I 


#0+, ARE * l= 
#, ARS 
* 
#+,0,ARZ #* X(1) 
e+ * XJ) 
# ARS 
* 
#,0,ARZ * Y(I) 
#,0, ARS * YC) 
J 
K 
DUTLOP, #, AR4 
SACL J 
Lac K,15 
SACH K 
B INL.OOP 


BANZ DRLOOP, #-,ARZ 


I + IADDR 


ou 


oil 


* 


* 


cc 
Wi 

q 
I 

x 


x 
" 
vr 
N 
N 


INCREMENT I 


MAXIMUM VALUE OF THE SG@UARED MAGNITUDE OF PFT 


AR2, XFFT 
ARS, 127 


Oo 
++ 


#+, ARS 


IADDR 
MAX 

CONT 

IADDR 

MAX 

LOOP1, #-,ARZ 


* XC 1) #2 
# YC) #¥zZ 


* IADDR = X(L)##2 + Y( I) ##2 


NORMALIZE THE MAX VALUE TO FIND EXPONENT 


LARK 
ZALH 
RETK 
NORM 
SAR 


ARZ,0 
MAX 
14 


AR2,1 


COMPUTE SG@UARED MAGNITUDE OF FFT 


LRLK 
LRLK 
LARK 


2 ZAC 
MP YK. 
S@RA 
SQRA 


ARZ, XFFT 
AR1, XFFT 
AR, 127 


o 


H+ 
#+,AR1 


* ARZ CONTAINS EXPONENT 
* -STORE EXPONENT IN I 


% XC LT) e#Z 
* YCI) #82 


vol 
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AFAC 
SFR 
RET I 
BFL af 
SACH #+,0,AR3 
BANZ LOOF2, #-,ARZ 
+ 
* OUTPUT SOARED MAGNITUDE 
* 
LRLE ARZ, XFFT 
RETE 127 
CIT t+ FAS 
% 
* FFT COMPLETE. 
* 
WHOA B WHOA 
+ 


* COEFFICIENT TABLE (SIZE OF TABLE IS 3N/4), 
+ 
SINE EGU $ 


DATA >42F2 
DATA 244600 
DATA =4AGE 


DATA = 
COSINE EQU 
DATA 
DATA 
DATA =; 
DATA : 
DATA = 
DATA 
DATA 
DATA 2 
DATA 27642 
DATA >73B4 


* XCWITCY) 


* NORMALIZE RESULT 


X(T) ##2 + YC) #2 


DATA 
DATA 
LIATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
LATA 


=>7OES3 
>4DCA 
S6AGE 
Sé4DB0 
2G2F2 
>SED7 
SSASZ 
PSSFA 
23134 
24l4o 


DATA > 


LATA 
DATA 
LATA 


DATA > 
DATA > 


DATA - 


LATA > 
DATA : 
LATA = 


DATA 


DATA, >FS 
DATA > 


DATA > 


DATA =D 
DATA ° 
DATA => 
DATA = 
DATA = 
TATA “= 
DATA =BEE 
DATA =BS 
DATA = 
LATA = 
LATA 


DATA 


DATA : 


DATA 


DATA > 
DATA = 


DATA 


DATA : 


DATA : 


DATA = 
DATA +8555 


DATA 


DATA : 
DATA 2& 
ATA > 
DATA 4 


END 


APPENDIX G 
A 256-POINT, RADIX-2 DIF FFT IMPLEMENTATION (LOOPED CODED) 
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BLKP SINE, #+ 


IDT “FFT2~ LRLK AR4, TABLE * ADDRESS OF SINE TABLE 
* LAR ARO, QUARTN 
* COOLEY-TUKEY 2546-POINT, RADIX-2, DIF FFT PROGRAM FOR THE TMS32020. MAR +*0+,AR2 * COSTBL = TABLE + N/4, point to J caunter 
* SAR AR4,COSTBL 
* SINGLE FFT BUTTERFLY. LARK AR1,M-1 * AR! CONTAINS K COUNTER 
* COMPLEX INPUT DATA, STGRED AS X(1I), YC(I), XC(I+1), YCI+1), ..4 * 
* USES TABLE LOOKUP (FROM EXTERNAL DATA MEMORY) GF THE TWIDDLE FACTORS. * READ IN 256 COMPLEX POINTS 
* INTERMEDIATE VALUES ARE SCALED BY .S AT EACH STAGE SO AS TO PREVENT * 
* THE POSSIBILITY OF OVERFLOW. LRLK ARZ, INPUT 
* RPTK. 255 
SE TEA He Fb THE TE HE HE HE HEE HE SEE HE HE SE SE HE FE FEE SE SE IE IE HE SE HE TE TE SE HE HE ESE SE HE SE ETE HE HE SE SE TE HE HE HE IE HE JE SE HE EE SE JE SE SE Se IE HE HE IEE He SHE SE 0 EE HE EE IN #+,PAO 
* RETK. 255 
* N IS THE SIZE OF THE TRANSFCIRM. N = 2M, IN #+,PAO 
* * 
N EGU 256 + FFT COMFUTATIGCN 
M EQU 3 * . 
* KLOGP LAC N2,15 
INPUT EOU Siz #* LOCATION OF COMPLEX INPUT DATA IN INTERNAL BATA MEM SACH N1,1 *# Ni = No 
TABLE Eis 1024 *# LOCATION OF COEFFICIENT TABLE IN EXTERNAL CATA MEM SACH NZ * NZ = N2E/2 
* ZAC 
* BLOCK BZ DATA MEMORY ALLOCATION (OP = © WILL ALWAYS POINT TO Hz). SACL IA * IA = 0 
* SACL J # J = O 
XT EQU 34 * TEMFORARY - REAL PART LAR ARZ,NZ *% ARZ CONTAINS VALUE 
YT EGU 97 * TEMPORARY - IMAGINARY FART MAR #-,ARS3 * START AT Nze-t 
I EQUI ¥sS * 1ST INDEX % 
IA EDU 99 # INDEX TO TWIBDDLE FACTORS LAR AR4S,COSTBL # COSINE TABLE BASE ADDRESS 
IE EGU 100 # INCREMENT TO IA * 
HOLDN Eau 101 * IONTAINS VALUE N LOGOF LAC wt, 
QUARTN EQU 102 % IZONTAINS VALUE N/4 SACL I * I = J (BATA ORGANIZED AS REAL VALUE FOLLOWED 
N1 EQU 103 * INCREMENT TO I. * BY IMAGINARY 30 THAT ADDRESS I IS 2 TIMES .!). 
N2 EQU 104 # SEFARATION OF I AND L * 
J Ea 105 # LOOP COUNTER ILOOF LAR ARO,I 
K EQUI 106 # BIT REVERSAL INDEX COUNTER LAR ARS, IADR * LOAL INFUT BASE ADDRESS 
ONE Emly 107 # CONTAINS VALUE 1 i MAR #0O+ # ARS = I + IADDR 
TADDR Emu 106 # CONTAINS INPUT LAR ARO,NI + ADD Ne#2 (NL = NE#2) 
COSTBL EA 10% * COSTBL = TABLE + N/4 LAD #04+,15 %# LOAD (172)XI, FOINT TO Xb 
* +t (L = I + N2) 
ca SUB #,15 % XT = (1/2) (XT - XL) 
* BEGIN FROGRAM MEMORY SECTION. SACH XT # STORE XT ON FAGE © 
* ADERH #0- # XD = CL/72)¢XI + XL), POINT TO XI 
Cece ee ee ee ee ee ee SACH a+ *# STORE XI, FOINT Ta YI 
* LAC #0+,15 * LOAD (1/2)YI, FOINT TO YL 
ACRG SUB #,15 # YT = (1/2)0YI - YL) 
REVERT B SACH YT 
AUDRG ALDLIH #0- # YI = (1/2) (YI + YL), POINT Tao YI 
INIT LOPE: # ALWAYS FOINT TO Be FOR TEMP STORAGE SACH #0+,0,AR4 # STORE YI, FOINT TO YL 
SOVM ea AR4 POINTS TO COSTBL 
SoXM # 22010 ARITHMETIC LAR ARO, GUIARTN 
SPM 1 * SHIFT FROOQUCT LEFT BY 1 LT #O~ % LOAD T WITH Cos, POINT Ta SIN 
LACK 1 MFY YT 
SACL. ONE LTE #0+,ARS # ACK <-- CHYT, POINT TO YL 
SACL ITE * INITIALIZE IE = 1 MFY XT 
LALK N SPAC # YL = CHYT — S#XT 
SACL HOLIIN # HOLIIN = N SACH #-,0,AR4 # STORE YL 
SACL N2 * INITIALIZE N2 = N MFY YT 
LAC HOLDIIN, 14 LTF #,ARS # ACO L-- SeYT, FOINT TO XL 
SACH QUARTN # GQUARTN = N/4 MFY XT 
LRLE ARS, INFUT *# ADDRESS OF COMPLEX INFUT DATA AFAC * XL = CH#eXT + SHYT 
SAR ARS, IADDR # STORE ON FAGE O SACH # *# STORE Xi 
LRLE AR4, TABLE # ADDRESS CF SINE TABLE a 
LARF 4 % 
RETK 1°71 # MOVE ive COEFFICIENTS # ADL INCREMENT FOR NEXT LOGF. 
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LOI 


* 


Lac I! 
ADD Ni,1 * 1 = 1+ Ni 
SACL I 
SUB HOLDN, 1 * WHILE I< N 
BLZ2 ILOOP 
LAR ARO, IE * INDEX COSTBL POINTER BY IE 
LARP 4° 
MAR = #0+, ARZ 
LAC J 
ADD ONE *J=J+t 
SACL J 
BANZ JLOOP, #-, ARS 
LAC IE,1 
sacl. I€ * IE = 2 #* IE 
LARP 1 


BANZ KLOOP, # 


~, AR2Z 


* DIGIT REVERSE COUNTER FOR RADIX-2 FFT COMPUTATION. 


* 


DRC2 zac 
sact I 
SACL J 
LAR ARO, IADDR 
LARP 4 
LAR AR4,HOLDN * FOR I = 0 TO N-2 
MAR #- 
MAR #-,ARZ 
DRLOOP SUB J * IF I < J, THEN SWAP 
BGEZ NOSWAP 
* 
LAR AR2,J 
MAR #0+, ARS * J = J + LADDR 
LAR AR3,I 
MAR #04, ARZ * 1 = I + IADDR 
# SWAP I AND L VALUES. 
ZALH #, ARS 
ADDS * 
SACH #+,0,AR2 # X(I) = X(J) 
SACL #+ : * XCS) = Xr) 
* 
ZALH #,AR3 
ADDS * 
SACH #,0,ARZ2 * ¥CI) = YC) 
SACL *,0,AR3 * Y(S) = Y¥CD) 
NOSWAP LAC HOLDN 
SACL K * K = N 
INLOOP Lac J 
SUB K * IF J >= K THEN 
BLZ OUTLOP,#,AR4 
SACL J *J=J5- K 
Lac k,15 
SACH K * K = K/2. 
B INLOOP 
OUTLOP ADD K,1i 
SACL JJ *tL#2wtl+Jd 
Lac I 
ADD ONE, 1 
SACL I * INCREMENT I 


BANZ DRLGOP, #-,ARZ 


* 


* 


OUTPUT 


LRL 


FFT Vv 


K 


RPTK 


OUT 


RPTK 


OUT 


+ FFT COMPLETE. 


* 


WHOA 
* 


* COEFFICIENT TABLE (SIZE OF TABLE IS 3N/4). 


* 
SINE 


B WHOA 


EQU $ 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
BATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


?0 
>324 
>648 
>?6B 
>cs8c 
>FAB 
>?12C8 
> 15E2 
>1LSF9 
>1CO0C 
>LFLA 
>2224 
>2528 
>2827 
>2BiF 
>2E11 
>SOFC 
>33DF 
>3éBA 
>398D 
>3CS7 
>3F17 
>41CE 
>447B 
>471D 
>49B4 
>4C40 
>4ECO 
>S5134 
>539B 
DSSF6 
25843 
>SA82 
>SCB4 
>SED7 
>é0EC 
>62F2 
>&4E? 
>6é6DO0 
>68A7 
>6AGE 
>6C24 
>éDCA 
>6FSF 
>70E3 
27255 
>73B6 
>7505 


ALUES 


AR2, INPUT 
255 
#+,PAL 
2355 

#+, PAL 


891 
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DATA 
DATA 
DATA 
DATA 
BATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
TOSINE EQU 
BATA 
DATA 
DATA 
DATA 
DATA 
DATA 
BATA 
BATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
BATA 
DATA 
DATA 
BATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
BATA 
DATA 
DATA 
DATA 


27642 
>776C 
27885 
>798A 
>7A7D 
>7BSD 


>7C2A 


>7CE4 
>7D8A 
>7E1E 
>7E9D 
>7FOA 
>7F 62 
>7FAT7 
>7FD9 
>7FF6 
$ 

>7FFF 
>7FFE 
>7FD9 
>7FA7 
>7F62 
>7FOA 
>7E9D 
>7E1E 
>708A 
>70E4 
>7CZA 
>7B5D 
>7A7D 
>798A 
>7885 
>776C 
>7642 
>7505 
>73B4 
>7255 
>70E3 
>OFSF 
>6DCA 
>6C24 
>OAGE 
>68A7 
>66DI0 
>O4E9 
>62F2 
>60EC 
>SEL7 
>SCB4 
>5A82 
>5843 
>SSF6 
>539B 
>5134 
>4ECO 
>4C40 
>49B4 
>471D 
>447B 
>41CE 
>3F17 
>3CS7 
>398D 


BATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
LATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
BATA 
DATA 
BATA 
DATA 
LATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
LATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
BATA 
DATA 


>3246BA 
> 33DF 
SOFC 
>2E11 
?2B1F 
+2827 
22528 
72224 
DLIFLA 
>1LCOL 
Z1ISF? 
>iSe&z 
712C8 
>FAB 
>C&C 
>96B 
7648 
2324 
>O 
>FOCDC 
>F IBS 
>F 69S 
>F374 
SFOSS 
>SEDSS 
PEALE 
ET O7 
PESF4 
SEOES 
>DBDDC: 
>DADS 
>D7D? 
>D4E1 
>DIEF 
>CFO4 
SCCr21 
>C946 
>CE673 
SC3AY 
?PCOESY 
>BESZ 
>BBSS 
>BSES 
>B6&4I- 
>BSCO 
>BL40 
PAECC 
SACES 
>AAOA 
>A7BD 
TASTE 
>A34Z 
=A1Z9 
2OF 14 
>9DOE 
>9BL7 
>9930 
?PITSY 
2592 
>93DIT 
29236 
>29O0A1 
>SF 1B 


DATA 
DATA 
END 
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INTRODUCTION 


In Pulse Code Modulation (PCM) systems, which are com- 
monly used in public and private (PBX) telephone networks, 
samples of an analog speech waveform are encoded as binary 
words and transmitted serially usually at a rate of 8000 
samples per second. This digitized data is communicated 
most efficiently if the amplitude of the waveform is com- 
pressed to logarithmic scale before transmission (reducing 
the number of bits required for their representation), and then 
expanded at the receiver. 

The conversion to logarithmic scale insures that low- 
amplitude signals are digitized with a minimal loss of fidelity. 
This procedure of first compressing and then expanding the 
signal is known as ‘‘companding’’ (COMpressing and 
exPANDING). Figures 1(a) and 1(b) show the procedures 
involved in companding, typically accomplished by a 
hardware device called a codec or combo-codec (the 
combined PCM codec and filter).! Since codecs are 
inexpensive, they have been widely used as input/output (1/O) 
devices for analog signals in many digital signal processing 
applications, such as digital telephony. 

In a digital signal processing system that incorporates 
codecs, a reversed companding process is required as shown 
in Figure 1(c). The compressed PCM data is first converted 
to linear PCM to be processed by the digital signal processor. 
After the digital signal processing, the processed linear PCM 
is then compressed before sending it to the codec to produce 
an analog output signal. 


A/D 
CONVERTER 


The TMS320 family of digital signal processors is 
designed for numeric-intensive applications. Because of the 
processor’s high speed, the companding (actually an expand- 
and-compress procedure) described in Figure 1(c) can be 
performed with minimum execution time and program 
requirements. This allows the processor to dedicate most of 
its resources for real-time digital signal processing 
applications, such as filtering, tone generation/detection, 
transcoding, vocoding, and echo cancellation. Companding 
can be performed in software in two ways: (1) by calculating 
the companding algorithm in real-time, or (2) by looking up 
a table pregenerated using the algorithm. The lookup-table 
approach naturally requires more storage, but it provides 
faster execution than the algorithmic approach. The tradeoff 
must be made by the digital signal processing designer 
between memory and speed requirements. In addition, 
companding can be accomplished externally in hardware (see 
the application report, ‘“Telecommunications Interfacing to 
the TMS32010’’).3 

The main portion of this report presents four 
TMS32010 programs that implement the standard 
companding algorithms. The TMS32010 is the first 
generation of the TMS320 family of digital signal processors. 
Three programs for companding, which use the TMS32020, 
the second-generation digital signal processor, are included 
in the appendix. One of these programs uses the lookup-table 
approach. Note that no special effort is taken to further 
optimize these programs for any particular application. The 
purpose of this report is to show how companding can be 


PARALLEL- COMPRESSED 
TO-SERIAL BINARY SERIAL 
CONVERSION OUTPUT DATA 


LINEAR-TO-LOG 
COMPRESSION 


SAMPLE CLOCK 
CLOCK 
(a) TRANSMISSION 
COMPRESSED SERIAL-TO- DIA SMOOTHING ANALOG 
BINARY SERIAL PARALLEL CONVERTER FILTER OUTPUT 
INPUT DATA CONVERSION 
LOG-TO-LINEAR 
EXPANSION 


(b) RECEPTION 


TMS320 DIGITAL SIGNAL PROCESSOR 


COMPRESSED 


ANALOG DIGITAL ANALOG 
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| 
| 
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| 
| 
| 
| 
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(c) COMPANDING BY A DIGITAL SIGNAL PROCESSOR 
Figure 1. PCM Companding 
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performed by both generations of digital signal processors. 
Other application reports are available, which show how the 
companding routines can be optimized for special 
applications, such as Adaptive Differential Pulse Code 
Modulation (ADPCM)2 and telecommunication interfaces3 
using the TMS32010, and echo cancellation4 using the 
TMS32020. 


COMPANDING 


In any sampled data system, the analog-to-digital (A/D) 
conversion process introduces quantization noise. For the 
usual linear A/D encoding scheme, the digitized code word 
is a truncated binary representation of the analog sample. 
The effect of this truncation is most pronounced for small 
signals. For voice transmission, this is undesirable since most 
information in speech signals resides in the lower amplitudes 
even though speech signals typically require a wide dynamic 
range. This can be remedied by adjusting the size of the quan- 
tization interval so that it is proportional to the input signal 
level. In this case, the quantization interval is small for small 
amplitude signals and larger for larger signals. Consequently , 
lower amplitudes are represented with more quantization 
levels and, therefore, with greater resolution. 


CHARACTER SIGNAL 
BINARY CODE* 


80 


64 


48 


32 


COMPANDED AND CODED SIGNAL 


16 


1/128 §=1/64 1/32 1/16 


The resulting encoding scheme is logarithmic in nature 
and has the property of yielding the greatest dynamic range 
for a given signal-to-noise ratio and word length. Compan- 
ding is defined by two international standards based on this 
relation — both compress the equivalent of 13 bits of 
dynamic range into 7. The standard employed in the United 
States and Japan is known as the yu-255 law companding 
characteristic and is given by the equation 


F(x) = sgn(x) mies! 


where: 


F(x) is the compressed output value 
x is the normalized input signal (between —1 and 1) 


pw is the compression parameter (= 255 in North 
America) 


sgn(x) is the sign (+) of x 


The European standard is referred to as A-law com- 
panding and is defined by the equation 


I 

{ 

| 

I E 

2 | 
] 5 0.625 + 
1 ro) ' 
| 2 ( 
| N 0.500 + 
| » = 255 a | 
| 2 | 
| 9 0.375 + 
| = 1 
| | 
| 0.250 -+- 
| I 
| { 
| 

| 

] 


1/2 1.0 


INPUT SIGNAL RELATIVE LEVEL 


* This is the bit pattern transmitted for positive input values. The left-most bit is a O for negative input values. 


Figure 2. Companding Curve of the »-Law Compander (from Reference 5) 
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A|x| 1 
sgn(x) T+ InA) for O < |x| <= 
F(x) = 
(1 +inAjx|), 1 
sgn(x) > nA) for a = Ix| <1 
where: 


F(x) is the compressed output value 

x is the normalized input signal (between —1 and 1) 
A is the compression parameter (= 87.6 in Europe) 
sgn(x) is the sign (+) of x 


In practice, the code word is actually inverted before 
transmission. Low amplitude signals tend to be more 
numerous than large amplitude samples. Consequently, in- 
verting the bits increases the density of positive pulses on 
the transmission line which improves the performance of tim- 
ing and clock recovery circuits. 


p-255 COMPANDING 


Eight-bit sign-magnitude words can represent 255 dif- 
ferent code words. This made 255 the most convenient choice 
for the p-law companding parameter. This companding 
characteristic exhibits the valuable property of being closely 
approximated by a set of eight straight-line segments, as 
shown in Figure 2.5 This figure illustrates how the input 
sample values of successively larger intervals are compressed 
into intervals of uniform size. The slope of each segment 
“is exactly one-half that of the preceding one. The step size 
between adjacent code words is doubled in each succeeding 
segment. This property allows the conversion to and from 
a linear format to be done very efficiently. 


TMS32010 Algorithm 
Uniformly quantized 14-bit sign-magnitude numbers 
are compressed into 8-bit signed u-255 code words by the 
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program ‘MULAWCMP’ and expanded to their original 
amplitude by the program ‘MULAWEXP’, both listed and 
flowcharted in the Program Listings section. The code word 
Y, formed by MULAWCMP, has the format Y = 
PSSSQQQQ composed of: 


Polarity bit: P 
3-bit segment number: SSS 
4-bit quantization bin number: QQQQ 


The encoding algorithm is best understood by 
examining the segment endpoints of Table 1, which begin 
with the values 31, 95, 223,..., 4063. 


Note that 
31 = 26 — 33 
95 = 27 — 33 
223 = 28 — 33 
4063 = 212 — 33 


so that if 33 is added to each value in the table, the end points 
become powers of two. 

This means that the segment number corresponding to 
a number N (which is to be encoded) can be determined by 
finding the most significant ‘1’ bit in the binary representation 
of N + 33. Furthermore, as Table 2 indicates, the following 
four bits make up the quantization bin number. The 
remaining bits are discarded. 

On expansion, these lost bits are assumed to have been 
the median of the possible numbers which these lost bits could 
have represented — a one followed by zeroes (see Table 3). 
This rounding limits the loss in accuracy. 
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Table 1. Encoding/Decoding Table for-»-255 PCM* 
(Courtesy of John Wiley & Sons, see Reference 6) 


Quantization 
Code Q 


215-223 
223-239 


463-479 
479-511 


959-991 
991-1055 


1951-2015 
2015-2143 


3935-4063 
4063-4319 


7903-8159 


“This table displays magnitude encoding only. Polarity bits are assigned as ‘’O’’ for positive and ‘’1’’ for negative. In transmission, all 


bits are inverted. 
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Table 2. 1-255 Binary Encoding Table* 


Compressed Code Word 
: Bt: 6/5 4 3 2 1 
) 


* The polarity is not shown in this table. 
NOTE: The leading bit is the sign bit. 
EXAMPLES: 

IAS 


ESEESEESSie 


(1) +86549 OAS +86519+3340 = +89819 = +3824g = (0}0 0011 1000 00109-» (0)100 11005 
(2) -251349 —® —251319-3310 = -254619 = —9F216 = (1)0 1001 1111 00102 -»(1)110 00119 


Table 3. »-255 Binary Decoding Table* 


Compressed Code Word 
5 4 3 2 1 


° 
coo Ooo OA Pia 


—-_ = -=§ = © OO O 


6) 
0 
1 
1 
.¢) 
.e) 
1 
1 


= O- O-7 0-— 


"The polarity is not shown in this table. 
NOTE: The leading bit is the sign bit. 
EXAMPLES: 


Biased Output Values 


oo o0o0o00 oO =H|{o 


REMOVE BIAS 


(1) 3C46 = (0)011 11009 -» (0)0 0001 1100 10009 = +01C816 = +45619 ——————— _45610- 3310 = +42310 


REMOVE BIAS 


(2) E346 = (1)110 00119 -©(1)0 1001 1100 00002 = -09C01g = —249619 —————> _ — 2496 19 +3310 = — 246319 


Performance 

Analysis of the PCM y-255 companding system of 
Figure 1 shows that the approximated digital values approach 
the original inputs closely, with a signal-to-quantization noise 
ratio of 39.3 dB for a full-range sinusoid.® In general, voice 
signals have smaller quantization errors but lower signal 
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power, so p-255 performance in voice transmission is ap- 
proximately the same. The signal-to-quantization noise ratio 
for »-255 law encoding is given in Figure 3 for sinusoid 
inputs. The algorithm space and time requirements are given 
in Table 4. 
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FULL LOAD SIGNAL 
= +3 dBm0 


UNFILTERED 
PIECEWISE 
LINEAR 1-255 


UNFILTERED 
8-BIT »-255 


SIGNAL-TO-QUANTIZING NOISE (SQR) IN dB 


-70 ~-60 - 50 -40 -30 -20 -10 0 3 
SIGNAL POWER OF SINEWAVE (dBm0O) 


Figure 3. Signal-to-Quantizing Noise of u.-Law Coding with Sinewave Inputs 
(Courtesy of John Wiley & Sons, see Reference 6) 


Table 4. Summary of »-Law Program Space and Time Requirements 


Compress 105 13 17 40 8.0 


Tt Assuming initialization 
+ Worst case 
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A-LAW COMPANDING 


The companding characteristic recommended by 
CCITT and adopted in Europe is the ‘A-law’ standard. Not 
only can this characteristic be approximated with linear 
segments, as with the y-law approximation, but a portion 
of the rule is linear by definition. 

The A-law programs ‘ALAWCOMP’ and 
‘ALAWEXP’ are listed and flowcharted in the Program 
Listings section. They differ from the y-law routines in the 


CHARACTER SIGNAL 
BINARY CODE* 


32 


COMPRESSED AND CODED SIGNAL 
a 
> 


16 


| Ke 
1/128" 4/64 1/32 1/16 


handling of the first segment. This segment is defined to be 
exactly linear for the A-law. Also, biasing is not required 
before conversion. The inputs should be scaled to a max- 
imum value of 4096 for representation, as opposed to 8158 
for the p-law case. Since this allows for a minimum step less 
refined than the p-law, the A-law characteristic provides less 
fidelity for small signals but is superior in terms of dynamic 


range. 
Figures 4 and 5 and Tables 5, 6, and 7 presented below 
* are analogous to those given above for p-law. 


= 
i) 
~ 0.625 + 
2 
ro) | 
a | 
N 0.500 + 
A = 87.6 a l 
= 
5 0.375 
= 


1/4 1/2 1 


INPUT SIGNAL RELATIVE LEVEL 


"For positive input values. The left-most bit is a O for negative input values. Even bits (beginning with 1 at the left) are inverted before 


transmission. 


Figure 4. Companding Curve of the A-Law Compander (from Reference 5) 
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Table 5. Segmented A-Law Encoding/Decoding Table 
(Courtesy of John Wiley & Sons, see Reference 6) 


124-128 
128-136 


248-256 
256-272 


496-512 
512-544 


992-1024 
1024-1088 


1984-2048 
2048-2176 


3968-4096 
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Table 6. A-Law Binary Encoding Table* 


Compressed Code Word 


Bit: 


co) 


x «x « KK KK KIO 
= = = = © OO O 
= =OoOo--—-]— 0 OO 
= O- O- 0 0 


” The polarity is not shown in this table. 

NOTE: The leading bit is the sign bit. 

EXAMPLES: 
(1) +322149 = +C9516 = (0) 1100 1001 01017 -w(0) 111 10019 
(2) -19949 = -C716 = (1) 0000 1100 01119 -» (1) 011 10002 


Table 7. A-Law Binary Decoding Table* 


Compressed Code Word 


5 Bit: 11 10 9 
7) 


(o) 


6) 
0 60 
0 60 
Oo 6060 
0 60 
Oo 61 
1 


Q3 Q2 Q1 Qo 
1 


-0O-- 0.0 — 


0) 
0 
1 
1 
0 
0 
1 
1 


* The polarity is not shown in this table. 
NOTE: The leading bit is the sign bit. 
EXAMPLES: 
(1) (0) 001 11015 -® (0) 0000 0011 10117 = +3B7g = +5910 
(2) (1) 110 01009 -> (1) 0101 0010 00002 -™ —5201g = -—1312169 
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UNFILTERED 

PIECEWISE —@>/’ 

LINEAR »-255 
e 


SIGNAL-TO-QUANTIZING NOISE (SQR) IN dB 
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Figure 5. Signal-to-Quantizing Noise of A-Law PCM Coding with Sinewave Inputs 
(Courtesy of John Wiley & Sons, see Reference 6) 


Table 8. Summary of A-Law Program Space and Time Requirements 


T Assuming initialization 
+ Worst case 


SUMMARY 


The programs, listed in the next section, have been 
designed to reduce both memory space used and ‘‘loop 
time,’’ i.e., that time required to complete the calculations 
after initializations have been made. Of course, other 
space/time tradeoffs are possible. Dedicated to companding, 
the TMS32010 can compress 125,000 or expand 200,000 
words in a second using these routines. This speed and the 
versatility of the TMS32010 allow one device to compand 
a PCM data stream while simultaneously performing related 
functions such as filtering, vocoding, and tone 
generation/recognition. 
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Words of 
Program Cycles 
Memory 


eee initialization Loop? 


Compress 97 11 
Expand 48 7 


Time 
Required t 


7.2 
5.0 


PROGRAM LISTINGS 


The following TMS32010 program flowcharts and 
assembly language routines are listed below: 


‘“MULAWCMP’: p-LAW COMPRESSION 
“MULAWEXP’: y-LAW EXPANSION 
‘ALAWCOMP’: A-LAW COMPRESSION 
“ALAWEXP’: A-LAW EXPANSION 


5. Companding Routines for the TMS32010/TMS32020 


FLOWCHART: MULAWCMP 


(_BEGIN_) 


INITIALIZE CONSTANTS 


INPUT 14-BIT SIGN-MAGNITUDE 


NUMBER X AND SAVE POLARITY P 


MASK X TO Xmag (LOW 13 BITS) 
ADD 33 TO Xymac 


NO 


SATURATE YmaG 
(SET TO ALL ONES) 


SET B = BIT POSITION OF THE 
LEFTMOST ‘1° IN X . (FIND 
B THROUGH BINAR RCH OF 
8 MOST SIGNIFICANT BITS). 


SET QO = FOUR BITS FOLLOWING B 
JOIN S AND O TO FORM Ywac 


JOIN P'TO Yyjag TO FORM Y 
en 3 Q2 Q; Ao) 


(Y = PSo Sy 


INVERT Y TO FORM Y 


OUTPUT Y 


Figure 6. »-Law Compression 
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MULAWCMP 


0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 
0050 
0051 
0052 
0053 
0054 


0000 


0000 
0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
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0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 


0009 
OO0A 
000B 
000C 
000D 


7E01 
5001 
2401 
5002 
2D01 
5003 
1001 
5004 
7EFF 
5005 


+ 
* 


+ + + + HH HF HF HF KH HF HF HF HF HF HH HF HF HH KF OF 


O * 
Zz 
ie) 


BIT4 
BIT13 
MASK13 
MASK8 


5 
Nn 
~ 
ie 


MASK2 


+ 0 
= 
» 
n 


*ON'UOK MS + * 


+ 


INIT 


2.1 83.076 08:21:33 10/26/83 
PAGE 0001 
IDT "MULAWCMP ' 
"MULAWCMP' PERFORMS A MU-255 COMPRESSION. THE 


14-BIT SIGN-MAGNITUDE INPUT X, 


X = P X12 Xl1l X2 Xl X0 


IS ENCODED AS AN 8-BIT SIGN-MAGNITUDE NUMBER Y, 


Y = P S2 S1 SO Q3 Q2 Q1 QO consisting of 
POLARITY BIT: P, 

3-BIT SEGMENT NUMBER: S = 
4-BIT QUANTIZATION BIN NUMBER: 


S2 S1 SO 
Q = Q3 Q2 Ql Q0 


Y IS INVERTED BEFORE TRANSMISSION. 
PORT 0 IS USED FOR I/O. 


WORST-CASE TIMING IN CYCLES: 17 INIT / 40 LOOP 
SPACE REQUIREMENTS IN WORDS: 13 DATA / 105 PROGRAM 


CONSTANTS: 

EQU 1 = 

EQU 2 =>0010 (ONE IN BIT 4) 

EQU 3 =>2000 (ONE IN BIT 13) 

EQU 4 =>l1FFF (13 ONES) 

EQU 5 =>Q00FF (8 ONES) 

EQU 6 =>Q000F (4 ONES) 

EQU 7 =>0003 (2 ONES) 

EQU 8 =33 

VARIABLES: 

EQU 9 DATA INPUT (14 BITS) 

EQU 10 ENCODED DATA OUTPUT (8 BITS) 
EQU 11 POLARITY OF DATA (0 FOR POS) 
EQU 12 ~3-BIT SEGMENT NUMBER 

EQU 13 4-BIT QUANTIZATION BIN NUMBER 
AORG 0 

LACK 1 

SACL ONE 

LAC ONE, 4 

SACL BIT4 

LAC ONE, 13 

SACL BIT13 

SUB ONE 

SACL MASK13 

LACK >OOFF 

SACL MASK8 
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0055 OOOA 7EOF LACK >O000F 
0056 O000B 5006 SACL MASK4 
0057 000C 7E03 LACK >0003 
0058 O00D 5007 SACL MASK2 
0059 OO00E 7E21 LACK 33 
0060 OOOF 5008 SACL BIAS 
Q061 * 
0062 * GET INPUT, SAVE POLARITY, AND MASK TO MAGITUDE 
0063 * 
0064 0010 4009 START IN X,0 INPUT DATA 
0065 0011 2003 LAC BIT13 POLARITY BIT MASK 
0066 0012 7909 AND X 
0067 0013 5C0B SACH P,4 0 FOR POS; 2 FOR NEG. 
0068 0014 2009 LAC X 
0069 0015 7904 AND MASK13 
0070 * 
0071 * BIAS MAGNITUDE AND SATURATE IF OVERFLOW OCCURS 
0072 * 
0073 0016 0008 ADD BIAS BIAS INPUT X BY 33 
0074 0017 5009 SACL x 
0075 0018 7903 AND BIT13 CHECK FOR OVERFLOW INTO P 
0076 0019 FFOO BZ LMOST NO OVERFLOW IF ZERO 

OO1A OO1E 
0077 * 
0078 * SATURATION: ENCODE LARGEST CODE WORD 
0079 * 
0080 001B 7E7F LACK >7F 7 ONES 
0081 001C F900 B SIGN 

001D 0063 
0082 * 
0083 * COMPUTE THE THREE BITS S(= S2 Sl SO) OF THE 
0084 * COMPRESSED WORD 
0085 * 
0086 * S = BP - 5 
0087 * 
0088 * WHERE BP IS THE BIT POSITION OF THE LEFTMOST '1' 
0089 * IN X. THE FOUR FOLLOWING BITS ARE IN THE HIGH 
0090 * HALF OF THE ACCUMULATOR. THESE FOUR BITS ARE 
0091 * Q (=Q3 Q2 Ql QO). 
0092 * 
0093 * SEARCH BITS X12 THRU X5. (SEE TABLE 1 OF TEXT.) 
0094 * 
0095 * 
0096 O0O01E 2906 LMOST LAC MASK4,9 1111 0000 
0097 O0O1F 7909 AND x 
0098 0020 FFOO BZ EEE 

0021 0042 
0099 0022 2B07 LAC MASK2,11 1100 0000 
0100 0023 7909 AND X 
0101 0024 FFOO BZ cc 

0025 0034 
0102 0026 2C01 LAC ONE,12 1000 0000 
0103 0027 7909 AND X 


5. Companding Routines for the TMS32010/TMS32020 183 
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0104 


0105 
0106 
0107 
0108 


0109 
0110 
Olll 
0112 


0113 
0114 
0115 


0116 
0117 
0118 
0119 


0120 
0121 
0122 
0123 


0124 
0125 
0126 


0127 
0128 
0129 


0130 
0131 
0132 
0133 
0134 


0135 
0136 
0137 
0138 
0139 


0140 
0141 
0142 
0143 


0144 
0145 
0146 


0028 
0029 
002A 
002B 
002C 
002D 
002E 
002F 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
003A 
003B 
003C 
003D 
003E 
003F 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 


004A 
004B 
004C 
004D 
004E 


O04F 
0050 
0051 
0052 
0053 


0054 
0055 
0056 
0057 


0058 
0059 
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FFOO 
002F 
7E07 
500C 
2809 
F900 
OOSF 
7E06 
500C 
2909 
F900 
OOSF 
2A01 
7909 
FFOO 
003D 
7E05 
500C 
2A09 
F900 
OOSF 
7E04 
500C 
2B09 
F900 
OOSF 
2707 
7909 
FFOO 
0054 
2801 
7909 
FFOO 
004F 


7E03 
500C 
2C09 
F900 
OO5SF 


7E02 
S00C 
2D09 
F900 
OOSF 


2601 
7909 
FFOO 
005D 


7E01 
500C 


cc 


EEE 


GG 


* 
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OL ies 


0100 0000 


001. 


0001 .... 


0000 1100 


0000 1000 


0000 1... 


0000 Ol.. 


0000 0010 


0000 O01. 
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0147 OOS5SA 2E09 LAC X,14 
0148 005B F900 B XDONE 

005C OO5F 
0149 x 
0150 005D 500C H SACL S 0000 0001 (acc = 0) 
0151 OO5SE 2F09 LAC X,15 
0152 * 
0153 * REMOVE LEFTMOST ‘1' AND STORE Q 
0154 * 
0155 OO5F 6202 XDONE SUBH ~~ BIT4 
0156 0060 580D SACH Q 
0157 al 
0158 * FORM 8-BIT COMPRESSED WORD FROM Q, S, AND P. 
0159 * 
0160 0061 200D LAC Q Q: BITS 0-3 QQQ00 
0161 0062 040C ADD S,4 S: BITS 4-6 —SSSQQQQ 
0162 0063 060B SIGN ADD P,6 P: BIT 7 PSSSQQ00Q 
0163 x 
0164 * COMPLEMENT FOR TRANSMISSION AND OUTPUT 
0165 * 
0166 0064 7805 XOR MASK8 
0167 0065 500A SACL ¥ 
0168 0066 480A OUT ¥,0 PORT 0 
0169 0067 F900 FIN B FIN 

0068 0067 
0170 * 
0171 END 


NO ERRORS, NO WARNINGS 
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FLOWCHART: MULAWEXP 


INITIALIZE CONSTANTS 


INPUT Y 


INVERT Y TO FORM Y 


SAVE P AND S 


SET SUM = 2*0+33 


SHIFT SUM BY S AND REMOVE 
ORIGINAL BIAS OF 33 TO 


FORM XmaG 
(XmaG = (SUM AS) - 33] 


JOIN P TO Xmac TO FORM X 


OUTPUT X 


Figure 7. »-Law Expansion 
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MULAWEXP 


0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 
0050 
0051 
0052 
0053 
0054 


0000 


0000 
0001 
0002 
0003 
0004 
0005 


0006 
0007 
0008 
0009 
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0001 
0002 
0003 


0004 
0005 
0006 
0007 
0008 


7EO1 
5001 
2701 
5002 
7E21 
5003 


4004 
7EFF 
7804 
5004 


PAGE 0001 
IDT "MULAWEXP ' 


* 
* 


"MULAWEXP' PERFORM A MU-LAW EXPANSION. THE 
8-BIT DATA INPUT IS 


Y = P S2 S1 SO Q3 Q2 Q1 QO WHICH CONSISTS OF 


POLARITY BIT: P 
3-BIT SEGMENT NUMBER: S = S2 S1 SO 
4-BIT QUANTIZATION NUMBER: Q = Q3 Q2 Ql Q0 


THE INPUT Y IS EXPANDED INTO A 14-BIT OUTPUT 
X = P Xl2 X11 K10 ... X2 X1 X0 CONSISTING OF 


POLARITY BIT:P 
AND A 13-BIT MAGNITUDE 

Ss 
(X12...X0) = (33 + 2Q) X 2 - 33 


PORT 0 IS USED FOR I/O. | 
WORST-CASE TIMING IN CYCLES: 6 INIT / 23 LOOP 
SPACE REQUIREMENTS IN WORDS: 8 DATA / 46 PROGRAM 


+ + + FH HF HH HH HF HF HF HH HF H HF HH HF HF HF HF 


CONSTANTS: 
* 
ONE EQU 1 = ] 
BIT7 EQU 2 = >0080 
BIAS EQU 3 = 33 
* 
* VARIABLES: 
* 
Y EQU 4 MU-LAW COMPRESSED 8-BIT DATA INPUT 
x EQU 5 DECODED (EXPANDED) 14-BIT OUTPUT 
P EQU 6 POLARITY OF DATA (0 FOR POS) 
S EQU 7 3-BIT SEGMENT NUMBER 
SUM EQU 8 VALUE TO BE SHIFTED 
* 
AORG 0 
* 
INIT LACK 1 
SACL ONE 
LAC ONE, 7 
SACL BIT7 
LACK 33 
SACL BIAS 


* 


* INVERT INPUT 
* 


START IN Y,0 
LACK >OOFF 
XOR Y 
SACL Y 
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MULAWEXP 


0055 
0056 
0057 
0058 
0059 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 
0068 
0069 
0070 
0071 
0072 
0073 
0074 
0075 
0076 
0077 
0078 
0079 
0080 


0081 
0082 
0083 
0084 
0085 
0086 
0087 
0088 
0089 
0090 


0091 


0092 
0093 
0094 
0095 
0096 
0097 
0098 
0099 
0100 
0101 


OO0A 
000B 
000C 
000D 
000E 


000F 
0010 
0011 
0012 
0013 
0014 


0015 
0016 
0017 
0018 


0019 
OO1A 
001B 
001C 
001D 


OO1E 
OO1F 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
OO2A 
002B 
002C 
902D 


NO ERRORS, 
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7902 
5006 
7E7F 
7904 
5004 


2C04 
5807 
2104 
1507 
0003 
5008 


7E1E 
0107 
7F8C 
1003 


0606 
5005 
4805 
F900 
001C 


2008 
7F8D 
2108 
7F8D 
2208 
7F8D 
2308 
7F8D 
2408 
7F8D 
2508 
7F8D 
2608 
7F8D 
2708 
7F8D 
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* SAVE POLARITY AND STRIP TO LOW 7 BITS 


0000 FOR POS; 0080 FOR NEG 


IS CORRECT. STRIP Y OF S AND Q. 


SHIFT S INTO HIGH HALF OF ACC 


DOUBLE 
REMOVE S BITS (STRIP TO 2Q) 


SUM = 2Q + BIAS 


S AND REMOVE BIAS 


AND BIT? 
SACL P 
LACK >007F 
AND Y 
SACL Y 
* 
* MAGNITUDE 
LAC Y,12 
SACH S 
LAC Y,1 
SUB S,5 
ADD BIAS 
SACL SUM 
* 
* SHIFT SUM BY 
LACK SBASE 
ADD Si 
CALA 
SUB BIAS 
* 
* ACC = 
ADD P,6 
SACL xX 
OUT X,0 
FIN B FIN 


* LOAD SUM SHIFTED 0:7 


SBASE LAC 
RET 
LAC 
RET 
LAC 


NO WARNINGS 


SUM, 0 
SUM, 1 
SUM, 2 
SUM, 3 
SUM, 4 
SUM,5 
SUM, 6 


SUM,7 


OFFSET FOR SHIFT ROUTINE 
DOUBLE S (2 WORDS/SHIFT SEGMENT) 
SHIFT SUM BY S 


MAGNITUDE, ADD POLARITY TO BIT 13 


SHIFT P TO BIT 13 


OUTPUT RESULT TO PORT 0 
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FLOWCHART: ALAWCOMP 


INITIALIZE CONSTANTS 


INPUT 13-BIT SIGN-MAGNITUDE 


NUMBER X AND SAVE POLARITY P 


MASK X TO XnaG (LOW 12 BITS) 


SET B = BIT POSITION OF THE 
LEFTMOST ‘1’ IN THE 7 MOST 


SIGNIFICANT BITS OF Xac. 
(USE BINARY SEARCH) 


NO 


B FOUND? 


YES 


SET Q = FOUR BITS 
FOLLOWING B 


JOIN S AND Q TO FORM YwaG 


JOIN P TO YmjaG TO FORM Y 


Figure 8. A-Law Compression 
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SEGMENT 
SET YMAG 
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PAGE 0001 
0001 IDT "ALAWCOMP ' 
0002 eK 
0003 * ‘ALAWCOMP' PERFORMS AN A-LAW COMPRESSION. 
0004 * THE 13-BIT SIGN-MAGNITUDE INPUT X, 
0005 * 
0006 * X = P Xll X10 ... X2 Xl XO 
0007 * 
0008 * IS ENCODED AS AN 8-BIT SIGN-MAGNITUDE NUMBER Y, 
0009 * 
0010 * Y = P S2 S1 SO Q3 Q2 Ql QO consisting of 
0011 * 
0012 * POLARITY BIT: P, 
0013 * 3-BIT SEGMENT NUMBER: S = S2 Sl SO 
0014 * 4-BIT QUANTIZATION BIN NUMBER: Q = Q3 Q2 Q1 QO 
0015 * 
0016 * Y IS INVERTED BEFORE TRANSMISSION. 
0017 * PORT 0 IS USED FOR I/O. 
0018 * 
0019 * WORST-CASE TIMING IN CYCLES: 14 INIT / 36 LOOP 
0020 * SPACE REQUIREMENTS IN WORDS: 11 DATA / 97 PROG 
0021 * 
0022 * CONSTANTS: 
0023 * 
0024 * 
0025 0001 ONE EQU 1 =] 
0026 0002 BIT4 EQU 2 = >0010 (ONE IN BIT 4) 
0027 0003 MASK12 EQU 3 = >OFFF (12 ONES) 
0028 0004 MASK8 EQU 4 = >OOFF ( 8 ONES) 
0029 0005 MASK4 EQU 5 = >000F ( 4 ONES) 
0030 0006 MASK2 EQU 6 = >0003 ( 2 ONES) 
0031 * 
0032 *VARIABLES: 
0033 0007 xX EQU 7 DATA INPUT (13 BITS) 
0034 0008 yY EQU 8 ENCODED DATA OUTPUT (8 BITS) 
0035 0009 P EQU 9 POLARITY OF DATA (0 FOR POS) 
0036 OOOA S EQU 10 3-BIT SEGMENT NUMBER 
0037 0OO0B Q EQU pa 4-BIT QUANTIZATION BIN NUMBER 
0038 0000 
0039 * 
0040 * 
0041 0000 AORG 0 
0042 * 
0043 0000 7EO1 MINIT LACK 1 
0044 0001 5001 SACL ONE 
0045 0002 2401 LAC ONE, 4 
0046 0003 5002 SACL BIT4 
0047 0004 2C0l LAC ONE, 12 
0048 0005 1001 SUB ONE 
0049 0006 5003 SACL MASK12 
0050 0007 7EFF LACK >OOFF 
0051 0008 5004 SACL MASK8 
0052 0009 7EOF LACK >000F 
0053 000A 5005 SACL MASK4 
0054 000B 7E03 LACK >0003 
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0055 000C 5006 SACL MASK2 

0056 * 

0057 * GET INPUT AND SAVE POLARITY 

0058 000D 4007 START IN X,0 

0059 OOOE 2C0l LAC ONE ,12 

0060 OOOF 7907 AND X 

0061 0010 5Cc09 SACH P,4 

0062 = 

0063 * STRIP TO LOW 12 BITS 

0064 0011 2007 LAC X 

0065 0012 7903 AND MASK12 

0066 0013 5007 SACL X 

0067 ig 

0068 * 

0069 al 

0070 * 

0071 2 

0072 i 

0073 x 

0074 0014 2805 LMOST LAC MASK4 , 8 

0075 0015 7907 AND X 

0076 0016 FFOO BZ EEE 
0017 0038 

0077 ™ 

0078 0018 2A06 LAC MASK2, 10 

0079 0019 7907 AND X 

0080 001A FFOO BZ cc 
001B 002A 

0081 * 

0082 001C 2B01 LAC ONE, 11 

0083 001D 7907 AND X 

0084 OO01E FFOO BZ B 
OO1lF 0025 

0085 ys 

0086 0020 7E07 LACK 7 

0087 0021 500A SACL S) 

0088 0022 2907 LAC X,9 

0089 0023 F900 B XDONE 
0024 0058 

0090 - 

0091 0025 7E06 B LACK 6 

0092 0026 S5SO0A SACL S 

0093 0027 2A07 LAC X,10 

0094 0028 F900 B XDONE 
0029 0058 

0095 ~ 

0096 002A 2901 CC LAC ONE, 9 

0097 002B 7907 AND X 

0098 002C FFOO BZ D 
002D 0033 

0099 * 

0100 O002E 7E05 LACK 5 

0101 OO02F 500A SACL S) 

0102 0030 2B07 LAC X,11l 
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INPUT DATA THRU PORT 0 
POLARITY BIT MASK 


0 FOR POS; 1 FOR NEG, 


S =BP -4 WHERE BP = BIT POSITION OF THE LEFTMOST '1' 

IN X. TIND THE '1l' THROUGH A BINARY SEARCH OF 8 MSB'S 
OF X. STORE S AND LOAD X SHIFTED LEFT BY 16-S SO 

THAT THE '1' AND FOUR FOLLOWING BITS ARE IN THE HIGH 

HALF OF THE ACCUMULATOR. SEARCH BITS 4 THRU ll. 


1100 0000 


1000 0000 


Ol.. 


0010 0000 


O01. 
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0103 0031 F900 B XDONE 

0032 0058 
0104 * 
0105 0033 7E04 D LACK 4 0001) 24. 
0106 0034 500A SACL S 
0107 0035 2c07 LAC X,12 
0108 0036 F900 B XDONE 

0037 0058 
0109 * 
0110 0038 2606 EEE LAC MASK2 , 6 0000 1100 
0111 0039 7907 AND X 
0112 003A FFOO BZ GG 

003B 004A 
0113 * 
0114 003C 2701 LAC ONE, 7 0000 100 
0115 003D 7907 AND x 
0116 003E FFOO BZ F 

003F 0045 
0117 * 
0118 0040 7E03 LACK 3 0000" 1s 
0119 0041 500A SACL S 
0120 0042 2D07 LAC X,13 
0121 0043 F900 B XDONE 

0044 0058 
0122 * 
0123 0045 7E02 F LACK 2 0000 Ol.. 
0124 0046 SO0A SACL S 
0125 0047 2E07 LAC X,14 
0126 0048 F900 B XDONE 

0049 0058 
0127 
0128 004A 2501 GG LAC ONE, 5 0000 0010 
0129 004B 7907 AND X 
0130 004C FFOO BZ SEGZ 

004D 0053 
0131 * 
0132 004E 7E01 LACK i 0000 001. 
0133 004F 500A SACL S 
0134 0050 2F07 LAC X,15 
0135 0051 F900 B XDONE 

0052 0058 
0136 * SEGMENT 0: _SSSQQQQ =X/2 
0137 0053 2FO7 SEGZ LAC X,15 
0138 0054 5807 SACH X 
0139 0055 2007 LAC X 
0140 0056 F900 B SIGN 

0057 005C 
0141 * 
0142 * REMOVE LEFTMOST '1' AND STORE Q 
0143 0058 6202 XDONE SUBH BIT4 
0144 0059 580B SACH Q 
0145 * 
0146 * FORM 8-BIT COMPRESSED WORS FIR Q, S, AND P. 
0147 005A 200B LAC Q Q:BITS 0-3 2E00 
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0148 005B 040A ADD S,4 S:BITS 4-6 —_SSSQQQQ 
0149 005C 0709 SIGN ADD P,7 PSSSQQQQ 
0150 be 
0151 * COMPLEMENT FOR TRANSMISSION AND OUTPUT 
0152 005D 7804 XOR MASK8 
0153 OOSE 5008 SACL Y 
0154 OOSF 4808 OUT Y,0 PORT 0 
0155 0060 F900 FIN B FIN 

0061 0060 
0156 * 
0157 END 


NO ERRORS, NO WARNINGS 


5. Companding Routines for the TMS32010/TMS32020 


FLOW CHART: ALAWEXP 


INITIALIZE CONSTANTS 
INPUT Y 
INVERT Y TO FORM Y 


SAVE P AND S 


Ott 


NO 


SET SUM = 2*0433 


SET XmaG = 2*Q+1 


SHIFT SUM BY S TO 
FORM X 
(XmaG = SUM A S) 


JOIN P TO Xmac TO FORM X 


OUTPUT X 


Figure 9. A-Law Expansion 
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0001 IDT "ALAWEXP' 
0002 ae 
0003 * "ALAWEXP' PERFORM AN A-LAW EXPANSION. THE 8-BIT 
0004 * DATA INPUT IS 
0005 * 
0006 * Y = P §2 S1 SO Q3 Q2 Ql QO WHICH CONSISTS OF 
0007 * 
0008 * POLARITY BIT: P 
0009 * 3-BIT SEGMENT NUMBER: S = S2 Sl SO 
0010 * 4-BIT QUANTIZATION NUMBER: Q = Q3 Q2 Q1 Q0 
0011 * 
0012 * THE INPUT Y IS EXPANDED INTO A 13-BIT OUTPUT 
0013 * 
0014 * X = P X1l X10 X9 ... X2 X1 X0 CONSISTING OF 
0015 * 
0016 * POLARITY BIT:P 
0017 * AND A 13-BIT MAGNITUDE (X12...X0) 
0018 * 
0019 * PORT 0 IS USED FOR I/O. 
0020 * WORST-CASE TIMING IN CYCLES: 4 INIT / 25 LOOP 
0021 * SPACE REQUIREMENTS IN WORDS: 7 DATA / 48 LOOP 
0022 * 
0023 * CONSTANTS: 
0024 * 
0025 0001 ONE EQU 1 = 1 
0026 0002 BIT7 EQU 2 = >0080 (ONE IN BIT 7) 
0027 * 
0028 * VARIABLES: 
0029 * 
0030 0003 Y EQU 3 A-LAW COMPRESSED 8-BIT DATA INPUT 
0031 0004 xX EQU 4 DECODED (EXPANDED) 13-BIT OUTPUT 
0032 0005 P EQU 5 POLARITY OF DATA (0 FOR POS) 
0033 0006 Ss EQU 6 3-BIT SEGMENT NUMBER 
0034 0007 SUM EQU 7 VALUE TO BE SHIFTED 
0035 * 
0036 0000 AORG 0 
0037 * 
0038 0000 7E01 INIT LACK 1 
0039 0001 5001 SACL ONE 
0040 0002 2701 LAC ONE,7 
0041 0003 5002 SACL BIT? 
0042 * 
0043 * INVERT INPUT 
0044 * 
0045 0004 4003 START IN Y,0 
0446 0005 7EFF LACK >OOFF 
0047 0006 7803 XOR Y 
0048 0007 5003 SACL Y 
0049 * 
0050 * SAVE POLARITY AND STRIP TO LOW 7 BITS 
0051 0008 7902 AND BIT7 
0052 0009 5005 SACL P 0000 FOR POS; 0080 FOR NEG 
0053 000A 7E7F LACK >007F 
0054 000B 7903 AND Y 
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ALAWEXP 


0055 
0056 
0057 
0058 
0059 
0060 
0061 


0062 
0063 
0064 
0065 


0066 
0067 
0068 
0069 
0070 
0071 
0072 
0073 
0074 
0075 
0076 
0077 
0078 
0079 
0080 
0081 


0082 
0083 
0084 
0085 
0086 
0087 
0088 
0089 
0090 
0091 
0092 
0093 
0094 
0095 
0096 
0097 
0098 
0099 


000C 


000D 
000E 
OOOF 
Q010 
0011 


0012 
0013 
0014 
0015 


0016 
0017 
0018 
0019 


OO1A 
001B 
001C 


001D 
OO1E 
001F 
0020 
0021 


0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
002A 
002B 
002C 
002D 
002E 
002F 


NO ERRORS, 
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320 FAMILY MACRO ASSEMBLER 
5003 SACL Y 

* 

* MAGNITUDE IS CORRECT. 
2c03 LAC Y,12 
5806 SACH S 
2006 LAC S 
FEOO BNZ SEGNZ 
0016 

* SEGMENT 0: 

2103 LAC Y,1 
0001 ADD ONE 
F900 B SIGN 
001D 

* NONZERO SEGMENT: SUM 
7E21 SEGNZ LACK 33 
0103 ADD Y,1 
1506 SUB S,5 
5007 SACL SUM 

* 

* SHIFT SUM BY S USING 
7E20 LACK SBASE-2 
0106 ADD S,l 
7F8C CALA 

* 

* ACC = MAGNITUDE. ADD 
0505 SIGN ADD P,5 
5004 SACL X 
4804 OUT x,0 
F900 FIN B FIN 
0020 

* 

* LOAD SUM SHIFTED 0:6 
2007 SBASE LAC SUM, 0 
7F8D RET 
2107 LAC SUM, 1 
7F8D RET 
2207 LAC SUM, 2 
7F8D RET 
2307 LAC SUM, 3 
7F8D RET 
2407 LAC SUM, 4 
7F8D RET 
2507 LAC SUM, 5 
7F8D RET 
2607 LAC SUM, 6 
7F8D RET 

* 

END 


NO WARNINGS 


2.1 83.076 08:29:54 10/26/83 


PAGE 0002 


STRIP Y OF S AND Q. 
SHIFT S INTO HIGH HALF OF ACC 


CHECK FOR SEGMENT 0 


EXPAND X TO 2*Q + 1 


= 2*Q + 33 


REMOVE S BITS 


VARIABLE SHIFT ROUTINE AT SBASE 
OFFSET (MINUS 0 CASE) 

DOUBLE S (2 WDS/SHIFT SEGMENT) 
SHIFT SUM BY S 


POLARITY TO BIT 12. 
SHIFT P.TO BIY 12 


OUTPUT RESULT TO PORT 0 
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Tables 9 and 10 are included to aid in verifying 
particular implementations of the algorithms that have been 
presented. 


Table 9. Segmented ,»-255 Companding* 
(Courtesy of John Wiley & Sons, see Reference 6) 


Segment S Quantization 
010 011 100 101 110 111 Qa 


F000 [001 | BIN 
95 223 479 991 2015 4063 
i ee Eee ae ‘ 
ae ae 239 511 105521434319 Po 
| 3) T3911 255 543 1119 2271 4575 AiG 5 
| 85 | 43 | 119 271 575 1183 2399-4831 
7 47 127 287 607 1247 2527 7 
| 9 | 51 | 135 303 639 1311 2655 5343 Fool 6 
| 11 [ 55 | (143 319 671 1375 2783 5599 oo 
151 335 703 1439 2911 5855 
Quantization [S| 59 [187 3357031439 29115855 | ; 
| 15 | 63 | 159 351 735 1503 3039 6111 
Endpoints 1000 8 
167 367 767 1567 3167 6367 F001 
) 19 | 71 [175 383 799 1631 3295 6623 4010 40 
21 75 183 3 1 16 3 
ee ae 3983 95 3423 6879} 
| 23 | 79 | 191 415 863 1759-3551 7135 7160 > 
2 1 
| 25 { 83 | 199 431 895 1823 3679 7391 
207 447 927 1887 3807 7647 104 
| 29 | 91 | 215 463 959 1951 3935-7903 saaa ae 
| 31 | 95 {| 223 479 991 2015 4063 8159 a ae 


*(1) Sample values are referenced to a full-scale value of 8159. (2) Negative samples are encoded in sign-magnitude format with a polarity 
bit of 1. (3) In actual transmission the codes are inverted to increase the density of 1’s when low signal amplitudes are encoded. (4) 
Analog output samples are decoded as the center of the encoded quantization interval. (5) Quantization error is the difference between 
the reconstructed output value and the original input sample value. 


Table 10. Segmented A-Law Companding 
(Courtesy of John Wiley & Sons, see Reference 6) 


010 011 100 101 110 111 BIN a 
64 128 256 512 1024 2048 i ee 
68136272 1088 2176 
7 14 1152 304 
| 72144288 57611522304 |} 
11 


544 
: saa 
80 160 320 640 1280 2560 poor 


2 

0100 4 

84 168 336 672 1344 2688 O101B 
88 176 352 704 1408 2816 0110 6 

2 184 368 736 1472 2944 

6 


9 
9 192 384 768 1536 3072 


100 0 400 1600 3200 ~~ : 
04 8 
3 


Quantization 
Endpoints 


20 800 
[104208416 ~~ —it6ea age *f OO 
1082164828728 ase | OOO 
[112224448096 17923584 |] 108 

116 232 464 928 1856 3712 

120 240 480 960 1920 3840 P1101 13 
124 248 496 992 1984 3968 ec a Se 
128 256 512 1024 2048 4096 Pe ll 


_ _ 
o NO 
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1. 
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APPENDIX 
COMPANDING ROUTINES FOR THE TMS32020 


This appendix provides companding programs for the TMS32020. The basic theory and operation are similar to what is described 
for the TMS32010 in the major portion of this report. 

Programs included in the appendix for y-law and A-law expansion and compression utilize the serial port of the TMS32020 
for 8-bit serial I/O. The routines are interrupt-driven to allow direct interfacing to a codec, such as the Texas Instruments 
TCM2913 (see the data sheet for further information). The following paragraphs briefly describe each of the programs included 
in the appendix. 

The first program reads a p-law value from the Data Receive Register (DRR), expands it to a 14-bit linear value, and 
writes it to location X. This value is then compressed back to an 8-bit p-law value and written to the Data Transmit Register 
(DXR). Since p-law codecs invert all bits of the 8-bit value for transmission, XORK (exclusive-OR immediate with accumulator 
with shift) instructions are used to perform inversion in the TMS32020 before expansion and after compression. Note also 
that the LACT (load accumulator with shift specified by T register) instruction is useful for performing the conditional shift 
implemented in the TMS32020 program by a computed subroutine call (CALA). The compression routine of the program 
assumes a left-justified 14-bit value within the accumulator. Therefore, the value stored in X is left-shifted twice before the 
compression routine is entered. The NORM (normalize contents of the accumulator) instruction is then used to find the MSB 
of the accumulator by performing an in-place accumulator left-shift if the two MSBs are the same. At the same time, the 
count of the left-shifts is maintained in auxiliary register 0 (ARO) and used to compute the segment number S. 

The second program performs A-law expansion and compression, and is similar to the u-law program. For A-law 
transmission, however, only the even-order bits of the 8-bit value are inverted. Note that the LACT and NORM instructions 
are still used to compute the expanded and compressed values. 

The third program is an example of a simplified solution to y-law expansion. The 8-bit p-law value is used as an index 
into a 256-word table of 14-bit linear values accessed by the TBLR (table write) instruction. This lookup-table approach 
may also be utilized for A-law expansion. While this method is obviously the fastest method of performing expansion, it 
is also the most inefficient in terms of program memory requirements. 
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NOS I1DT 


000} 
0002 
0003 
0004 
0005 
0006 
0007 


200 


0000 
0000 
0001 
OOI1A 
OOI1A 
0016 
oo1Ic 
001D 
OOLE 
OOIF 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
002A 
002B 


002C 
002D 


OO02E 
O02F 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 16:02:54 11-18-85 


0000 
0001 
0004 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 


FF 80 
0400 


9800 
2C00 
D404 
7FO00O 
D406 
7FO00 
6860 
4460 
0B61 
CE18 
6C63 
3C60 
4263 
1061 
F980 
002B 
CE23 
6062 


2262 
6062 


4062 
F380 
0040 
4867 
3065 
CB06 
CEA2 
DE04 
FOOO 
6866 


PAGE 0001 


POR RRR SER ESAS ESA RE RER RRS ER EER REAR AR EEE RRR ERR ERRSERRA RRS 


* ee & & & 


* 


DRR 
DXR 
IMR 
S 
BIAS 
x 
SUM 
SIGN 
NEG7 
Q 
BIAS 


RSVE 


EXPA 


POSV 
% 
JSTI 


COMP 


U-LAW EXPAND/COMPRESS CODEC LOOPBACK PROGRAM. UPON 
RECEIVING AN RINT INSTRUCTION, DRR IS READ, EXPANDED, 
AND WRITTEN TO DXR, AT WHICH TIME THE 
PROCESSOR IDLES UNTIL ANOTHER RINT INSTRUCTION IS 
GENERATED. THE FIRST WORD TRANSMITTED IS A ZERO. 


TORS ESEURES ERE SERER ESE ESA EEE RRS EE RAE AEE EES SL 


COMPRESSED BACK, 


EQU 
EQU 


2 EQU 
CT B 


ND BIT 


AL SACL 


FY LAC 


RS ZALH 


INIT 


26 
DRR,8 
DRR,12 
>7F 00,4 


>7F 00,4 


S 
S 
BIAS, 11 


SUM,4 
S 

SUM 
BIAS 
POSVAL 


x 
X»2 

X 

x 

NEGCMP 
BIAS2 
ARO,NEG7 
6 
>F000,14 


Q 


PROGRAM 1 


**e* eee &© & 


* U-LAW SEGMENT NUMBER 
* = 33 


" = >FFFO 
* U-LAW QUANTIZATION BIN 


* TEST ORR FOR SIGN 
* ZERO SIGN AND OTHER MSBS 
* INVERT ALL BITS 


* ZERO ACCH 


* POSITIVE IF TC = 1 


* LEFT-JUSTIFY 14-BIT NUMBER 


* U-LAW COMPRESS # IN ACCH 


* (# ALREADY LEFT-JUSTIFIED) 
* FIND MSB 


* ZERO 2 MSBS & ALL LSBS 
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PAGE 0002 
0052 0038 7060 SAR ARO,S 
0053 0039 4060 ZALH $ 
0054 003A CEIB ABS 
0055 003B 0266 ADD Q,2 
0056 003C D406 XORK >FFOO,4 * INVERT ALL BITS 
003D FFOO 
0057 003E FF8O B SATCH 
OO3F O04E 
0058 * 
0059 0040 CE1B NEGCMP ABS 
0060 0041 4867 ADDH BIASZ2 * (# ALREADY LEFT-JUSTIFIED) 
0061 0042 3065 LAR  ARO,NEG7 
0062 0043 CB06 RPTK 6 * FIND MSB 
0063 0044 CEA2 NORM 
0064 0045 DEO4 ANDK >F000,14 * ZERO 2 MSBS & ALL LSBS 
0046 FOOO 
0065 0047 6866 SACH Q 
0066 0048 7060 SAR ARO,S 
0067 0049 4060 ZALH S$ 
0068 004A CE1B ABS 
0069 004B 0266 ADD Q,2 
0070 004C D406 XORK >7FO0,4 * INVERT ALL BITS IN Q 
004D 7F00 
0071 * (P=0 FOR NEGATIVE VALUES) 
0072 O04E 6CO! SATCH SACH DXR,4 
0073 * 
0074 O04F CEQO WAIT EINT 
0075 0050 CEIF IDLE * WAIT FOR RINT 
0076 * 
0077 * 
0078 * INITIALIZATION ROUTINE 
0079 " 
0080 " 
0081 0400 AORG 1024 
0082 0400 C800 INIT LDPK 0 * POINT DP TO B2 AND MMRS 
0083 0401 CAI10 LACK >10 
0084 0402 6004 SACL IMR * ENABLE RINT BUT DISABLE 
0085 * ALL OTHERS 
0086 0403 CEOF FORT 1 * CONFIGURE SERIAL PORT TO 
0087 " BYTE MODE 
0088 0404 CE03 SOVM 
0089 0405 CEO7 SSXM 
0090 0406 CE08 SPM 0 
0091 0407 CA2! LACK 33 
0092 0408 6061 SACL BIAS 
0093 0409 2261 LAC BIAS,2 
0094 040A 6067 SACL BIAS2 
0095 040B DOO! LALK -7 
040C FFF9 
0096 040D 6065 SACL NEG7 
0097 040E CA0O LACK 0 
0098 040F 6000 SACL DRR “ ZERO MSBS OF DRR 
0099 0410 6001 SACL DXR * ZERO DXR FOR FIRST 
0100 “ TRANSMIT OPERATION 
0101 0411 5588 LARP 0 * ZERO ARP 
0102 0412 5588 LARP 0 * AND ARB 
0103 0413 CEO EINT 
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0104 0414 CEIF IOLE * WAIT FOR RINT 


0105 END 
NO ERRORS, NO WARNINGS 
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NOS IDT 


0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
0020 
0021 
0022 
0023 


0024 
0025 
0026 
0027 


0028 


0029 
0030 
0031 
0032 


0033 
0034 
0035 


0036 
0037 
0038 


0039 
0040 
0041 
0042 
0043 
0044 
0045 
0046 


0047 
0048 
0049 
0050 
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0000 
oco0 
0001 
OO1A 
OOLA 
001B 
QOIC 
0010 
OO1E 
OOIF 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
002A 
002B 
002C 


002D 
OO2E 
002F 
6030 
0031 
0032 
0033 
0034 
0035 
0036 


0037 
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0000 
000) 
0004 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 


FF8O 
0400 


9800 
2C00 
D404 
7F 00 
D406 
5500 
6860 
4460 
3060 
FB90 
002D 
OB67 
CE18 
F880 
002A 
CE23 
6C62 
FF 80 
0037 


7060 
OB61 
CE18 
6C63 
3C60 
4263 
F880 
0036 
CE23 
6062 


2362 


1.0 85.157 16:01:10 t1-18- 


PAGE 0001 


85 


MH MH HH HH MH HT Ob HE TH UE SE HE Ob Ot 3 OE Te TE Me Ot OE OE OF OE 9 Oe Ot He Ot OE TE 5 Oe He Ot OE TE OE Ot Ot Oe He HE Ot Te Ot EO Ot 


x* * k* * * * *£ 


* 


DRR 
DXR 
IMR 
S 
BIAS 
x 
SUM 
SIGN 
NEG7 
Q 
ONE 


RSVECT 


EXPAND 


PSVAL 1 


SEGNZ 


POSVAL 
* 


JSTIFY 


PROCESSOR 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


>7FO00,4 
>5500,4 
S 


S 
ARO,S 


SEGNZ, *- 


ONE, 11 


PSVAL 1 


X14 
JUSTIFY 


ARO,S 
BIAS, 11 


SUM, 4 
5 

SUM 
POSVAL 


PROGRAM 2 


A-LAW EXPAND/COMPRESS CODEC LOOPBACK PROGRAM. UPON 
RECEIVING AN RINT INSTRUCTION, DRR IS READ, EXPANDED, 
COMPRESSED BACK, AND WRITTEN TO DXR, AT WHICH TIME THE 
IDLES UNTIL ANOTHER RINT INSTRUCTION 15S 
GENERATED. THE FIRST WORD TRANSMITTED IS A ZERO. 


9 4 Ft Tt He Me St HG UE HE OE OE He FE OE TE DE OF 2G OE 9 OF OF FE 4 Hb FE HE OF Ob Ob OF tb te tb Ob SE Ob He Ot Ot OE OE Ob UE Oe Ht He TE Ot TE Ot OE Ot 


* A-LAW SEGMENT NUMBER 
* = 33 


* = >FFFO 
* A-LAW QUANTIZATION BIN 


* TEST DRR FOR SIGN 
* ZERO SIGN AND OTHER MSBS 


* INVERT EVEN-ORDER BITS 


* ZERO ACCH 


* TEST FOR SEGMENT NUMBER 


* STORE DECREMENTED S 


* * © ke &€ & 8 & 


0 


203 


NO$ I OT 


0051 
0052 
0053 
0054 


0055 
0056 
0057 
0058 


0059 
0060 
0061 


0062 
0063 


0064 
0065 
0066 
0067 
0068 
0069 


0070 
0071 


0072 
0073 


204 


0038 


0039 
OO3A 
003B 
003C 
003D 
OO03E 
CO3F 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 
004A 
004B 
004C 
004D 
O04E 
OO4F 
0050 
0051 


0052 
0053 
0054 
0055 
0056 
0057 
0058 
0059 
OO5A 
005B 


Q0SC 
005D 
OOS5E 
OOSF 
0060 
q061 
0062 
0063 
0064 
0065 
0066 


0067 


0068 
0069 


32020 FAMILY MACRO ASSEMBLER PC 1.0 


6062 


4062 
F380 
0052 
3065 
CB06 
CEA2 
FB80 
0047 
6866 
2166 
D406 
5500 
FF80 
0067 
DEO04 
FOOO 
6866 
7060 
4060 
CE1B 
0266 
D406 
5500 
FF80 
0067 


CE1B 
3065 
CB06 
CEA2 
FB8O 
OOSE 
6866 
2166 
D406 
D500 


FF80 
0067 
DEO4 
FOOO 
6866 
7060 
4060 
CE1B 
0266 
D406 
D500 


6CO0 1 


CEO0 
CE IF 


* 


COMPRS 


SEGNZ1 


* 


NEGCMP 


SEGNZ2 


SACL 


ZALH 
BLZ 


LAR 

RPTK 
NORM 
BANZ 


SACH 
LAC 
XORK 


X 


Xx 
NEGCMP 


ARO,NEG7 
6 


SEGNZ1,* 
Q 

Q,1 
>5500,4 
SATCH 
>F000,14 
Q 


ARO;S 
S 


Q,2 
>5500,4 


SATCH 


ARO,NEG7 
6 


SEGNZ2,* 
Q 


Q,1 
>D500,4 


SATCH 
>F000,14 
Q 

AROQ,S 

> 

Q,2 
>D500,4 


DXR,4 


85.157 


16:01:10 11-18-85 
PAGE 0002 


LEFT-JUSTIFY 13-BIT NUMBER 


A-LAW COMPRESS # IN ACCH 


FIND MSB 


SEGMENT NUMBER = O 


INVERT EVEN-ORDER BITS 


ZERO 2 MSBS & ALL LSBS 


INVERT EVEN-ORDER BITS 


FIND MSB 


SEGMENT NUMBER = 0 
INVERT EVEN-ORDER BITS 


AND SET SIGN BIT TO 1 


ZERO 2 MSBS & ALL LSBS 


INVERT EVEN-ORDER BITS 


AND SET SIGN BIT TO 1 


WAIT FOR RINT 
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PAGE 0003 

0096 i INITIALIZATION ROUTINE 

0097 - 

0098 _ 

0099 0400 AORG 1024 

0100 0400 C800 INIT LDPK 0Q * POINT DP TO B2 AND MMRS 

0101 0401 CAI1O LACK >10 

0102 0402 6004 SACL IMR * ENABLE RINT BUT DISABLE 

0103 : ALL OTHERS 

0104 0403 CEOF FORT 1 * CONFIGURE SERIAL PORT TO 

0105 sd BYTE MODE 

0106 0404 CEO3 SOVM 

0107 0405 CEO7 SSXM 

0108 0406 CEOS SPM 0 

0109 0407 CAO! LACK 1 

0110 0408 6067 SACL ONE 

0111 0409 CA2! LACK 33 

0112 040A 6061 SACL BIAS 

0113 040B DOO! LALK -7 

040C FFF9 

0114 040D 6065 SACL NEG7 

0115 O40E€ CAOO LACK 0 

0116 O40F 6000 SACL ORR * ZERO MSBS OF DRR 

0117 0410 6001 SACL OXR * ZERO DXR FOR FIRST 

0118 - TRANSMIT OPERATION 

0119 0411 5588 LARP 0 * ZERO ARP 

0120 0412 5588 LARP 0 * AND ARB 

0121 0413 CEOO EINT 

0122 0414 CEIF IDLE * WAIT FOR RINT 

0123 END 

NO ERRORS, NO WARNINGS 
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NO$IDT 


0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 


0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
0040 
0041 
0042 
0043 


0044 
0045 
0046 
0047 
0048 
0049 
0050 
0051 
0052 
0053 
0054 
0055 


206 


0000 
goo00 
0001 
OO1A 
OOILA 
001B 


0oiICc 


0010 
OOIE 


OOI1F 
0020 
0021 


0022 


0023 
0024 
0025 
0026 
0027 
0028 
0029 
002A 
002B 


o02C 
002D 
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0000 
ooo! 
0004 
0060 
0061 


FF80 
OOIF 
2000 
0061 
5860 


CEOO 
CE IF 
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[RRP REE RRESEERERARRAR RRR E RRR REE RE RER ER REAR REE RRR RE REAR E RE RERERS E E 


PROGRAM 3 


* 
* 
U-LAW TABLE LOOKUP EXPANSION PROGRAM. UPON RECEIVING 7 
AN RINT INSTRUCTION, DRR IS READ, EXPANDED, AND * 
WRITTEN TO THE STORAGE LOCATION X, AT WHICH TIME THE . 
PROCESSOR IDLES UNTIL ANOTHER RINT INSTRUCTION IS if 
GENERATED. OXR IS SET TO ZERO SO THAT ALL WORDS es 
TRANSMITTED ARE ZEROES. ‘ 

* 


(ee Ree RE EERE SRE REE RES E ERE RRR RRR E RE RE RERSEREEREERRESEE S 


*=*e *« &€ Ye & & 


* 


DRR EQU 0 
DXR EQU | 
IMR EQU 4 
x EQU 96 
BADDR EQU 97 * CONTAINS BASE ADDR FOR 
" TABLE 
AORG 0 
RSVECT B INIT 
AORG 26 
EXPAND LAC  DRR 
ADD BADODR * ADD TABLE BASE ADDR FOR 
“ U-LAW EXPANSION TABLE 
* LOOKUP 
TBLR X * READ INTO LOCATION X 
* 
EINT 
IDLE * WAIT FOR RINT 
* 
" 
* INITIALIZATION ROUTINE 
* 
% 
INIT LOPK 0 * POINT DP TO B2 AND MMRS 
LACK >10 
SACL IMR * ENABLE RINT BUT DISABLE 
" ALL OTHERS 
FORT 1 * CONFIGURE SERIAL PORT TO 
" BYTE MODE 
SOVM 
SSXM 
SPM 0 
LALK XTBL 
SACL BADDR “ BASE ADDR FOR TBL LOOKUP 
LACK 0 
SACL ORR * ZERO MSBS OF DRR 
SACL DXR * ZERO OXR FOR FIRST 
* TRANSMIT OPERATION 
EINT 
IDLE * WAIT FOR RINT 


TABLE FOR U-LAW EXPANSION TABLE LOOKUP 


* &@ ee * & 
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NO$I1DT 


0056 
0057 
0058 
0059 
0060 
0061 
0062 
0063 
0064 
0065 
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0300 


0300 
0301 
0302 
0303 
0304 
0305 
0306 
0307 
0308 
0309 
030A 
030B 
030C 
0300 
O30E 
O30F 
0310 
0311 
0312 
0313 
0314 
0315 
0316 
0317 
0318 
0319 
O31A 
0318 
031C 
0310 
O31E 
O31F 
0320 
0321 
0322 
0323 
0324 
0325 
0326 
0327 
0328 
0329 
032A 
032B 
032C 
032D 
032E 
032F 
0330 
0331 
0332 
0333 
0334 
0335 
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0300 XTBL 
* 


EOA1 
E1A1 
E2A1 
E3A1 
E4A1 
ESA1 
E6A1 
E7A1 


E8A1 


E9A1 
EAA1 
EBA 
ECA] 
EDA1 
EEA1 
EFA] 
FO61 
FOE 1 
F161 
FIE1 
F261 
F2E 1 
F361 
F361 
F461 
F4E1 
F561 
FSE1 
F661 
F6E 1 
F761 
F7E1 
F841 
F881 
F8Cl 
F901 
F941 
F981 
FOCI 
FAQ1 
FA41 
FABI 
FAC1 
FBO] 
FB41 
FB81 
FBC 
FCO1 
FC31 
FC51 
FC71 
FC91 
FCB1 
FCD1 


AORG 
EQU 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


768 
$ 


>EDAI 
>ELA! 
>EZAL 
>ESAI 
>E4A1 
>E5A1 
>E6A1 
>ETAI 
>EBAL 
»ESAL 
>EAAL 
>EBAI 
>ECA1 
»EDA! 
>EEAI 
>EFAI 
>F O61 
>FOE1 
>F 161 
>FIEI 
>F261 
>F2E} 
>F 361 
>F3E1 
>F 461 
>F 4E1 
>F 561 
>F5E1 
>F 661 
>F6E1 
>F 761 
>F7E} 
>F 841 
>F 881 
>F8C1 
>F 901 
>F941 
>F 981 
>F9C1 
>FAQ!) 
>FA41 
>FA81 
>FAC1 
>FBO! 
>FB41 
>F BS! 
>FBC 1 
>»FCO] 
>FC31 
>FCS] 
>FC71 
>FC9I 
>FCB1 
>FCD! 


PAGE 0002 


* NEGATIVE VALUES FIRST 


(FF, FE, ETC.) 
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NO$IDT 


0113 
0114 
O115 
0116 
0117 
0118 
0119 
0120 
0121 
0122 
0123 
0124 
0125 
0126 
0127 
0128 
0129 
0130 
0131 
0132 
0133 
0134 
0135 
0136 
0137 
0138 
0139 
0140 
0141 
0142 
0143 
0144 
0145 
0146 
0147 
0148 
0149 
0150 
0151 
0152 
0153 
0154 
0155 
0156 
0157 
0158 
0159 
0160 
0161 
0162 
0163 
0164 
0165 
0166 
0167 
0168 
0169 
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0336 
0337 
0338 
0339 
033A 
033B 
033C 
033D 
O33E 
033F 
0340 
0341 
0342 
0343 
0344 
0345 
0346 
0347 
0348 
0349 
034A 
0348 
034C 
034D 
O34E 
O34F 
0350 
0351 
0352 
0353 
0354 
0355 
0356 
0357 
0358 
0359 
035A 
035B 
035C 
035D 
O35E 
O35F 
0360 
0361 
0362 
C363 
0364 
0365 
0366 
0367 
0368 
0369 
036A 
036B 
036C 
036D 
OQ36E 
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FCF I 
FOL 
FD31 
FD51 
FO71 
FD91 
FOB! 
FDD1 
FOF 1 
FE11 
FE29 
FE39 
FE49 
FE59 
FE69 
FET79 
FE89 
FE99 
FEAQ 
FEBS 
FEC9 
FEDS 
FEES 
FEF9 
FFO9 
FF19 
FF25 
FF2D 
FF35 
FF3D 
FF45 
FF 4D 
FF55 
FF5D 
FF65 
FF6D 
FF75 
FF7D 
FF85 
FF8D 
FF9S 
FF9D 
FFA3 
FFAT 
FFAB 
FFAF 
FFB3 
FFB7 
FFBB 
FFBF 
FFC3 
FFC7 
FFCB 
FFCF 
FFD3 
FFD7 
FFDB 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


>FCF 1 
>FD11 
>FD31 
>FD51 
>FD71 
>FD91 
>FDBI 
>FDD! 
>FDF | 
>FEI1 
>FE29 
>FE39 
>FE49 
>FE59 
>FE69 
>FET79 
>FE89 
>FE99 
>FEAS 
>FEBS 
>FEC9 
>FED9 
>FEES 
>FEFS 
>FFO9 
>FF19 
>FF25 
>FF2D 
>FF35 
>FF3D 
>FF45 
>FF 4D 
>FF55 
>FF5D 
>FF65 
>FF6D 
>FF75 
>FF7D 
>FF85 
>FF8D 
>FF9S 
>FFOD 
>FFA3 
>FFAT 
>FFAB 
>FFAF 
>FFB3 
>FFB7 
>FFBB 
>FF BF 
>FFC3 
>FFCT7 
>FFCB 
>FFCF 
>FFD3 
>FFD7 
>FFDB 
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PAGE 0004 
0170 O36F FFDF DATA >FFDF 
0171 0370 FFE2 DATA >FFE2 
0172 0371 FFE4 DATA >FFE4 
0173 0372 FFE6 DATA >FFE6 
0174 0373 FFE8 DATA >OFFEB 
0175 0374 FFEA DATA >FFEA 
‘0176 0375 FFEC DATA >FFEC 
0177 0376 FFEE DATA >FFEE 
0178 0377 FFFO DATA >FFFO 
0179 0378 FFF2 DATA >FFF2 
0180 0379 FFF4 DATA >FFF4 
0181 O37A FFF6 DATA >FFF6 
0182 037B FFF8 DATA >FFF8 
0183 O037C FFFA DATA >FFFA 
0184 0370 FFFC DATA >FFFC 
0185 O37E FFFE DATA >FFFE 
0186 O37F OOOO DATA >0O 
0187 2 
0188 0380 IFSF DATA >1F5F * POSITIVE VALUES NEXT 
0189 0381 ILE5F DATA >1ESF * (POLARITY BIT = 1) 
0190 0382 1D5F DATA >I1D5F 
0191 0383 ICSF DATA >1CSF 
0192 0384 1B5F DATA >1B5F 
0193 0385 1ASF DATA >I1A5F 
0194 0386 195F DATA >195F 
0195 0387 185F DATA >185F 
0196 0388 175F DATA >175F 
0197 0389 165F DATA >165F 
0198 038A 155F DATA >155F 
0199 038B 145F DATA >145F 
0200 O038C 135F DATA >135F 
0201 O38D 125F DATA >125F 
0202 O38E 115F DATA >115F 
0203 O38F !105F DATA >105F 
0204 0390 OF9OF DATA >FO9OF 
0205 0391 OF IF DATA OFIF 
0206 0392 OE9SF DATA >E9F 
0207 0393 OEIF DATA EIF 
0208 0394 OD9F DATA >D9F 
0209 0395 ODIF DATA >DIF 
0210 0396 OC9F DATA »>C9F 
0211 0397 OCIF DATA >CIF 
0212 0398 OB9SF DATA >B9F 
0213 0399 OBIF DATA >BIF 
0214 O39A OA9SF DATA >A9F 
0215 039B OAIF DATA >AIF 
0216 039C O99F DATA >99F 
0217 039D O9IF DATA >9I1F 
0218 O39E O89F DATA >89F 
0219 O39F O8I1F DATA >8IF 
0220 O3A0 O7BF DATA >7BF 
0221 O3AL O77F DATA >77F 
0222 O03A2 073F DATA >73F 
0223 O3A3 O6FF DATA >6FF 
0224 03A4 O6BF DATA >6BF 
0225 O3A5 O67F DATA >67F 
0226 O03A6 O063F DATA >63F 
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0227: 
0228 
0229 
0230 
0231 
0232 
0233 
0234 
0235 
0236 
0237 
0238 
0239 
0240 
0241 
0242 
0243 
0244 
0245 
0246 
0247 
0248 
0249 
0250 
0251 
0252 
0253 
0254 
0255 
0256 
0257 
0258 
0259 
0260 
0261 
0262 
0263 
0264 
0265 
0266 
0267 
0268 
0269 
0270 
0271 
0272 
0273 
0274 
0275 
0276 
0277 
0278 
0279 
0280 
0281 
0282 
0283 
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O5SFF 
O58F 
O5S7F 
053F 
O4FF 
048F 
047F 
043F 
O3FF 
O3CF 
O3AF 
038F 
O36F 
034F 
032F 
030F 
O2EF 
O2CF 
O2AF 
028F 
O26F 
O24F 
022F 
020F 
O1EF 
0107 
O1C7 
01B7 
O1A7 
0197 
0187 
0177 
0167 
0157 
0147 
0137 
0127 
0117 
0107 
OOF7 
O0E7 
000B 
00D3 
00cB 
00c3 
008B 
0083 
OOAB 
00A3 
009B 
0093 
008B 
0083 
007B 
0073 
0068 
0063 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


>5FF 
> SBF 
>57F 
>53F 
> 4FF 
> 4BF 
>47F 
>43F 
>3FF 
>3CF 
>3AF 
>38F 
>36F 
>34F 
>32F 
>30F 
>2EF 
>2CF 
>2AF 
>28F 
>26F 
>24F 
>22F 
>20F 
>1LEF 
>1D7 
>1C7 
>1B7 
>1A7 
>197 
>187 
>177 
>167 
>157 
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NO$IDT 


0284 
0285 
0286 
0287 
0288 


0308 
0309 
0310 
0311 
0312 
0313 
0314 
0315 
0316 
0317 


O3E0 
031 
03E2 
03E3 
O3E4 
03E5 
O3E6 
O3E7 
03E8 


0369. 


O3EA 
O3EB 


O3EC 


O3ED 
O3EE 
O3EF 
O3F0 
O3F 1 
O3F2 
03F3 
O3F 4 
O3F5 
O3F6 
O3F7 
O3F8 
O3F9 
O3FA 
O3FB 
O3FC 
O3FD 
O3FE 
O3FF 


NO ERRORS, 
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0050 
0059 
0055 
0051 
004D 


NO WARNINGS 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


END 


>50 
>59 
>55 
>51 
>4D 
>49 
>45 
>41 
>3D 
>39 
>35 
>31 
>2D 
>29 
»25 
>21 
PIE 
>1C 
>1A 
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5. Companding Routines for the TMS32010/TMS32020 


Digital Signal Processing Routines 


6. Floating-Point Arithmetic with the 
TMS32010 


Ray Simar, Jr. 
Digital Signal Processing - Semiconductor Group 
Texas Instruments 
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INTRODUCTION 


The TMS32010 Digital Signal Processor is a fixed-point 
16/32-bit microprocessor. However, it can also perform 
floating-point computations at a speed comparable to 
dedicated floating-point processors. 

The purpose of this application report is to analyze an 
implementation of floating-point addition and multiplication 
on the TMS32010. The floating-point single-precision stan- 
dard proposed by the IEEE will be examined. Using this 
standard, the TMS32010 performs a floating-point multipli- 
cation in 8.4 microseconds and a floating-point addition in 
17.2 microseconds. 

To illustrate floating-point formats and the tradeoffs 
involved in making a choice between different floating-point 
formats, a review of floating-point arithmetic notation and 
of addition and multiplication algorithms is first presented. 


FLOATING-POINT NOTATION 


The floating-point number f may be written in floating- 
point format as 


f=mxbe 
where 

m = mantissa 

b= base 

e = exponent 


For example, 6,789,320 may be written as 
0.6789320 x 107 

In this case, 
m =0.6789320 
b=10 


e=7 


The two floating-point numbers f, and f2 may be 
written as 


fj =m, xb*l 
f2 =m xb 


6. Floating-Point Arithmetic with the TMS32010 


Floating-point addition/subtraction, multiplication, and 
division for fj and f2 are defined as follows: 


fy +f) =(my tmz xb 1-2) xbel ifeyze7 (1) 
or 
= (my xb~ ©2~-¢1))+mg) xb if ej <e2 
f) x f2 =m Xm xb + 2) (2) 
f}/fp = (myj/mz) x b€1 — 2) (3) 


A cursory examination of these expressions reveals 
some of the factors involved in the implementation of 
floating-point arithmetic. For addition, it is necessary to shift 
the mantissa of the floating-point number which has the 
smaller exponent to the right by the difference in the 
magnitude of the two exponents. This is shown in the 
multiplication by the terms 


b~ (1-2) and pb 2-¢1) 


This right shift can result in mantissa underflow. There 
are also possibilities for mantissa overflow. Addition and sub- 
traction of exponents can lead to exponent underflow and 
overflow. To alleviate underflow and overflow, it is 
necessary to decide on some scheme for roundoff. For a 
detailed description and analysis of underflow and overflow 
conditions and rounding schemes, see reference 1. 

It is desirable to have all numbers normalized, i.e., the 
mantissas of fj and f2 have the most significant digit in the 
leftmost position. This provides the representation with the 
greatest accuracy possible for a fixed mantissa length. The 
result of any floating-point operation must also be normaliz- 
ed. The factors associated with normalization, overflow, and 
other characteristics of floating-point implementations are 
best illustrated with a few examples. 

Consider the addition of two binary floating-point 
numbers f; and f2 where 


fj =0.10100 x 2011 
f2 = 0.11100 x 2001 
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Both of these numbers are normalized, i.e., the first 
bit after the binary point is a 1. Addition requires equal 
exponents, so the fractions are aligned by shifting right the 
one with the smaller exponent and adjusting the smaller 
exponent. This yields 


fo =0.00111 x 2011 
Then, 


£1 + £2 =0.10100 x 2011 +.0.00111 x 2011 
=0.11011 x 2011 =f3 


The sum may overflow the left end by one digit, thus 
requiring a postaddition adjustment or renormalization step. 
Since it is assumed that the register is only of a finite length, 
this renormalization will result in the loss of the lowest order 
bit. 

Another example illustrates the overflow past the most 
significant bit. With an assumed register length of five, let 


fy =0.11100 x 2011 
fp =0.10101 x 2001 


Then, 


0.11100 x20ll=fy 


+ 0,0010101 x 2011 =f5 
T,0000101 x 20IT =f3 


The significance of the two digits underlined in the right 
part of the mantissa is suspect, since it is assumed that the 
corresponding bits of fj are zero. The left underlined digit 
is the overflow past the most significant bit. To finish the 
addition, f3 is shifted to the right and the exponent adjusted 
accordingly. Thus, 


1.0000101 x 2011 =f3 


The shift of the fraction and the adjustment of the exponent 
yield 


0.10000101 x 2100 = f3 
The result may be rounded, giving 

0.10001 x 2100 = f3 
or truncated, giving 

0.10000 x 2100 = f3 

FLOATING-POINT ALGORITHMS 

Multiplication Algorithm 

The algorithm for normalized floating-point multiplica- 


tion is lustrated in Figure 1. This algorithm is an implemen- 
tation of Equation 2 in the section on floating-point notation. 
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The floating-point numbers being multiplied are A and B 
written as 


A=ma xb°A and B=mpxb*B 
The result is 
C=mc xb*C 


For the resulting mc, there are three special cases. The 
mc may be zero, in which case there is a branch to 
Step 10 to set C=0. If mc #0, then the most significant bit 
will be in either the first or second leftmost bit. If the most 
significant bit is in the second leftmost bit, then a left shift 
of mc is necessary (see Step 5). Otherwise, C is already in 
normalized form, and there is a branch to Step 6. 

Step 6 implements the desired rounding scheme. After 
this rounding, it is possible that mc will overflow (see 
Step 7). In this case, it is necessary to right-shift mc one 
bit (see Step 8). Step 9 checks for special cases of ec. If 
there is an overflow or underflow of ec, it is handled in 
Step 10. Otherwise, the result is in range, and the calcula- 
tion is complete. 


mA mB 


:] MULTIPLY MANTISSAS 
mc =MaA <X MB 
eA 6 
2 | ADO EXPONENTS 
ec =ea tog 
3 TEST FOR SPECIAL CASES OF mc 


NORMALIZED 


OVERFLOW 


RIGHT SHIFT mc ONE BIT 
ec=ec+1 


TEST FOR SPECIAL CASES OF ec 


OVERFLOW IN RANGE 
OR UNDERFLOW 


10 SET SPECIAL VALUES OF RESULT 


C=AxB 


Figure 1. Floating-Point Multiplication 


6. Floating-Point Arithmetic with the TMS32010 


Addition Algorithm 

The implementation of normalized floating-point 
addition is more involved than for multiplication. This 
addition algorithm, outlined in Figure 2, is an implementa- 
tion of Equation | in the section on floating-point notation. 


> 


— 
frac a 
<—ore 
OSPOSE OF EXTRA BITS: 
7 
=< 


C=A+8 


Figure 2. Floating-Point Addition 


Step 1 compares eA and ep for determining ec. For 
this illustration of the algorithm, it is assumed that ea sep. 
Step 2 determines the right shift (d) that is required to align 
mA. Step 3 implements this right shift of mA. Step 4 disposes 
of the extra bits of mA by using the desired rounding techni- 
que. The mantissas of A and B are then added in Step 5. 
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Now, things become somewhat more involved. The mc 
may be zero, in which case there is a branch to Step 9 which 
sets ec = 0; a branch to Step 14 sets the special value of the 
result. The mc may overflow, in which case a right shift 
of one is necessary (see Step 7). The mc may have k leading 
zeroes, in which case a left shift of k is required. This nor- 
malization step is generally the most involved and time- 
consuming step to perform. Steps 10, 11, and 12 round mc, 
test for a possible overflow due to the rounding, and adjust 
ec accordingly. Step 13 involves the determination of the 
special case of ec. Finally, after Step 14, the sum C=A+B 
is formed. 


IEEE FLOATING-POINT 
SINGLE-PRECISION FORMAT 


Of interest is a set of formats known as the IEEE stan- 
dard. This IEEE recommended format consists of a variety 
of precision formats (single, double, single-extended, and 
double-extended). The IEEE has also proposed several 
techniques for handling special cases such as overflow, 
underflow, +o, and rounding. For complete details, the 
reader is referred to the proposed IEEE standard .2 

The single-precision format is a 32-bit format consisting 
of a 1-bit sign field s, an 8-bit biased exponent e, and a 23-bit 
fraction f (see Figure 3). The value of a binary floating-point 
number X is determined as follows: 


X =(-1)8 x 2(€- 127) x 1. 


Figure 3. IEEE Floating-Point Single-Precision Format 


The advantage of this format is that it is structured in 
such a way as to provide easy storage and straightforward 
input/output operations on 8-, 16- and 32-bit processors. The 
disadvantage with this format is that the large mantissa will 
generally span several words of memory. 


FLOATING-POINT IMPLEMENTATION 


IEEE Implementation 

The IEEE single-precision format is described here as 
it applies to the addition and multiplication algorithms. In 
these floating-point routines written for the TMS32010, all 
results are truncated to 31 bits. This was done so that the 
user has more flexibility to develop a rounding scheme 
suitable for his application. The representations of + © are 
ignored so that the user can decide how to handle these ex- 
ceptions in a manner that is appropriate for his particular 
application. 
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I/O Considerations 

The first consideration is the internal representation of 
the binary floating-point number. If the number is read into 
the TMS32010 as two 16-bit words, some processing is then 
necessary to put the floating-point number into a represen- 
tation which is easier to process. The representation used 
in the TMS32010 programs in the Appendix is shown in 
Figure 4. This internal representation may be arrived at by 
a simple manipulation of the IEEE bit fields. For this par- 
ticular algorithm, it is assumed that the floating-point number 
is input to the TMS32010 as the four 16-bit fields shown 
in Figure 4. However, the user can easily supply his own 
routine to arrive at this format from two 16-bit inputs to the 
TMS32010 where the inputs contain the IEEE single- 
precision format. 


15 1413121110 9 8 7 6 5 43 2 1 0 


ASIGN (0 if positive 
or —1 if negative) 


16 1413121110 9 8 7 6 5 43 2 1 O 


AHI (NORMALIZED) | 0 1 f (most significant 14 bits) 


15 1413121110 9 8 7 6 5 43 2 1 0 


ALO } Of (least significant 9 bits) oo 0000 


15 1413121110 9 8 7 6 5 43 2 1 90 


AEXP 


Figure 4. Floating-Point Representation 


The format in Figure 4 was chosen to minimize the ex- 
ecution time of the floating-point addition and multiplication 
routines. The format of the result is shown in Figure 5. 
Notice that it is identical to the format in Figure 4 except 
for CLO. CLO has its 16 most significant bits valid for both 
the multiplication and addition routines. 
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15 1413 12 1110 9 8 -7 6 5 43 2 1 0 


15 1413121110 9 8 7 6 5 4 3 2 1 90 


15 14131211109 8 7 6 5 43 2 1 0 


15 14131211109 8 7 6 5 43 2 1 O 


Figure 5. Result Representation 


Normalization 

Since the floating-point addition involves a normaliza- 
tion, a technique similar to a binary search algorithm is us- 
ed in the addition routine in the Appendix. To begin the nor- 
malization routine, note that with the format used for the 
result (see Figure 5), all mantissas can be considered to be 
positive. The binary search for the most significant bit (the 
leftmost 1 since the mantissa is positive) is illustrated in 
Figure 6. 

The first move is to split C into CHI and CLO. If 
CHI +0, then the most significant bit (MSB) is in CHI; other- 
wise, it is in CLO. For this example, it is in CHI. The next 
step is to split CHI into C11 and C12. If C11 +0, then the 
MSB is in the eight bits of C11; otherwise, it is in C12. For 
this example, the MSB is in C12. Next, split C12 into C23 
and C24. Again, if C24 #0, then the MSB is in C24; other- 
wise, it is in C23. Since C24 #0, split C24 into C37 and 
C38. Since C37 #0, the MSB is in C37. Finally, splitting 
C37, a simple bit test shows that the MSB is in position 19. 
Using this technique, it is possible to find the MSB in a 32-bit 
field with only five compares. 
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c 
31 30 29 28 27 26 26 24 23 22 21 20 19 18 17 16 15 14131211109 8 76643210 


| 


31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16/15 14 13 121110 9 8 7 6 § 43 210 
0 0000¢000000 0% 1 0 0311411010031 0000832311 
ra 2 : 


c11 ct2 
31 30 29 28 27 26 25 24'23 22 21 20 19 18 17 16 


0 000000 0l0 00031100 


0 000!1 100 


C37 C38 
19 18117 16 


Figure 6. Binary Search 


Added Precision 

As illustrated in Figure 5, the 16 most significant bits 
of CLO are valid, i.e., C is valid for 31 places beyond the 
binary point. Oftentimes the user is not as concerned with 
the IEEE standard as in being certain that he has enough ac- 
curacy for his particular application. Since the TMS32010 
uses 16-bit words, the routines in the Appendix implicitly 
maintain a 30-bit mantissa. They also implicitly use a 16-bit 
exponent. If the user desires this added accuracy and dynamic 
range, then it is readily implementable with no additional 
cost in execution time. The normalization for the addition, 
as mentioned previously, operates over the entire 32-bit ac- 
cumulator. For the strict IEEE format, the user will only 
want to normalize over the 25 most significant bits of the 
accumulator. The structure of the normalization routine 
makes this modification simple. 

The routines in the Appendix make no provision for 
the representations of + o and exponent underflow and 
overflow. The user of the routines should consider the degree 
of significance of these results and the way they should be 
handled for his particular application. Since these routines 
are written to operate at maximum speed, truncation of results. 
is used. If the user desires to implement a rounding scheme, 
then he will also need to check for the possibility of overflow 
due to the rounding scheme. This step is shown in the 
multiplication and addition flowcharts (see Figures 1 and 2). 
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SUMMARY 


The TMS32010 may be used to perform floating-point 
operations with great accuracy, wide dynamic range, and 
high-speed execution. The design engineer has the respon- 
sibility of deciding what type of floating-point format is best 
for his application. To aid in understanding floating-point 
operations, several examples have been given illustrating the 
manipulations necessary to implement floating-point addi- 
tion and multiplication algorithms. Flowcharts for these 
algorithms are also included. The Appendix contains the 
TMS32010 code for the IEEE floating-point single-precision 
format used in multiplication and addition. These same 
routines may also be used without modification to implement 
a format with up to a 30-bit mantissa and a 16-bit exponent 
without any increase in execution time. 
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APPENDIX 


320 FAMILY MACRO ASSEMBLER 2.1 83.076 16:33:40 1/18/84 


KHKKKKEKKEHKEKKEKKEKKKEKKKKKKEKKEEKKKEKKKKKEKEKKKRKKKKKKRKRKKK KKK KKK KEK 


THIS IS A FLOATING-POINT MULTIPLICATION ROUTINE WHICH 
IMPLEMENTS THE IEEE PROPOSED FLOATING-POINT FORMAT ON 
THE TMS32010. 


(eee ee SPER ELESELESSE SSSA SES ESSERE SSEL ESE SESE ESLER ESE SESE LESS SY 


INITIAL FORMAT (ALL 16 BIT WORDS) 


| ALL 0 OR 1 | ASIGN (0 OR -1l) 
lol. 15 BITS | AHI (NORMALIZED) 
iol 9 BITS |--0--| ALO 


| | AEXP (-127 TO 128) 


TO HAVE THIS CORRESPOND TO IEEE FORMAT, INPUT 0.1F * 
2 ** (BE + 1) INSTEAD OF 1.F * 2 ** E AND SUBTRACT 
127 FROM E. 


THE FINAL FORMAT IS THE SAME AS THE INITIAL FORMAT 
EXCEPT THAT FOR CLO WE HAVE: 


| 16 BITS | CLO 


re we me ee ee ee a ae ae awe = ee 


THE 16 BITS OF CLO ARE VALID. ANYTHING PAST THESE 
HAS BEEN TRUNCATED. 


KRHKRKEKKKKKKKKKEKKKKEKKKRKKKKKKRKKK KKK KKRKKKKKKKRKK KKK KKRKRKKKKKRKRK KKK 


WORST CASE (EXCLUDING INITIALIZATION AND I/0): 
8.4 MICROSECONDS 
WORDS OF PROGRAM MEMORY: 72 


RHKKKEKKEKKKEKEKEKKEKEKKEKKEKKKKEKKKEKKKKKKKRKKKEKKKKKKRKKKKKKKKRK KK KKK 


+ te FF FH FH OF OH OH OH OO OOO OOOH OOOH OO HHH 


IDT 'IEEEMULT' 
DSEG 

* 

AEXP BSS 1 

AHI BSS 1 

ALO BSS 1 

ASIGN BSS 1 

BEXP BSS 1 
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0056 
0057 
0058 
0059 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 
0068 
0069 
0070 
0071 
0072 
0073 
0074 
0075 
0076 
0077 
0078 
0079 
0080 
0081 
0082 
0083 
0084 
0085 
0086 
0087 
0088 
0089 
0090 
0091 
0092 
0093 
0094 
0095 
0096 
0097 
0098 
0099 
0100 
0101 
0102 
0103 
0104 
0105 
0106 
0107 
0108 
0109 
0110 
0111 
0112 


0005 
0006 
0007 
0008 
0009 
OO0A 
000B 
000Cc 
000D 
OO00E 
O00F 
0010 
0011 
0012 


0013 
0000 


0000 
0001 
0002 
0003 
0004 
0005 
0006 
0007 


0008 
0009 
000A 
000B 
000C 
000D 
OO0E 
OO0F 
0010 
0011 


0012 
0013 
0014 


0015 
0016 
0017 
0018 
0019 


OO1A 
001B 


001C 


4003 
4000 
4001 
4002 
4007 
4004 
4005 
4006 


6E00 
7EO1 
5010 
7E03 
SOOF 
7F89 
1010 
SQ11 
7TEO2 
5012 


2000 
0004 
5008 


6A02 
6D05 
7F8E 
580D 
500C 


6A01 
6D06 


7F8F 


BHI BSS 1 
BLO BSS 1 
BSIGN BSS l 
CEXP BSS 1 
CHI BSS 1 
CLO BSS 1 
CSIGN BSS l 
TLO BSS 1 
THI BSS 1 
CLOHI BSS 1 
M0003 BSS 1 
ONE BSS 1 
NEGONE BSS 1 
TWO BSS 1 
* 
DEND 
* 
PSEG * BEGIN THE PROGRAM SEGMENT * 
* 
IN ASIGN, PAO * INPUT * 
IN AEXP, PAO 
IN AHI,PAO 
IN ALO, PAO 
IN BSIGN,PAO 
IN BEXP, PAO 
IN BHI,PAO 
IN BLO, PAO 
* * FINISHED THE INPUT ROUTINE * 
* 
* * A LITTLE INITIALIZATION * 
START LDPK 0 
LACK 1 
SACL ONE 
LACK 3 
SACL M0003 
ZAC 
SUB ONE 
SACL NEGONE 
LACK 2 
SACL TWO 
* * DONE WITH THE INITIALIZATION * 
* * ADD EXPONENTS * 
LAC AEXP 
ADD BEXP 
SACL CEXP * CEXP = AEXP + BEXP * 
* * FINISHED ADDING EXPONENTS * 
* * MULTIPLY MANTISSAS * 
LT ALO * FIRST PRODUCT, (ALO * BHI) * 
MPY BHI 
PAC 
SACH THI 
SACL TLO 
* 
LT AHI * SECOND PRODUCT, (AHI * BLO) * 
MPY BLO 
* 
APAC * (ALO * BHI + AHI * BLO) * 
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0113 
0114 
0115 
0116 
0117 
0118 
0119 
0120 
0121 
0122 
0123 
0124 
0125 
0126 
0127 
0128 
0129 
0130 
0131 
0132 
0133 


0134 
0135 
0136 
0137 


0138 
0139 
0140 
0141 
0142 
0143 
0144 


0145 
0146 
0147 
0148 
0149 
0150 
0151 
0152 


0153 


0154 
0155 
0156 
0157 
0158 
0159 
0160 


0161 
0162 
0163 
0164 
0165 
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001D 


OO1E 
0O1F 
0020 


0021 
0022 
0023 


0024 
0025 


0026 
0027 


0028 
0029 
002A 
002B 


002C 


002D 
002E 
002F 
0030 
0031 


0032 


0033 
0034 


0035 
0036 
0037 
0038 
0039 


003A 
003B 
003C 
003D 


003E 
003F 
0040 


7F8F 


600D 
610C 
580D 


6D05 
7F8E 
610D 


5909 
500A 


t+ + + + H % 


* 
* 


SHIFT1 
* 


* 


SETSIN 


APAC 


ADDH THI 
ADDS TLO 
SACH THI 


MPY BHI 
PAC 
ADDS THI 


SACH CHI,1 
SACL CLO 


BNZ OK 


ZAC 
SACL CEXP 
B SETSIN 


LAC CLO,1 


SACL CLO 
LAC ONE, 14 
AND CHI 
BNZ SETSIN 


LAC CEXP 


SUB ONE 
SACL CEXP 


ZALH CHI 
ADDS CLO 
SACH CHI,1 
LAC CLO,1 
SACL CLO 


ZALS ASIGN 
XOR BSIGN 
BNZ NEG 


ZAC 
SACL CSIGN 
B OUTPUT 


* 


x 


* 


* 


HAS THE EFFECT OF 
(AHI * BLO + ALO * BHI) 
* 2 ke -15 * 


(AHI * BHI) * 


* GET RID OF EXTRA SIGN BIT * 


THE (ALO * BLO * 2 ** -30) IS LOST DUE 
TO THE IEEE FORMAT * 


FINISHED MULTIPLYING THE MANTISSAS * 
CHECK SPECIAL CASES AND WRAP THINGS UP 
CHI AND CLO ARE STILL IN THE ACC * 
IF C IS ZERO LOAD CEXP WITH ZERO * 
BRANCH TO SET THE SIGN * 
* TAKING CARE OF EXTRA SIGN BIT 
AS ABOVE * 
* MASK OFF POSSIBLE MSB * 
* BRANCH IF NORMALIZATION NOT 
NECESSARY * 


* HERE A LEFT SHIFT OF ONE IS 
NECESSARY * 


* IF ASIGN XOR BSIGN != 0 


THE PRODUCT IS NEGATIVE * 
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0041 0044' 


0166 * 
0167 0042 2011 NEG LAC NEGONE * NEGONE = -1 * 
0168 x 
0169 0043 500B SACL CSIGN 
0170 ig 
0171 0044 490B OUTPUT OUT CSIGN,PA1 
0172 0045 4908 OUT CEXP, PAl 
0173 0046 4909 OUT CHI,PA1 
0174 0047 490A OUT CLO,PA1 
0175 x 
0176 0048 F900 SELF B SELF 
0049 0048' 
0177 . 
0178 m * END THE PROGRAM SEGMENT * 
0179 END 


NO ERRORS, NO WARNINGS 
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IEEEADD 


0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
~ 0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 
0050 
0051 
0052 
0053 
0054 
0055 


224 


0000 


0000 
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0002 
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(eee eee eeeeeEREREREESSE SLES EESLELESESARESSS SELES LES ESS ES ELE SS SE 


THIS IS A FLOATING-POINT ADDITION ROUTINE WHICH 
IMPLEMENTS THE IEEE PROPOSED FLOATING-POINT FORMAT 
ON THE TMS32010. 


KRHAKKKKERRKREEKKEKRKKEEKRKKEEERKRKEREREKEEKREREKEERKEKERKKKRKRRKKRKKRKK 


INITIAL FORMAT (ALL 16 BIT WORDS) 


| ALL 0 OR 1] | ASIGN (0 OR -1) 
lol. 15 BITS | AHI (NORMALIZED) 
10] 9 BITS |--0-| ALO 


| | AEXP (-127 TO 128) 


TO HAVE THIS CORRESPOND TO IEEE FORMAT, INPUT 0.1F * 
2 ** (E + 1) 
INSTEAD OF 1.F * 2 ** E AND SUBTRACT 127 FROM E. 


THE FINAL FORMAT IS THE SAME AS THE INITIAL FORMAT 
EXCEPT THAT FOR CLO WE HAVE: 


| 16 BITS | CLO 


ALL 16 BITS OF CLO ARE VALID. ANYTHING PAST THESE HAS 
BEEN TRUNCATED. 


RRKEKKKKKEKKKEKEKKEKKEKKKKKKKKKKKKKKKKRKKKKEKKEEKEEKEKKKKEKKEKKEKKEKKEKK 


WORST CASE (EXCLUDING INITIALIZATION AND I/O): 
17.2 MICROSECONDS. 

THIS FIGURE INCLUDES THE NORMALIZATION. 

WORDS OF PROGRAM MEMORY: 768 


RHEKKEKKKEKEKEKKEKKKEKEKKEKEKKEKEKKEKEKRHEKRKEKKEKKEEKKREEKEKEKKEKKKKEKKKKRKEK 


+ % % % % HF HF FH HH HH FH HH HH HF HF HH H HF HE HF HF H HF HF HF HF HH HH HH HH HH HH HH 


IDT 'IEEEADD' 
DSEG 

* 

AEXP BSS 1 

AHI BSS 1 

ALO BSS 1 

ASIGN BSS 1 


6. Floating-Point Arithmetic with the TMS32010 


0056 0004 
0057 0005 


0058 0006 
0059 0007 
0060 0008 
0061 0009 
0062 O00A 
0063 000B 
0064 000C 
0065 000D 
0066 OO0E 
0067 OOOF 


0068 0010. 


0069 0011 
0070 0012 
0071 0013 
0072 0014 
0073 0015 
0074 0016 
0075 0017 
0076 0018 
0077 0019 
0078 OO1A 
0079 001B 
0080 O001C 
0081 001D 
0082 OO1E 
0083 OO1F 
0084 0020 
0085 0021 
0086 0022 
0087 0023 
0088 0024 
0089 0025 
0090 0026 
0091 0027 
0092 0028 
0093 0029 
0094 002A 
0095 002B 
0096 002C 
0097 002D 
0098 002E 
0099 002F 
0100 0030 
0101 0031 
0102 0032 
0103 0033 
0104 0034 
0105 0035 
0106 0036 
0107 0037 
0108 0038 
0109 0039 
0110 003A 
0111 003B 


BEXP BSS 1 
BHI BSS 1 


BLO BSS 1 
BSIGN BSS 1 
CEXP BSS 1 
CHI BSS 1 
CLO BSS 1 
CSIGN BSS 1 
Cll BSS 1 
C12 BSS 

C21 BSS 

C22 BSS 

C31 BSS 


1 


SHIFT1 BSS 1 
SHIFT2 BSS 1 
AHITL BSS 1 
BHITL BSS 1 
Nl BSS 
N2 BSS 
N3 BSS 
N4 BSS 
NS BSS 
N6 BSS 
N7 BSS 
N8 BSS 
N9 BSS 
N10 BSS 
N11 BSS 
N12 BSS 
N13 BSS 
N14 BSS 
N15 BSS 
N16 BSS 
N17 BSS 
N18 BSS 
N19 BSS 
N20 BSS 
N21 BSS 
N22 BSS 
N23 BSS 
N24 BSS 
N25 BSS 
N26 BSS 
N27 BSS 
N28 BSS 


el ee ee oe ee 


ee ee ee oe 
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0112 
0113 
0114 
0115 
0116 
0117 
0118 
0119 
0120 
0121 
0122 
0123 
0124 
0125 
0126 
0127 
0128 
0129 
0130 
0131 
0132 
0133 
0134 
0135 
0136 
0137 
0138 


0139 
0140 
0141 
0142 
0143 
0144 
0145 
0146 
0147 
0148 
0149 
0150 
0151 
0152 
0153 
0154 
0155 
0156 
0157 
0158 
0159 
0160 
0161 
0162 
0163 
0164 
0165 
0166 
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003C 
003D 
003E 
003F 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 
004A 
004B 
004C 
004D 
004E 
O04F 
0050 


0051 
0000 


0000 
0001 


0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
OO0A 
000B 
000C 
000D 
OO0E 
OOOF 
0010 


0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
OO1A 
001B 


F900 
0011' 


0002 
0004 
0008 
0010 
0020 
0040 
0080 
0100 
0200 
0400 
0800 
1000 
2000 
4000 
8000 


6E00 
7EO1 
503E 
7020 
711D 
7F89 
103E 
6880 
SOA1 
F400 
0017' 


N29 BSS 1 
N30 BSS 1 
ONE BSS 1 
Two BSS 1 
FIFTEN BSS 1 
SIXTEN BSS 1 


M0002 
M0003 
MOOOF 
MOO1F 
MO0O7F 
MOOFF 
MO1FF 
MO3FF 
MO7FF 
MOFFF 
M1FFF 
M3FFF 
M7FFF 
M8000 


BSS 
BSS 
BSS 


BSS 


OS BSS l 
x 


DEND 


* 


PSEG 


* 


* 
SHIFTS 


START 


LOOP 


te ed 


* BEGIN THE PROGRAM SEGMENT * 


B START 


DATA 2 
DATA 4 
DATA 8 
DATA 16 
DATA 32 
DATA 64 
DATA 128 
DATA 256 
DATA 512 
DATA 1024 
DATA 2048 
DATA 4096 
DATA 8192 
DATA 16384 
DATA 32768 


* A LITTLE INITIALIZATION * 


LDPK 0 
LACK 1 

SACL ONE 
LARK ARO,N1 
LARK AR1,29 
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0167 001C 7E02 LACK 2 


0168 001D 5042 SACL M0002 
0169 OO1E 503F SACL TWO 
0170 OO1F 7E03 LACK 3 
0171 0020 5043 SACL M0003 
0172 0021 7EOF LACK 15 
0173 0022 5044 SACL MOOOF 
0174 0023 5040 SACL FIFTEN 
0175 0024 7E10 LACK 16 
0176 0025 5041 SACL SIXTEN 
0177 0026 7E1F LACK 31 
0178 0027 5045 SACL MOOI1F 
0179 0028 7E7F LACK 127 
0180 0029 5046 SACL MO0O7F 
0181 002A 2444 LAC MOOOF,4 
0182 002B 0044 ADD MOOOF 
0183 002C 5047 SACL MOOFF 
0184 002D 083E ADD ONE,8 
0185 O002E 5048 SACL MOI1FF 
0186 002F 093E ADD ONE,9 
0187 0030 5049 SACL MO3FF 
0188 0031 OA3E ADD ONE, 10 
0189 0032 504A SACL MO7FF 
0190 0033 OB3E ADD ONE,11 
0191 0034 504B SACL MOFFF 
0192 0035 O0C3E ADD ONE,12 
0193 0036 504C SACL MIFFF 
0194 0037 OD3E ADD ONE,13 
0195 0038 504D SACL M3FFF 
0196 0039 OB3E ADD ONE, 14 
0197 003A 504E SACL M7FFF 
0198 003B 2F3E LAC ONE,15 
0199 003C 504F SACL M8000 
0200 003D 6A3E LT ONE 
0201 003E 8002 MPYK SHIFTS 
0202 003F 7F8E PAC 
0203 0040 5050 SACL OS 
0204 x 
0205 = * FINISHED INITIALIZATION * 
0206 gi 
0207 0041 4003 IN ASIGN, PAO 
0208 0042 4000 IN AEXP, PAO 
0209 0043 4001 IN AHI,PAO 
0210 0044 4002 IN ALO, PAO 
0211 0045 4007 IN BSIGN, PAO 
0212 0046 4004 IN BEXP, PAO 
0213 0047. 4005 IN BHI,PAO 
0214 9043 4006 IN BLO,PAO 
0215 ad 
0216 0049 2000 LAC AEXP 
0217 004A 1004 SUB BEXP 
0218 004B FAQQ BLZ ALTB * BRANCH IF AEXP < BEXP * 
004C 0093' 
0219 ” 
0220 004D FFOO BZ AEQB * BRANCH IF AEXP = BEXP * 
004E OOEE' 
0221 * 
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0222 
0223 
0224 
0225 
0226 
0227 
0228 
0229 


0230 
0231 
0232 
0233 
0234 
0235 
0236 
0237 
0238 
0239 
0240 
0241 
0242 
0243 
0244 
0245 
0246 
0247 
0248 
0249 
0250 
0251 
0252 
0253 
0254 


0255 
0256 
0257 
0258 


0259 
0260 
0261 
0262 
0263 
0264 
0265 
0266 
0267 
0268 
0269 
0270 
0271 
0272 
0273 
0274 
0275 
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004F 


0050 
0051 


0052 
0053 
0054 
0055 


0056 
0057 


0058 
0059 


O0SA 
005B 
005C 
005D 
005E 
OO5F 


0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 
0068 


0069 
006A 


006B 
006C 
006D 


O06E 
OO6F 
0070 
0971 
0072 
0073 
0074 


0075 
0076 


2102 
5002 


AGTB 


x 
* 


BHIZER 


* 


+ + + H 


SACL D 


LAC AEXP 
SACL CEXP 


LAC FIFTEN 
SUB D 
BLEZ BHIZER 


ADD OS 
TBLR SHIFT1 


LAC BLO,1 
SACL BLO 


LT BHI 
MPY SHIFT1 
PAC 


ABS 


SACH BHI 
SACL BHITL 


LT BLO 
MPY SHIFT1 
PAC 


APAC 
ABS 


ADDH BHITL 
SACH BLO 
LAC ALO,1 
SACL ALO 


B DUN 


ADD ONE 
BLEZ BLOLUZ 


LAC BLO, 2 
SACH BLO 

ZALS BLO 

ADD BHI,1 
SACL BLO 

ZAC 

SACL BHI 


LAC ALO, 1 
SACL ALO 


* D IS THE RIGHT SHIFT NEEDED FOR B 


* THE EXP FOR THE RESULT IS AEXP * 
* A > B SO SHIFT B TO THE RIGHT D * 


* IF D >= 15 BHI IS ZERO * 


10] 15 BITS | BHI 
(Oo! 8 BITS | | BLO 
|! Is 
|| CHANGED 
VV TO 
VV 
VV 
| -O- | 15-D | BHI 
| D | 16-D | BLO 


* FINISHED SHIFT OF B 
BY D TO THE RIGHT * 


* IF D >= 16 BLO LOSES BITS * 


* WE ONLY GET HERE IF D = 15 * 


10] 15 BITS | BHI 
10! 8 BITS | | BLO 
| | 
VV 
| -OQ- | BHI 
| 15 BITS {?1 BLO 
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0276 * 


0277 0077 F900 B DUN 
0078 OOF4' 
0278 * 
0279 x 
0280 0079 FFQQ BLOLUZ BZ SHB16 
007A 008B' 
0281 007B 0040 ADD FIFTEN 
0282 007C FBOO BLEZ BZERO 
007D O0E4' 
0283 - 
0284 O07E 0050 ADD OS 
0285 OO7F 671C TBLR SHIFT1 
0286 = 
0287 0080 6A05 LT BHI 
0288 0081 6D1C MPY SHIFT1 
0289 0082 7F8E PAC 
0290 0083 7F88 ABS 
0291 0084 5806 SACH BLO 
0292 0085 7F89 ZAC 
0293 0086 5005 SACL BHI 
0294 * 
0295 * 
0296 * 
0297 = 
0298 0087 2102 LAC ALO,1 
0299 0088 5002 SACL ALO 
0300 * 
0301 0089 F900 B DUN 
OO8A OOF4' 
0302 si 
0303 008B 2005 SHB16 LAC BHI 
0304 008C 5006 SACL BLO 
0305 008D 7F89 ZAC 
0306 O08E 5005 SACL BHI 
0307 * 
0308 OO8F 2102 LAC ALO,1 
0309 0090 5002 SACL ALO 
0310 * 
0311 0091 F900 B DUN 
0092 OO0F4' 
0312 * 
0313 0093 7F88 ALTB ABS 
0314 as 
0315 0094 501B SACL D 
0316 * 
0317 0095 2004 LAC BEXP 
0318 0096 5008 SACL CEXP 
0319 m 
0320 0097 2040 LAC FIFTEN 
0321 0098 101B SUB D 
0322 0099 FBOO BLEZ AHIZER 
009A 00BO' 
0323 sd 
0324 009B 0050 ADD OS 
0325 009C 671C TBLR SHIFT] 
0326 i 
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* FINISHED SHIFT OF B 


BY D TO THE RIGHT * 


* IF D >= 31, THEN B IS ZERO. 


10] 15 BITS | 
lol 8 BITS | | 
1 | 

VV 

VV 
lO| ---0--- | 
| --0O-- | 31-DIl 


BHI 


BLO 


BHI 


BLO 


* 


* TO SEE WHAT IS GOING ON LOOK AT 


THE PREVIOUS CASE FOR AGTB * 
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0327 
0328 
0329 
0330 
0331 
0332 
0333 
0334 
0335 
0336 
0337 
0338 
0339 
0340 
0341 
0342 
0343 
0344 
0345 
0346 
0347 


0348 
0349 
0350 


0351 
0352. 
0353 
0354 
0355 
0356 
0357 
0358 
0359 
0360 
0361 
0362 


0363 
0364 


0365 
0366 


0367 
0368 
0369 
0370 
0371 
0372 
0373 
0374 
0375 
0376 
0377 
0378 
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009D 
OO9E 


0O9F 
OOA0 
OOA1 
O0A2 
00A3 
OO0A4 


O0A5 
O0A6 
OO0A7 
00A8 
OOA9 
OOAA 
OOAB 
OOAC 
OOAD 


OOAE 
OOAF 


00BO0 
OOB1 
00B2 


00B3 
00B4 
OOBS 
O00B6 
00B7 
00B8 
00B9 
OOBA 
OOBB 


OOBC 
OOBD 


OOBE 
OOBF 
00c0 
00c1l 
00C2 


00C3 
00C4 


00C5 
00C6 
00C7 
00C8 
00Cc9 
OOCA 
00CB 
00CC 


2102 
5002 


6A01 
6D1C 
7F8E 
7F88 
5801 
SO1E 


6A02 
6D1D 
7F8E 
7F8F 
7F88 
601E 
5802 
2106 
5006 


F900 
OOF4' 

* 
003E AHIZER 
FBOO 
OOBE' 

x 
2202 
5802 
2002 
0101 
5002 
789 
5001 
2106 
5006 


F900 
OOF4' 

x 
FFOOQ ALOLUZ 
00DO' 
0040 
FB00 
OODA' 

* 
0050 
671C 


6A01 
6D1C 
7F8E 
7F88 
5802 
7F89 
5001 
2106 


LAC ALO,1 
SACL ALO 


LT AHI 
MPY SHIFT1 
PAC 


ABS 


SACH AHI 
SACL AHITL 


LT ALO 
MPY SHIFT2 
PAC 


APAC 
ABS 


ADDH AHITL 
SACH ALO 
LAC BLO,1 
SACL BLO 

B DUN 


ADD ONE 
BLEZ ALOLUZ 


LAC ALO, 2 


LAC BLO,1 
SACL BLO 


B DUN 


BZ SHA16 


ADD FIFTEN 


BLEZ AZERO 


ADD OS 
TBLR SHIFT1 


LT AHI 
MPY SHIFT1 
PAC 


ABS 


SACH ALO 

ZAC 

SACL AHI 

LAC BLO,1 
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0379 
0380 
0381 


0382 
0383 
0384 
0385 
0386 
0387 
0388 
0389 
0390 
0391 


0392 
0393 


0394 
0395 
0396 
0397 
0398 
0399 
0400 
0401 
0402 
0403 


0404 
0405 
0406 
0407 
0408 
0409 
0410 
0411 
0412 
0413 


0414 
0415 
0416 
0417 
0418 
0419 
0420 
0421 
0422 
0423 
0424 
0425 
0426 


0427 
0428 
0429 


00CD 


00CE 
0OCF 


00DO 
00D1 
00D2 
00D3 


00D4 
0OD5 


00D6 
00D7 


00D8 
00D9 


OODA 
00DB 
00DC 
0ODD 
OODE 
OODF 
00E0 
0OE1 
00E2 
00E3 


O0B4 
00ES 
00E6 
00E7 
O0E8 
00E9 
OOEA 
OOEB 
OOEC 
OOED 


OOEE 
OOEF 
OOFO 
OOF1 
OOF2 
00F3 


OOF4 
OOFS 
OOF6 
0OF7 


0OF8 


SHA16 


* 
AZERO 


* 


BZERO 


AEQB 


DUN 


SACL BLO 


B DUN 


LAC AHI 
SACL ALO 
ZAC 

SACL AHI 


LAC BLO,1 
SACL BLO 


B DUN 


B DUN 


LAC BHI 
SACL CHI 
LAC BEXP 
SACL CEXP 
LAC BLO,1 
SACL CLO 
LAC BSIGN 
SACL CSIGN 
B OUTPUT 


LAC AHI 
SACL CHI 
LAC AEXP 
SACL CEXP 
LAC ALO,1 
SACL CLO 
LAC ASIGN 
SACL CSIGN 
B OUTPUT 


LAC ALO,1 
SACL ALO 
LAC BLO,1 
SACL BLO 
LAC AEXP 

SACL CEXP 


LAC ASIGN 
SUB BSIGN 
BNZ DIFSIN 


ZALH AHI 
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* GO TO DUN * 


* BRANCH IF THERE IS A 


SIGN DIFFERENCE * 
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0430 OOF9 

0431 OOFA 

0432 OOFB 

0433 

0434 OOFC 

0435 OOFD 

0436 OOFE 
OOFF 

0437 

0438 

0439 0100 
0101 

0440 

0441 0102 

0442 0103 

0443 0104 

0444 

0445 0105 

0446 0106 

0447 0107 

0448 

0449 0108 

0450 0109 

0451 O10A 

0452 010B 

0453 010C 

0454 010D 

0455 

0456 O10E 

0457 O10F 

0458 0110 

0459 

0460 0111 

0461 0112 

0462 

0463 0113 
0114 

0464 

0465 0115 

0466 0116 

0467 

0468 0117 
0118 

0469 

0470 0119 
O11A 

0471 

0472 011B 

0473 

0474 011C 

0475 O11D 

0476 O11E 

0477 

0478 O11F 
0120 

0479 0121 
0122 


232 


DIFSIN 


x 
CHBS 
x 


ADDS ALO 
ADDS BLO 
ADDH BHI 


SACH CHI 
SACL CLO 
BZ CZERO 


BGEZ NOOV 


LAC CHI,15 
SACH CHI 
SACL CTEMP 


LAC CHI 
AND M7FFF 
SACL CHI 


LAC CLO,15 
SACH CLO 
LAC CLO 
AND M7FFF 
ADDS -CTEMP 
SACL “CLO 


LAC CEXP 
ADD ONE 
SACL CEXP 


LAC ASIGN 


SACL CSIGN 
B OUTPUT 

LAC ASIGN 
SACL CSIGN 


B NORM 


BLZ CHAS 


ZALH AHI 
ADDS ALO 
SUBS BLO 
SUBH BHI 
BZ CZERO 


BLZ CNEG 


ee ee ee 


CHECKING FOR OVERFLOW DUE TO ADD * 


SHIFT TO RIGHT ONE TO CANCEL OVERFLOW 


CANCEL SIGN EXTENSION * 


CANCEL SIGN EXTENSION * 


DUE TO RIGHT SHIFT * 


FINISHED RIGHT SHIFT * 


* IF < 0 DO (B - A) * 


* po (IAI - |Bl) 
SINCE B < 0 AND A > O * 
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0480 
0481 
0482 
0483 
0484 
0485 
0486 
0487 


0488 
0489 
0490 
0491 
0492 
0493 
0494 
0495 


0496 


0497 
0498 
0499 
0500 
0501 
0502 
0503 
0504 


0505 
0506 
0507 
0508 


0509 
0510 
0511 
0512 
0513 
0514 
0515 
0516 
0517 
0518 
0519 
0520 
0521 
0522 
0523 
0524 
0525 
0526 
0527 
0528 
0529 
0530 
0531 


0123 
0124 


0125 
0126 


0127 
0128 


0129 


012A 
012B 
012C 


012D- 


012E 
012F 
0130 


0131 
0132 


0133 
0134 


0135 
0136 


0137 
0138 
0139 
013A 


013B 
013C 
013D 
013E 
013F 


SACH CHI 
SACL CLO 


ZAC 
SACL CSIGN 


B NORM 


CHAS ZALH BHI * DO (IBI - IAI) 

* SINCE A < 0 AND B > Q # 
ADDS BLO 
SUBS ALO 
SUBH AHI 


BZ CZERO 


BLZ CNEG 


SACH CHI 
SACL CLO 


ZAC 
SACL CSIGN 


B NORM 


* 


CZERO ZAC 
SACL CEXP 
B OUTPUT 


CNEG ABS 
SACH CHI 
SACL CLO 
LAC Nl 
SACL CSIGN 


* 

* * GO TO NORM * 

* 

* 
KRKEKKKKEKREKKEKKEKEKEKEKKEKKEEKKKEKKEKKEEEKKEKKKKKKKKEKKKKKKKKKKKKK 
* 

* NORM DOES THE NORMALIZATION. THAT IS TO SAY THAT IT 
* FINDS THE MSB OF CHI AND CLO. IN THIS CASE THE MSB 

x WILL BE THE FIRST ONE (1) FOUND. THE SEARCH FOR THIS 
* SPECIAL ONE IS DONE WITH A BINARY SEARCH. THE 

* NOTATION USED IS SUMMARIZED HERE: 

* 

* 

* [Sewer sere eeHeese=ss 16 BITS, 2seS=ss2—424 2542" | 

* 

S- Ghashe eee See aoa Eero Sees e See SSeS eee oes soe CHI 
x | CHI OR CLO [OR 
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0532 
0533 
0534 
0535 
0536 
0537 
0538 
0539 
0540 
0541 
0542 
0543 
0544 
0545 
0546 
0547 
0548 
0549 
0550 
0551 
0552 
0553 
0554 
0555 
0556 
0557 
0558 


0559 
0560 
0561 
0562 
0563 
0564 


0565 
0566 
0567 
0568 
0569 
0570 


0571 
0572 
0573 
0574 
0575 
0576 


0577 
0578 


0579 
0580 
0581 
0582 
0583 
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0140 
0141 
0142 


0143 
0144 


0145 
0146 
0147 


0148 
0149 


O14A 
014B 
014C 


014D 
014E 


014F 
0150 
0151 


0152 
0153 


0154 
0155 


0156 


2809 
FFOO 
0229 


580C 
500D 


2C0C 
FFOO 
01AC' 


580E 
500F 


2E0E 
FFOO 
O16F' 


5810 
5011 


2010 
FFOO 
0154' 


F900 
02FC' 


2011 
794F 
FFOO 


BO enn nnn nr ne rn renner nr ennn- CLO 
* 
CR eb eS a ee ee ee oe 8 
x | Cll I C12 [BITS 
Bo nr rn rer nr nme renner cencae EACH 
x 
Fes ts es ae ae ne ne ee me oe ae ee we ee a a a ee a ee 4 
* C21 I | C23 I C24 [BITS 
RO en ne err renner nner nn- EACH 
* 
KK mm ea ae ee ee ee eee a a ee ee ee oe ee 2 
* | C31 I C32 | C33 I C34 | C35 I C36 | C37 I C38 IBITS 
ea a a a il EACH 
* 
Feat ce ae ce ce ae ee a ee we a a i a ee ew ee oe 1 
* b | — Tot bf tf tf — ot ot t ft (arr 
0 (See Se pease ia eee ee eS eee ee eee eee es EACH 
* 
* THE I'S REPRESENT THE BOUNDARY BETWEEN THE ACC HIGH 
* BITS AND THE ACC LOW BITS WHEN THE HIGHER LEVEL IS 
* LOADED INTO THE ACC WITH THE NECESSARY SHIFT TO 
. SPLIT BY A FACTOR OF TWO. 
* WORST CASE: 6.4 MICROSECONDS. 
x 
RHEKKKKEKKKREKKEKKEKKKKKKKKKKKKKKKKKKKKEKKKKKKKKRKRKKKKKKKKKkRKRKKKKEK 
NORM LAC CHI,8 
BZ CLOB * BRANCH IF MSB IS IN CLO * 
* 
SACH Cll * SPLIT THE 16 BIT WORD INTO TWO 
SACL C12 8 BIT WORDS * 
x 
LAC C11,12 
BZ C12B * IF THERE IS A ONE IN Cll IT WILL 
* BE NONZERO * 
SACH C21 
SACL C22 
x 
LAC C21,14 
BZ C22B * IF THERE IS A ONE ETC. * 
* 
SACH C31 
SACL C32 
x 
LAC C31 
BZ C32B * IF THERE IS A ONE ETC. * 
* 
B OUTPUT * THE MSB CANNOT BE IN BIT ONE 
* BECAUSE THIS 
* WAS HANDLED EARLIER * 
C32B LAC C32 
AND M8000 
BZ MSB4 * MSB# MEANS THE MSB IS IN BIT 
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0584 
0585 
0586 
0587 
0588 
0589 
0590 
0591 
0592 
0593 
0594 
0595 
0596 
0597 
0598 


0599 
0600 
0601 
0602 
0603 
0604 
0605 
0606 
0607 
0608 
0609 
0610 
0611 
0612 


0613 
0614 
0615 
0616 
0617 
0618 
0619 


0620 
0621 
0622 


0623 
0624 
0625 
0626 
0627 
0628 
0629 
0630 
0631 
0632 
0633 
0634 
0635 


0157 


0158 
0159 
015A 


015B 
015C 
015D 
O15E 
O15F 
0160 


0161 
0162 


0163 
0164 
0165 


0166 
0167 
0168 
0169 
016A 
016B 
016C 


016D 
016E 


016F 
0170 
0171 


0172 
0173 
0174 


0175 
0176 
0177 


0178 
0179 
O17A 
017B 
017C 
017D 
O17E 
017F 
0180 
0181 


0182 


0163' 


* 


MSB4 


C22B 


LAC Nl . 
ADD CEXP 
SACL CEXP 


ZALH CHI 
ADDS CLO 
ADDH CHI 
ADDS CLO 
SACH CHI 
SACL CLO 


B OUTPUT 


LAC N2 
ADD CEXP 
SACL CEXP 


LAC CLO, 2 
SACL CLO 
SACH CTEMP 
LAC CTEMP 
AND M0003 
ADD CHI,2 
SACL CHI 


B OUTPUT 


LAC C22,2 
SACH C33 
SACH C34 


LAC C33 
BZ C34B 


AND M0002 
BZ MSB6 


LAC N3 
ADD. CEXP 
SACL CEXP 
LAC CLO, 3 
SACL CLO 
SACH CTEMP 
LAC CTEMP 
AND MOOOF 
ADD CHI,3 
SACL CHI 


B OUTPUT 
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# FROM LEFT 
TO RIGHT * 


* LEFT SHIFT OF 1 * 


* LEFT SHIFT OF 2 * 


* BRANCH IF NO ONE * 


* LEFT SHIFT OF THREE * 
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0636 
0637 
0638 
0639 
0640 
0641 
0642 
0643 
0644 
0645 
0646 
0647 
0648 
0649 


0650 
0651 
0652 
0653 
0654 


0655 
0656 
0657 
0658 
0659 
0660 
0661 
0662 
0663 
0664 
0665 
0666 
0667 
0668 


0669 
0670 
0671 
0672 
0673 
0674 
0675 
0676 
0677 
0678 
0679 
0680 
0681 
0682 


0683 
0684 
0685 
0686 
0687 
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0183 


0184 
0185 
0186 


0187 
0188 
0189 
018A 
018B 
018C 
018D 


018E 
018F 


0190 


0191 
0192 
0193 


0194 
0195 
0196 


0197 
0198 
0199 
019A 
019B 
019C 
019D 


019E 
O19F 


01A0 
01Al 
01A2 


01A3 
O1A4 
Q1A5 
O1A6 
01A7 
01A8 
O1A9 


O1AA 
O1AB 


O1AC 
01AD 
O1AE 


MSB6 


MSB8 


C12B 


LAC N4 
ADD CEXP 
SACL CEXP 


LAC CLO,4 
SACL CLO 
SACH CTEMP 
LAC CTEMP 
AND MOOOF 
ADD CHI,4 
SACL CHI 


B OUTPUT 


LAC C34 


AND M8000 
BZ MSB8 


LAC N5 
ADD CEXP 
SACL CEXP 


LAC CLO,5 
SACL CLO 
SACH CTEMP 
LAC CTEMP 
AND MOO1F 
ADD CHI,5 
SACL CHI 


B OUTPUT 


LAC N6 
ADD CEXP 
SACL CEXP 


LAC CLO,6 
SACL CLO 
SACH CTEMP 
LAC CTEMP 
AND MOO1F 
ADD CHI,5 
SACL CHI 


B OUTPUT 
LAC C12,4 


SACH C23 
SACL C24 


fx at 


* LEFT SHIFT OF 4 * 


* LEFT SHIFT OF 5 * 


* LEFT SHIFT OF 6 * 


* “*“ COMPLETES TOP 8 BITS “* * 
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0688 
0689 


0690 
0691 
0692 
0693 
0694 
0695 


0696 
0697 
0698 


0699 
0700 
0701 
0702 
0703 
0704 
0705 
0706 
0707 
0708 
0709 
0710 
0711 
0712 


0713 
0714 
0715 
0716 
0717 
0718 
0719 
0720 
0721 
0722 
0723 
0724 
0725 
0726 
0727 
0728 


0729 
0730 
0731 
0732 
0733 
0734 
0735 
0736 
0737 
0738 
0739 


O1AF 
01B0 
01B1 


01B2 
01B3 


01B4 
01B5 
01B6 


01B7 
01B8 
01B9 


O1BA 
01BB 
01BC 


01BD 
O1BE 
O1BF 
01C0 
01¢c1 
01C2 
01C3 


01C4 
01C5 


01C6 
01C7 
01C8 


01C9 
O1CA 
01CB 
01CC 
01CD 
01CE 
01CF 


01D0 
01D1 
01D2 
01D3 
01D4 
01D4 
01D5 
01D6 
01D7 
01D8 
01D9 
O1DA 
01DB 
01DC 
01DD 


2E14 
FFOO 
O1EC' 


5816 
5017 


2016 
FFOO 
01D0' 


7942 
FFOO 
01C6' 


2026 
0008 
5008 


270A 
SOOA 
S81A 
201A 
7946 
0709 
5009 


F900 
O02FC' 


2027 
0008 
5008 


280A 
500A 
581A 
201A 
7947 
0809 
5009 


2017 
794F 
FFOO 
01E0' 


2028 
0008 
5008 
290A 
SOOA 
S81A 
201A 
7948 
0909 
5009 


* 


MSB10 


C36B 


LAC C23,14 
BZ C24B 


SACH C35 
SACL C36 


LAC C35 
BZ C36B 


AND M0002 
BZ MSB10 


LAC N7 
ADD CEXP 
SACL CEXP 


LAC CLO,7 
SACL CLO 
SACH CTEMP 
LAC CTEMP 
AND M00O7F 
ADD CHI,7 
SACL CHI 


B OUTPUT 


LAC N8 
ADD CEXP 
SACL CEXP 


LAC CLO,8 
SACL CLO 
SACH CTEMP 
LAC CTEMP 
AND MOOFF 
ADD CHI,8 
SACL CHI 


LAC C36 
AND M8000 
BZ MSB12 


LAC NQY 

ADD CEXP 
SACL CEXP 
LAC CLO,9 
SACL CLO 
SACH CTEMP 
LAC CTEMP 
AND MO1FF 
ADD CHI,9 
SACL CHI 
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* LEFT SHIFT OF 7 * 


* LEFT SHIFT OF 8 * 


* LEFT SHIFT OF 9 * 
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0740 
0741 


0742 
0743 
0744 
0745 
0746 
0747 
0748 
0749 
0750 
0751 
0752 
0753 
0754 
0755 


0756 
0757 
0758 
0759 
0760 
0761 
0762 


0763 
0764 
0765 


0766 
0767 
0768 
0769 
0770 
0771 
0772 
0773 
0774 
0775 
0776 
0777 
0778 
0779 


0780 
0781 
0782 
0783 
0784 
0785 
0786 
0787 
0788 
0789 
0790 
0791 
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O1DE 
01DF 


01E0 
01B1 
01E2 


0153 
01E4 
01E5 
01E6 
O1E7 
01E8 
01E9 


O1EA 
01EB 


01EC 
01ED 
O1EE 


O1EF 
01F0 
O1lF1 


01F2 
01F3 
O1F4 


O1F5 
O1F6 
O1F7 


01F8 
01F9 
O1FA 
O1FB 
O1FC 
O1FD 
O1FE 


O1FF 
0200 


0201 
0202 
0203 


0204 
0205 
0206 
0207 
0208 
0209 
020A 


MSB12 


C24B 


* 


MSB14 


B OUTPUT 


LAC N10 
ADD CEXP 
SACL CEXP 


LAC CLO,10 
SACL CLO 
SACH CTEMP 
LAC CTEMP 
AND MO3FF 
ADD CHI,10 
SACL CHI 


B OUTPUT 


LAC C24,2 
SACH C37 
SACH C38 


LAC C37 
BZ C38B 


AND M0002 
BZ MSB14 


LAC Nill 
ADD CEXP 
SACL CEXP 


LAC CLO,11 
SACL CLO 
SACH CTEMP 
LAC CTEMP 
AND MO7FF 
ADD CHI,11 
SACL CHI 


B OUTPUT 


LAC Nl2 
ADD CEXP 
SACL CEXP 


LAC CLO,12 
SACL CLO 
SACH CTEMP 
LAC CTEMP 
AND MOFFF 
ADD CHI,12 
SACL CHI 


* LEFT SHIFT OF 10 * 


* LEFT SHIFT OF 11 * 


* LEFT SHIFT OF 12 * 
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0792 
0793 


0794 
0795 
0796 
0797 


0798 
0799 
0800 
0801 
0802 
0803 
0804 
0805 
0806 
0807 
0808 
0809 
0810 
0811 


0812 
0813 
0814 
0815 
0816 
0817 
0818 
0819 
0820 
0821 
0822 
0823 
0824 
0825 


0826 
0827 
0828 
0829 
0830 
0831 
0832 
0833 


0834 
0835 
0836 
0837 
0838 
0839 


0840 
0841 
0842 


020B 
020C 


020D 
020E 
020F 
0210 


0211 
0212 
0213 


0214 
0215 
0216 
0217 
0218 
0219 
021A 


021B 
021C 


021D 
O21E 
021F 


0220 
0221 
0222 
0223 
0224 
0225 
0226 


0227 
0228 


0229 


022A 
022B 


022C 
022D 
022E 


022F 
0230 


0231 
0232 
0233 


0234 
0235 


C38B 


* 


MSB16 


B OUTPUT 


LAC C38 
AND M8000 
BZ MSB16 


LAC N13 
ADD CEXP 
SACL CEXP 


LAC CLO,13 
SACL CLO 
SACH CTEMP 
LAC CTEMP 
AND M1FFF 
ADD CHI,13 
SACL CHI 


B OUTPUT 


LAC N14 
ADD CEXP 
SACL CEXP 


LAC CLO,14 
SACL CLO 
SACH CTEMP 
LAC CTEMP 
AND M3FFF 
ADD CHI,14 
SACL CHI 


B OUTPUT 


LAC CLO,8 


SACH Cll 
SACL C12 


LAC Cll1,12 
BZ C12BP 
SACH C21] 
SACL Cl2 
LAC C21,14 
BZ C22BP 


SACH C31 
SACL C32 
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* LEFT SHIFT OF 13 * 


* LEFT SHIFT OF 14 * 


* CHI IS ZERO * 


* SPLIT THE 16 BIT WORD INTO TWO 


8 BIT PIECES * 


* IF THERE IS A ONE ETC. 


* IF THERE IS A ONE ETC. 


* 


* 
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0843 
0844 
0845 


0846 
0847 
0848 


0849 
0850 
0851 
0852 
0853 
0854 
0855 
0856 
0857 
0858 
0859 
0860 
0861 


0862 
0863 
0864 
0865 
0866 
0867 
0868 
0869 
0870 
0871 
0872 


0873 
0874 
0875 
0876 


0877 
0878 
0879 
0880 
0881 
0882 
0883 
0884 
0885 
0886 
0887 


0888 
0889 
0890 
0891 
0892 
0893 
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0236 
0237 
0238 


0239 
023A 
023B 


023C 
023D 
023E 


023F 
0240 
0241 
0242 
0243 
0244 


0245 
0246 


0247 
0248 
0249 


024A 
024B 
024C 
024D 


O24E 
024F 


0250 
0251 
0252 
0253 


0254 
0255 
0256 


0257 
0258 
0259 
025A 


025B 
025C 


025D 
O25E 
O25F 


0260 


* 


MSB18 


* 


C32BP 


* 


MSB20 


LAC C31 
BZ C32BP 


AND M0002 
BZ MSB18 


LAC N15 
ADD CEXP 
SACL CEXP 


LAC CLO,15 
SACH CHI 
SACL CLO 
LAC CHI 
AND M8000 
SACL CHI 


B OUTPUT 


LAC N16 
ADD CEXP 
SACL CEXP 


LAC CLO 
SACL CHI 
ZAC 

SACL CLO 


B OUTPUT 


LAC C32 


AND M8000 
BZ MSB20 


LAC N17 
ADD CEXP 
SACL CEXP 


LAC CLO, 1 
SACL CHI 

ZAC 

SACL CLO 


B OUTPUT 
LAC N18 

ADD CEXP 
SACL CEXP 


LAC CLO, 2 


* IF THERE IS A ONE ETC, * 


* LEFT SHIFT OF 15 * 


* LEFT SHIFT OF 16 * 


* LEFT SHIFT OF 17 * 


* LEFT SHIFT OF 18 * 
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0894 
0895 
0896 
0897 
0898. 


0899 
0900 
0901 
0902 
0903 
0904 
0905 


0906 
0907 
0908 


0909 
0910 
0911 
0912 
0913 
0914 
0915 
0916 
0917 
0918 
0919 


0920 
0921 
0922 
0923 
0924 
0925 
0926 
0927 
0928 
0929 
0930 


0931 
0932 
0933 
0934 


0935 
0936 
0937 
0938 
0939 
0940 
0941 
0942 
0943 
0944 


0261 
0262 
0263 


0264 
0265 


0266 
0267 
0268 


0269 


026A 


026B 


026C 
026D 
026E 


026F 
0270 
0271 


0272 
0273 
0274 
0275 


0276 
0277 


0278 
0279 
O27A 


027B 
027C 
027D 
O27E 


027F 
0280 


0281 
0282 
0283 
0284 


0285 
0286 
0287 


0288 
0289 
028A 
028B 


C22BP 


MSB22 


x 


C34BP 


SACL CHI 
ZAC 
SACL CLO 


B OUTPUT 


LAC C22,2 
SACH C33 
SACH C34 


LAC C33 
BZ C34BP 


AND M0002 
BZ MSB22 


LAC N19 
ADD CEXP 
SACL CEXP 


LAC CLO, 3 
SACL CHI 

ZAC 

SACL CLO 


B OUTPUT 


LAC N20 
ADD CEXP 
SACL CEXP 


LAC CLO,4 
SACL CHI 

ZAC 

SACL CLO 


B OUTPUT 


LAC C34 
AND M8000 
BZ MSB24 


LAC N21 
ADD CEXP 
SACL CEXP 


LAC CLO,5 
SACL CHI 

ZAC 

SACL CLO 
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* BRANCH IF NO ONE * 


* LEFT SHIFT OF 19 * 


* LEFT SHIFT OF 20 * 


* LEFT SHIFT OF 21 * 
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0945 


0946 
0947 
0948 
0949 
0950 
0951 
0952 
0953 
0954 
0955 


0956 


0957 
0958 
0959 
0960 
0961 
0962 
0963 


0964 
0965 
0966 
0967 
0968 


0969 
0970 
0971 


0972 
0973 
0974 
0975 
0976 
0977 
0978 
0979 
0980 
0981 
0982 


0983 
0984 
0985 
0986 
0987 
0988 
0989 
0990 
0991 
0992 
0993 


0994 
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028C 
028D 


028E 
028F 
0290 


0291 
0292 


0293 
0294 


0295 
0296 


0297 
0298 
0299 


029A 
029B 
029C 
029D 
O29E 


029F 
02A0 
O2Al 


Q2A2 
02A3 
02A4 


O2A5 
O2A6 
O2A7 


02A8 
02A9 
O2AA 
02AB 


O2AC 
O2AD 


O2AE 
O2AF 
02B0 


02Bl 
02B2 
02B3 
02B4 


02B5 
02B6 


MSB24 


C12BP 


* 
MSB26 


B OUTPUT 


LAC N22 
ADD CEXP 
SACL CEXP 


LAC CLO,6 
SACL CHI 

ZAC 

SACL CLO 


B OUTPUT 
LAC Cl2,4 


SACH C23 
SACL C24 


LAC C23,14 


BZ C24BP 


SACH C35 
SACL C36 


LAC C35 
BZ C36BP 


AND M0002 
BZ MSB26 


LAC N23 
ADD CEXP 
SACL CEXP 


LAC CLO, 7 
SACL CHI 

ZAC 

SACL CLO 


B OUTPUT 


LAC N24 
ADD CEXP 
SACL CEXP 


LAC CLO,8 
SACL CHI 

ZAC 

SACL CLO 


B OUTPUT 


* LEFT SHIFT OF 22 * 


* “* COMPLETES TOP 8 “* * 


* LEFT SHIFT OF 23 * 


* LEFT SHIFT OF 24 * 
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0995 
0996 
0997 


0998. 


0999 
1000 
1001 
1002 
1003 
1004 
1005 


1006. 


1007 


1008 
1009 
1010 
1011 
1012 
1013 
1014 
1015 
1016 
1017 
1018 


1019 
1020 
1021 
1022 
1023 
1024 
1025 


1026 
1027 
1028 


1029 
1030 
1031 
1032 
1033 
1034 
1035 
1036 
1037 
1038 
1039 


1040 
1041 
1042 
1043 
1044 
1045 


02B7 
02B8 
02B9 
O2BA 


02BB 
02BC 
02BD 
02BE 
O2BF 
02Cc0 
02C1 


02C2 
02C3 


02C4 
02C5 
02C6 


02C7 
02C8 
02C9 
O2CA 


02CB 
02CC 


02CD 
02CE 
02CF 


02D0 
02D1 
02D2 


02D3 
02D4 
02D5 


02D6 
02D7 
02D8 


02D9 
O2DA 
02DB 
O?DC 


02DD 
O2DE 


O2DF 
02E0 
02E1 


02E2 


C36BP 


* 
MSB28 


* 


C24BP 


MSB30 


LAC C36 
AND M8000 
BZ MSB28 


LAC N25 
ADD CEXP 
SACL CEXP 
LAC CLO,9 
SACL CHI 
ZAC 

SACL CLO 


B OUTPUT 


LAC N26 
ADD CEXP 
SACL CEXP 


LAC CLO,10 
SACL CHI 
ZAC 

SACL CLO 


B OUTPUT 


LAC C24,2 
SACH C37 
SACL C38 


LAC C37 
BZ C38BP 


AND M0002 
BZ MSB30 


LAC N27 
ADD CEXP 
SACL CEXP 


LAC CLO,11 
SACL CHI 
ZAC 

SACL CLO 


B OUTPUT 
LAC N28 

ADD CEXP 
SACL CEXP 


LAC CLO,12 
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* LEFT SHIFT OF 25 * 


* LEFT SHIFT OF 26 * 


* LEFT SHIFT OF 27 * 


* LEFT SHIFT OF 28 * 
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1046 02E3 
1047 0254 
1048 02E5 
1049 
1050 O2E6 
O2E7 
1051 
1052 
1053 


1054 


1055 
1056 
1057 
1058 
1059 
1060 
1061 
1062 
1063 
1064 
1065 


02E8 
02E9 
O2EA 
O2EB 


02EC 
02ED 
02EE 


O2EF 
02F0 
O2F1 
O2F2 


O2F3 
O2F4 
1066 
1067 
1068 
1069 
1070 
1071 
1072 
1073 
1074 
1075 
1076 
1077 
1078 
1079 
1080 
1081 
1082 
1083 


O2F5 
O2F6 
O2F7 


O2F8 
Q2F9 
O2FA 
O2FB 


O2FC 
O2FD 
O2FE 
O2FF 


0300 
0301 
1084 
1085 
NO ERRORS, 
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C38BP 


* 


MSB32 


* 
* 
x 


490B OUTPUT 
4908 
4909 


490A 


F900 
0300' 


SELF 


* 


NO WARNINGS 


SACL CHI 
ZAC 
SACL CLO 


B OUTPUT 


LAC C38 
AND M8000 
BZ MSB32 


LAC N29 
ADD CEXP 
SACL CEXP 


LAC CLO,13 
SACL CHI 
ZAC 

SACL CLO 


B OUTPUT 


LAC N30 
ADD CEXP 
SACL CEXP 


LAC CLO,14 
SACL CHI 
ZAC 

SACL CLO 


OUT CSIGN,PA1 
OUT CEXP,PA1 
OUT CHI,PA1 
OUT CLO,PA1 


B SELF 


END 


* LEFT SHIFT OF 29 * 


* LEFT SHIFT OF 30 * 
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Digital Signal Processing Routines 


7. Floating-Point Arithmetic with the 
TMS32020 


Charles Crowell 
Digital Signal Processing - Semiconductor Group 
Texas Instruments 
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INTRODUCTION 


The TMS32020 Digital Signal Processor is a fixed-point 
16/32-bit microprocessor. However, it can also perform 
floating-point computations at a speed comparable to some 
dedicated floating-point processors. 

The purpose of this application report is to analyze an 
implementation of floating-point addition, multiplication, and 
division on the TMS32020. The floating-point single- 
precision standard proposed by the IEEE will be examined. 
Using this standard, the TMS32020 performs a floating-point 
multiplication in 7.8 microseconds, a floating-point addition 
in 15.4 microseconds, and a floating-point division in 22.8 
microseconds. 

To illustrate floating-point formats and the tradeoffs 
involved in making a choice between different floating-point 
formats, a review of floating-point arithmetic notation and 
of addition, multiplication, and division algorithms is first 
presented. 


FLOATING-POINT NOTATION 


The floating-point number f may be written in floating- 
point format as 


f = mxbé 
where 
m = mantissa 
b = base 
€ = exponent 


For example, 6,789,320 may be written as 


0.6789320 x 107 


In this case, 
m = 0.6789320 
b = 10 
e = 7 


The two floating-point numbers f; and f2 may be written as 


fy 
f2 


m, xbel 
mz x be2 


Floating-point addition/subtraction, multiplication, and 
division for f; and fz are defined as follows: 
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f) xf2 = (mj tmp xb~ ©1—©2)) x bel if ey =e (1) 


Or 


(m, xb~ 2-0) + my) x be2 if ey <e2 


my X my x C1 +2) (2) 


f; x f2 


f,/f2 = (my/mg) xb &1-€2) (3) 


A cursory examination of these expressions reveals 
some of the factors involved in the implementation of 
floating-point arithmetic. For addition, it is necessary to shift 
the mantissa of the floating-point number which has the 
smaller exponent to the right by the difference in the 
magnitude of the two exponents. This is shown in the 
multiplication by the terms 


b~ (1-2) and b~2-en) 


This right shift can result in mantissa underflow. There 
are also possibilities for mantissa overflow. Addition and 
subtraction of exponents can lead to exponent underflow and 
overflow. To alleviate underflow and overflow, it is 
necessary to decide on some scheme for roundoff. For a 
detailed description and analysis of underflow and overflow 
conditions and rounding schemes, see reference 1. 

It is desirable to have all numbers normalized, i.e., the 
mantissas of f; and f2 have the most significant digit in the 
leftmost position. This provides the representation with the 
greatest accuracy possible for a fixed mantissa length. The 
result of any floating-point operation must also be 
normalized. The factors associated with normalization, 
overflow, and other characteristics of floating-point 
implementations are best illustrated with a few examples. 

Consider the addition of two binary floating-point 
numbers f; and fg where 


0.10100 x 2011 
0.11100 x 2001 


f} 
f2 


Both of these numbers are normalized, i.e., the first 
bit after the binary point is a 1. Addition requires equal 
exponents, so the fractions are aligned by shifting right the 
one with the smaller exponent and adjusting the smaller 
exponent. This yields 


0.00111 x 2011 


f2 
Then, 


0.10100 x 2011 +0.00111 x 2011 
0.11011 x 2011 =f, 


Hl 


fj +f2 
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The sum may overflow the left end by one digit, thus 
requiring a postaddition adjustment or renormalization step. 
Since it is assumed that the register is only of a finite length, 
this renormalization will result in the loss of the lowest order 
bit. 

Another example illustrates the overflow past the most 
significant bit. With an assumed register length of five, let 


0.11100 x 2011 
0.10101 x 2001 


f} 
fy 


Then, 
0.11100 x20il=f; 
+ 0.0010101 x2011 =f, 
1.0000101 x 2011 =f; 

The significance of the two digits underlined in the right 
part of the mantissa is suspect, since it is assumed that the 
corresponding bits of f, are zero. The left underlined digit 
is the overflow past the most significant bit. To finish the 


addition, f3 is shifted to the right and the exponent adjusted 
accordingly. Thus, 


1.0000101 x 2011 = £3 


The shift of the fraction and the adjustment of the exponent 
yield 


0.10000101 x 2100 = fz 
The result may be rounded, giving 
0.10001 x 2100 = f, 
or truncated, giving 
0.10000 x 2100 = f3 
FLOATING-POINT ALGORITHMS 
Multiplication Algorithm 
The algorithm for normalized floating-point 
multiplication is illustrated in Figure 1. This algorithm is an 
implementation of Equation 2 in the section on floating-point 
notation. The floating-point numbers being multiplied are A 
and B written as 
A = ma XbA and B = mpxbB 
The result is 
C = mcxb 


For the resulting mc, there are three special cases. The 


mc may be zero, in which case there is a branch to Step — 


10 to set C =0. If mc #0, then the most significant bit will 
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mA mB 


1 MULTIPLY MANTISSAS 
mo = ma < mg 


CA ep 
ADD EXPONENTS 
2 ec = ea + OB 
3 TEST FOR SPECIAL CASES OF mc 
ZERO 


NORMALIZED 


7 
OVERFLOW 


NO OVERFLOW 


RIGHT SHIFT mc ONE BIT 
ec = ec! 


TEST FOR SPECIAL CASES OF ec 


OVERFLOW 
OR 


UNDERFLOW 


10 | SET SPECIAL VALUES OF RESULT 


C=Ax8B 


Figure 1. Floating-Point Multiplication 


be in either the first or second leftmost bit. If the most 
Significant bit is in the second leftmost bit, then a left shift 
of mc is necessary (see Step 5). Otherwise, C is already in 
normalized form, and there is a branch to Step 6. 

In Step 6, the desired rounding scheme is implemented. 
After this rounding, it is possible that mc will overflow (see 
Step 7). In this case, it is necessary to right-shift mc one 
bit (see Step 8). Special cases of ec , are tested for in Step 9. 


7. Floating-Point Arithmetic with the TMS32020 


If there is an overflow or underflow of ec, it is corrected 
in Step 10. Otherwise, the result is in range, and the 
calculation is complete. 


Addition Algorithm 

The implementation of normalized floating-point 
addition is more involved than for multiplication. This 
addition algorithm, outlined in Figure 2, is an implementation 
of Equation 1 in the section on floating-point notation. 

In Step 1, eq and eg are compared to determine ec. 
For this illustration of the algorithm, it is assumed that 
ea sep. The right shift (d) required to align mag is 
determined in Step 2. The procedure in Step 3 implements 
the right shift of ma. In Step 4, the extra bits of ma are 
discarded by using the desired rounding technique. The 
mantissas of A and B are then added in Step 5S. 


A 8 


[amram eesti Tae 
» [earcr oro ono | 
ad 
oo 
« [owns nema 
« [erro cs orm 


k LEADING ZEROS 


NORMALIZED 


RIGHT SHIFT mc ONE BIT 
ec =ec +1 


LEFT SHIFT mc k BITS 
ec = ec - k 


DISPOSE OF EXTRA BITS: 
ROUNDING OR TRUNCATION 


TEST FOR OVERFLOW OF mc 


NO OVERFLOW OVERFLOW 


RIGHT SHIFT mc ONE BIT 
ec =ec + 
TEST FOR SPECIAL CASES OF ec 


12 


OVERFLOW 
oR 
UNDERFLOW 


14 | SET SPECIAL VALUES OF RESULT 


C=A+8B 


Figure 2. Floating-Point Addition 
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- 


Now, the procedure becomes somewhat more involv- 
ed. The mc may be zero, in which case there is a branch to 
Step 9 which sets ec =0; a branch to Step 14 sets the special 
value of the result. The mc may overflow, making a right 
shift of one necessary (see Step 7). The mc may have k 
leading zeroes; therefore, a left shift of k is required. This 
normalization step is generally the most involved and time- 
consuming step to perform. The procedures in Steps 10, 11, 
and 12 round the mc, test for a possible overflow due to 
the rounding, and adjust ec accordingly. The special case 
of ec is determined in Step 13. Finally, after Step 14, the 
sum C = A + Bis formed. 


Division Agorithm 

Floating-point division is more sophisticated than 
multiplication and addition since fixed-point processors such 
as the TMS32020 are not inherently capable of performing 
division. For example, 1/3 = 0.3333...; only an approx- 
imation can be calculated since 1/3 must be represented in 
a finite number of terms. Several algorithms can be im- 
plemented to find good approximations of such numbers. The 
algorithm implemented in this report is shown in Figure 3. 

Step 1 shows the equivalent of A/B. In Step 2, the latter 
term is expanded using a power series of 1/(1 + X), where 
€ (BLO/BHI) is X (e simply denotes that the term is right- 
shifted 16 bits forming the least significant bits of a 32-bit 
number). The third term in the power series only affects the 
LSB of a 32-bit result; therefore, this term and all the 
following terms can be dropped, as shown in Step 3. 

The equation in Step 3 can be implemented on the 
TMS32020 in two steps. Assuming that the result is a 32-bit 
number Q and that it is composed of a 16-bit QHI and a 16-bit 
QLO, think of the equation in Step 3 in the following 
manner: A/B = Q — eX. The first term is a fair approx- 
imation of the result Q, and the second term is a correction 
term to obtain a better approximation. With this in mind, 
it can be shown that (AHI + eALO)/BHI will give a 16-bit 
quotient and a 16-bit remainder. Due to the architecture of 
the TMS32020, the 16-bit quotient will be in the low word 
of the accumulator and the remainder will be in the high word 
of the accumulator after the division. Since it is desirable 


A divided by B 


where A = AHI + €ALO 
B = BHI + €BLO 
€ = 1 et 
QWORDSIZE 216 


step 1. AHI+ €ALO _ AHI + €ALO 1 
BHI + €BLO BHI BLO 
1+ € eaeg 
BHI 
AH! + €ALO BLO BLO 42 
TEP 2: age capaci 1- 2 a 
, BHI ( €(-ma-) + ©? (Ga) ~) 
STEP 3: ~ 


AHI + EALO € ( BLO ) ( AHI + EALO ) 


BHI BHI BHI 


Figure 3. Division Equation 
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to have a floating-point result, the remainder must be divid- 
ed by BHI to obtain the low word of the quotient. Now QHI 
and QLO have been calculated. When placing Q into the cor- 
rection term (equation in Step 3), note that Q is equal to QHI 
+ QLO. It can be shown that QLO will have no effect on 
the result since the correction term is multiplied by e. 
Therefore, to calculate A divided by B, simply implement 
the following equation: 

A, A__, (310 


x gut) 
B BHI BHI 


where the division is fixed binary (left-shifts and subtracts). 

Figure 4 shows the implementation of the division 
algorithm that was outlined in Figure 3. 

In Step 1, the dividend is right-shifted four times to 
prevent an overflow. Note that the result is not shifted left 
to compensate for this shift, because the normalization routine 
automatically does this. The shift causes the dividend to be 
limited to 27 significant bits instead of 31. In Step 2, a binary 
divide (left-shifts and subtracts) is implemented on the 
dividend by the high 16 bits of the divisor. The 32-bit result 
contains a quotient in the low 16 bits of the accumulator, 
and a remainder (R1) in the high 16 bits of the accumulator. 
R1 is left-shifted fifteen places in Step 3. The new R1 is 
divided by BHI in Step 4 to calculate the lower 16 bits of 
the quotient. 

The quotient has now been approximated. The 32-bit 
result is composed of QHI and QLO, as shown in Figure 3. 
To obtain a better approximation, one term in the power 
series expansion must be added to the quotient. Therefore, 
the procedure in Step 5 calculates a 16-bit correction term, 
which is then added (or subtracted since it is the term 
following the ‘‘1”’ in the power series) to the 32-bit quotient. 

Testing for an overflow of the resulting mantissa is 
necessary. Since the dividend was left-shifted four places, 
the resulting quotient will not be negative if an overflow 
occurred. To detect an overflow, bit 28 in the quotient must 
be tested. If this bit is a 1, an overflow occurred; if it is a 
0, no overflow occurred. If an overflow has occurred, the 
exponent must be incremented. Finally, it is necessary to 
normalize the quotient and output the results. 
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A DIVIDED BY B 


Ww 
~ 
_ 
a 
= 
a 
o 


WHERE A= 


SHIFT “’A’’ RIGHT FOUR TIMES TO 
1 PROTECT FROM OVERFLOW. 


A/BHI = 32-BIT RESULT. 
2 | HIGH 16 BITS ARE REMAINDER #1 (R1). 
LOW 16 BITS ARE HIGH QUOTIENT (QH)). 


‘ SHIFT R1 LEFT FIFTEEN TIMES. 
EQUIVALENT TO R1 x 215. 


R1/BHI = 32-BiT RESULT. 
4] HIGH 16 BITS ARE REMAINDER #2 (R2). 
LOW 16 BITS ARE LOW QUOTIENT (QLO). 


MULTIPLY QHI BY BLO AND DIVIDE BY BHI 


: (QH! x BLO)/BHI = CORRECTION TERM. 


SUBTRACT 16-BIT CORRECTION TERM 
6 FROM 32-BIT QUOTIENT. 
(QHI | QLO) — (0 | CT) = RESULT. 


OVERFLOW NO OVERFLOW 


9 NORMALIZE RESULT. 


10 | OUTPUT Cgign, Coxp. CHI, AND CLO. 


Figure 4. Floating-Point Division 
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IEEE FLOATING-POINT 
SINGLE-PRECISION FORMAT 


Of interest is a set of formats known as the IEEE 
standard. This IEEE recommended format consists of a 
variety of precision formats (single, double, single-extended, 
and double-extended). The IEEE has also proposed several 
techniques for handling special cases such as overflow, 
underflow, + oo, and rounding. For complete details, the 
reader is referred to the proposed IEEE standard.2 

The single-precision format is a 32-bit format consisting 
of a 1-bit sign field s, an 8-bit biased exponent e, and a 23-bit 
fraction f (see Figure 5). The value of a binary floating-point 
number X is determined as follows: 


X = (—1)8 x 2(¢-127) x Lf 


Figure 5. IEEE Floating-Point Single-Precision Format 


The advantage of this format is that it is structured in 
such a way as to provide easy storage and straightforward 
input/output operations on 8-, 16- and 32-bit processors. The 
disadvantage with this format is that the large mantissa will 
generally span several words of memory. 


FLOATING-POINT IMPLEMENTATION 
IEEE Implementation 


The IEEE single-precision format is described here as 
it applies to the addition, multiplication, and division 
algorithms. In these floating-point routines written for the 
TMS32020, all results are truncated to 31 bits to provide 
more flexibility in the user’s development of a rounding 
scheme suitable for his application. The representations of 
+ oo are ignored so that the user can decide how to handle 
these exceptions in a manner that is appropriate for his 
particular application. 


I/O Considerations 

The first consideration is the internal representation of 
the binary floating-point number. If the number is read into 
the TMS32020 as two 16-bit words, some processing is then 
necessary to put the floating-point number into a 
representation which is easier to process. The representation 
used in the TMS32020 programs in the appendices is shown 
in Figure 6. This internal representation may be arrived at 
by a simple manipulation of the IEEE bit fields. For this 
particular algorithm, it is assumed that the floating-point 
number is input to the TMS32020 as the four 16-bit fields 
shown in Figure 6. However, the user can easily supply his 
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own routine to arrive at this format from two 16-bit inputs 
to the TMS32020 where the inputs contain the IEEE single- 
precision format. 

The format in Figure 6 was chosen to minimize the 
execution time of the floating-point addition, multiplication, 
and division routines. The format of the result is shown in 
Figure 7. Notice that it is identical to the format in Figure 
5 except for CLO. CLO has its 16 most significant bits valid 
for both the addition, multiplication, and division routines. 


Normalization 

Since the floating-point routines require normalization, 
a partial binary search algorithm is implemented in the 
addition and division routines in the appendices. To begin 
the normalization routine, note that all mantissas can be 
considered to be positive with the format used for the result 
shown in Figure 7. The binary search for the most significant 
bit (the leftmost 1 since the mantissa is positive) is illustrated 
in Figure 8. 

The first move is to split the result into CHI and CLO. 
If CHI + 0, the most significant bit (MSB) is the CHI; 
otherwise, it is the CLO. For this example, it is in CLO. 


15 1413 121110 9 8 7 6 &§ 43 2 1 0 
ASIGN 
(O iF POSITIVE, 
OR — 1 iF NEGATIVE) 


16 141312 1110 9 8 7 6 & 43 2 1 «0 


AHI 
(NORMALIZED) 


Oo #1 f (most significant 14 bits) 
1§ 14131211109 8 7 6 § 4 3 2 1 9 


ALO O sf {least significant 9 bits) 00000 0 


168 141312 1110 9 8 7 6 &6& 43 2 141 0 


AEXP 


Figure 6. Floating-Point Representation 


16 1413121110 9 8 7 6 § 4 3 2 1 «90 


CSIGN 
(0 OR - 1) 


16 141312 1110 9 8 7 6 & 43 2 1 «0 


CHI 
(NORMALIZED) 


15 1413121110 9 8 7 6 &§ 43 2 1 0 


CLO 


1§ 1413 121110 9 8 7 6 6 43 2 1 #90 


CEXP 


Figure 7. Result Representation 
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31 


000101310034%1000%1 71 «1 


BLZ NOFLOW 
RPT TTEEN 
NORM 


31 


0101003100033 1 900 


y 


CLO 0 


0001010031%10001 1 1 


15 0 


0000000000000000 


GOTO NOFLOW ON OVERFLOW. 
TTEEN = 13, PERFORM 14 ‘‘NORM”’ 


oe ee 0 
0000000000000000 


Figure 8. Partial Binary Search 


The next step is to form a 32-bit result with CLO in the most 
significant word position. It is now possible for the MSB 
to be in the highest bit location since CLO has been left- 
shifted 16 times. If this is the case, an overflow has occurred, 
and the result must be right-shifted once. The normalization 
routine tests this by branching to NOFLOW if the result is 
negative. If the number is not negative, the normalization 
can continue. 

The NORM instruction is used in the repeat mode to 
complete the normalization. Note that this whole 
normalization routine can be replaced by the following two 
instructions: RPTK 29 and NORM. The RPTK instruction 
causes the NORM instruction to be repeated 30 times, thus 
normalizing a 32-bit number. This method is not 
implemented here due to the timing. These two instructions 
always take 31 cycles to normalize a 32-bit number. The 
normalization routine here takes only 22 cycles (worst case) 
for normalizing a 32-bit number. Therefore, if program space 
is more important than timing efficiency, it is best to replace 
the normalization routine with these two instructions. 


Added Precision 


As illustrated in Figure 7, the 16 most significant bits 
of CLO are valid, i.e., C is valid for 31 places beyond the 
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binary point. Oftentimes the user is not as concerned with 
the IEEE standard as in being certain that he has enough 
accuracy for his particular application. Since the TMS32020 
uses 16-bit words, the routines in the appendices implicitly 
maintain a 30-bit mantissa. They also implicitly use a 16-bit 
exponent. If the user desires this added accuracy and dynamic 
range, then it is readily implementable with no additional 
cost in execution time. The normalization for the addition, 
as mentioned previously, operates over the entire 32-bit 
accumulator. For the strict IEEE format, the user will only 
want to normalize over the 25 most significant bits of the 
accumulator. The structure of the normalization routine 
imakes this modification simple. 

The routines in the appendices make no provision for 
the representation of +. © and exponent underflow and 
overflow. The user of the routines should consider the degree 
of significance of these results and the way they should be 
handled for his particular application. Since these routines 
are written to operate at maximum speed, truncation of results 
is used. If the user desires to implement a rounding scheme, 
then he will also need to check for the possibility of overflow 
due to the rounding scheme. This step is shown in the 
multiplication, addition, and division flowcharts (see Figures 
1, 2, and 3). 
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SUMMARY 


The TMS32020 may be used to perform floating-point 
operations with great accuracy, wide dynamic range, and 
high-speed execution. The design engineer has the 
responsibility of deciding what type of floating-point format 
is best for his application. To aid in understanding floating- 
point operations, several examples have been given that 
illustrate the manipulations necessary to implement floating- 
point addition, multiplication, and division algorithms. 
Flowcharts for these algorithms are also included. The 
appendices contain the TMS32020 code for the IEEE 
floating-point single-precision format used in addition, 
multiplication, and division. The addition and multiplication 
routines may also be used without modification to implement 
a format with up to a 30-bit mantissa and a 16-bit exponent 
without any increase in execution time. 
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NOSIDT 


0001 
0002 
0003 
0004 
O00s 
0006 
0007 
0008 
0009 
0010 
0011 
0012 
Oo1sS 
0014 
OO15 
0016 
0017 
0018 
O01? 
0020 
O021 
0022 
0023 
0024 
0025 
00246 
0027 
0028 
0029 
0030 
0031 
0032 
0033 
0024 
0035 
OO34 
0037 
O03 
0039 


0040. 


oo04l! 
0042 
O04 
0044 
o045 
O46 
0047 
O04s 
O04 
0050 
OOS 1 
OO52 
Oo8 Ss 
Om54 
OOS 
0084 


0000 


APPENDIX A 
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### PRERELEASE +++ 


0000 
0001 
O002 
O00S 
O004 
oOoo0s 
OOO 


HNEREH 


* 


we Ke Ke 


HERE 


mw wok we we we ke we oe me me me ee ke ee KK ee me ee ee Oe 


HERE 


* 


we oe ke Ke 


HHEEE 
% 


ASIGN 
AEXF 
AHI 
ALO 
BS IGN 
RE XP 
BHI 


PAGE 0001 
HEELERS HEE HE ESE Et 


THIS IS A FLOATING-POINT ADDITION ROUTINE WHICH 
IMPLEMENTS THE IEEE FROPOSED FLOATING-POINT FORMAT 
ON THE TMS32020. 


me we we ke 


HEHEHE EERE HER 


INITIAL FORMAT (ALL 14-BIT WORDS) 


' ALL 0 OR 1 ' ASIGN (0 OR ~-1) 
101. 15 BITS 1 AHI (NORMALIZED) 
(OO; 9% BITS t--O-! ALO 


H ' AEXF (-127 TO 128) 


TO CORRESPOND WITH IEEE FORMAT, 
INPUT O.1F # 2 #% (E + 1) 
INSTEAD OF 1.F # 2 ##E, AND SUBTRACT i27 FROM E. 


THE FINAL FORMAT IS THE SAME AS THE INITIAL FORMAT 
EXCEPT THAT FOR CLO WE HAVE: 


i 14 BITS i CLO 


ALL 14 BITS OF CLO ARE VALID. ANYTHING PAST THESE HAS 
BEEN TRUNCATED. 


FETE SESE SE TERE EEE HE EE ESE EE EEE EEE EEE EEE ESE EEE EEE HHH HHH HE 


WORST CASE (EXCLUDING INITIALIZATION AND 1/0): # 
15.4 MICROSECONDS. + 
THIS TIMING INCLUDES THE NORMALIZATION. # 
WORDS OF PROGRAM MEMORY: 170 # 

# 


HAGE RHHAE EHH HHH HHE HHH HHHHHHEEHHHHEHHHEHE RHEE 


ADRG 
Eau 
Equ 
FOL 
Eau 
EGU 
EAL! 
Er 


PU &e oh S 
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256 


NOS TOT 


OOR7 
Ooo 
OOa? 
00460 
NOL1 
OO42 
NOLS 
00464 
OO6S 
OO64E 
OO&7 
O04E 
OO 
0070 
0071 
OO72Z 
0073 
0074 
0075 
OO7A 
0077 
oOo7e 


OO79 
O08G 
Oosy 
00n2 
OO8S 
OOB4 
OOS 
OO 
O87 
OOBE 
Oono 
NOVO 
O07 1 
QOV? 
OOPS 
OO94 
OOoFs 
OOV4 
O097 
OOS 


OOUy 


OLoo 


o1o4 
O102 
OLS 
oO1oa4g 
o1os 
OVO4 
C107 
GOLOs 


O10% 


0000 
O00} 
Ooo? 
0004 
oOo05 
OOO4K 
0007 
ooos 
O00? 
Od0A 
OOOR 
OOO 
OoOT 
OOOE 
QOOOF 
NOLO 


OOL 1 
OO 
OOS 
Oo14 
oOo TS 
OOLE 
OO17 


OOLe 
OoL? 
QOLA 
OO TR 
OOTr 
Oo ns 
OOLE 
OOLF 
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#ee PRERELEASE +++ 


0007 
9008 
O00 
OOOAR 
OOOBR 
o000r 
OoOon 
OOOE 
QOOF 
oo10 
OOLL 
QOL? 


reo4 
CEO7 
Shae 
D100 
0200 
CBO7 
BOAO 
babe bab 
81018) 
CAO1L 
4000 
CAL1O 
4010 
CAO 
400F 
CAOL 
4012 


ZOOL 
1005 
4000 
FARO 
oOozTi 
Faso 
OOA7 


CESS 
OOLa 
400% 


SOK 
2000 
4008 
2004 


LO 


BLO 
CSIGN 
CEXF 
CHI 
CLO 

[i 

DINE 
TEMF 
THREE 
SIXT 
RESIO 
TTEEN 
+ 


a ee 


ATR 


EQU 
Emu 
EAU 
EQUI 
Eri 
Et 
EG 


EL 


Etiu 
Eri 
EG 
Eri 


INITIALIZATION 


LOPE 
SXM 
LARF 
iRLE 


RETR 
IN 

LARF 
LARE 
LACK 
SACL 
LACK. 
SACL 
LACE 
SACL 
LACE 
SACL 


BEGIN FLOATING 


LAC 
SLB 
SACL 
BZ 


BLZ 


NEG 
ADD 
SACL 
LT 
LAL 
SACL 
LAL 
SACL 


4 


{ 
ARL, 7200 


: 
¥+, FAO 
Q) 
ARO, 0 
1 

CINE 


AE XP 
BE XF 
Ui 

AEB 


ALTER 
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PAGE 0002 


BEGIN ON FAGE 4. 
SET SIGN EXTENSION. 


CLEAR EXPONENT REGISTER. 


ONE = i 


POINT ADE 


FIND LARGEST NUMBER. 


IF EXPONENTS ARE THE SAME, JUMP TO AENIE, 


IF A TS LESS THAN B, JUMP TO ALTE. 


I= (14-D) 


4 15 LARGER THAN &, 
THEREFORE, CSIGN = ASSIGN. 
ALTON THE B MANTISSA, 
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NOSIDT 


O11) 
O1it 
0112 
O1123 
0114 
O115 
01146 
0117 
O11 
O119 
0120 
O121 


0122 
0123 
O124 
0125 
O124 
0127 
O12S 
o129 
O1L30 
O14 


0132 
0133 


0134 © 


0135 
O1 346 
0137 
0138 
0139 
0140 
0141 
0142 
O143 
0144 
0145 
0146 
0147 
0148 
O149 
0150 
O1S1 
O15? 
OLE3 


0154 


OLS 


O1LSA 
O157 
O15 
O1S9 
0160 
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0020 


O02 1 
O02 
O02 
0024 
0025 
024 
027 
0028 
029 
QOZA 
007K 
002C 


002D 
002E 
OO2F 
0030 
OO21 
0032 
oOos3 


0034 


OoO3 
Oo3D 
OO3E 
OO3F 
0040 
0041 
0042 
0043 
0044 
0045 
0044 
0047 
0048 


0049 
004A 
0048 
O04c 
OoO4p 
OO4E 
OO4F 
0050 
ooh 
0052 
818 bare 
0054 
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A206 
4206 
4011 
4207 
CE18 
£807 
2007 
4011 
46007 
2103 
6003 
FFS0 
0049 


2000 
6008 
2103 
6003 
2107 
4007 
ZOOL 
6009 
FFSO 
0049 


0010 


» 6000 


3COC 
2004 
£003 
2005 
6007 
4202 
4802 
46011 
4203 
CEL 
4303 
2003 
4011 
400% 
2107 
4007 


2000 
1004 
F480 
007A 
Feo 
OosD 
4002 
4903 
4507 
4404 
FLEO 
QO4E 


AEQB 


ALTB 


% 
CHESGN 


BISNEG 


SACL 
LACT 
SFL 
SACH 
LAC 
OR 
SACL 
LAC 
SACL 


LAL 
SUB 
BZ 


BLZ 


ZALH 
ADDS 
SUBS 
SLIBH 
BZ 


BHI 
BHI 
RESID 
BLO 


BLO 
BLO 
RESID 
BLO 
ALG, 1 
ALC 
CHKSGN 


ASIGN 
SIGN 
ALO, 1 
AL 
BLO, 1 
AiO 
AEXP 
CEXF 
CHESIGN 


SIXT 
ig) 

a) 

BS IGN 
IS TGN 
BE XF 
CEXF 
AHI 
AHI 
RESID 
ALO 


ALU 
ALO 
RESID 
ALO 
BLO, 1 
BLO 


ASSIGN 
BSIGN 
ADNOW 


AISNE 


AHI 
ALO 
BL 
BHI 
IZERC 
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BHI IS SHIFTED RIGHT "DL" TIMES. 
RESIDUAL BITS MUST BE MAINTAINEL. 
BLO IS SHIFTED RIGHT "D" TIMES. 

MSB (THE 0) IS SHIFTED AWAY. 

GET BITS THAT WERE SHIFTED FROM BHI. 
“GET RID OF EXTRA BIT. 


DO BOTH NUMBERS HAVE THE SAME SIGN? 


39 
— 
fa) 
= 
W 
D 
ae 
— 
re] 
= 


IF SIGNS ARE THE SAME, CSIG 


ALIGN MANTISSAS. 


SET C EXPONENT = A EXPONENT. 


00 BOTH NUMBERS HAVE THE SAME SIGN? 


0 = (14-D) 


B IS THE BIGGEST NUMBER. 


THEREFORE, LET THE SIGN OF C = BSIGN. 


SET € EXPONENT = B EXPONENT. 
AHI GETS SHIFTED "D" TIMES. 
MAINTAIN EXTRA BITS. 

ALO GETS SHIFTED "bp" TIMES. 
MSB (THE 0) IS SHIFTED AWAY. 
BET RESIDUAL BITS. 


GET RID OF EXTRA BIT. 


CHECK THE SIGNS. 


IF THEY ARE THE SAME, JUST ADL. 


DO ClAL - TBE), 
SINCE RO AND A > Oo. 


257 


258 


NOSIDT 


0161 


01462 
O163 
0164 
O145 
01446 


01467 
01463 
0169 
0170 
O171 


0172 


0173 
0174 
0175 
0176 
0177 


0178 
0179 
O130 
0181 
O182 
O18 
0164 


0135 
0184 
0187 
018s 
01389 


0170 
O191 


0192 
O12 
O194 
0195 
O196 
O197 
O19e 
0199 


0200 


O201 
O20? 
O203 
O204 


QzZ05 


0055 
OOS4 
0057 
oo58 
0059 
ooSA 
OO5B 
OosC 
oosD 
OOSE 
OOSF 
0060 
0041 
QO42 
0063 
0064 
0065 
0044 
00467 
0068 
0069 
QO6A 


O06B 
006C 
OO4D 
OO4E 
OO06F 
0070 
0071 


0072 
0073 
0074 
0075 
OO74K 
0077 
0078 
007% 


00780 
O0O7B 
OO7K 
oo7o 
OO7TE 
QOO7F 
OOO 
O08] 
OOB2 
o083 


OO34 
Oo0s8s 


OOBA 
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F330 
0072 
&30A 
&00B8 
CAaago 
4008 
FFSO 
0084 
4006 
4907 
4503 
4402 
FLE0 
OO4B 
F380 
0072 
480A 
4008 
CANO 
6008 
FFaO 
0084 


CAQ0O 
4009 
4008 
400A 
4008 
FF30 
OOA7 


CE1B 
S50A 
46008 
DOs 
FFFF 
4008 
FFSO 
N04 


4002 
4703 
4907 
43506 
&£8:00 
600B 
FosO 
oo8s 
F&éR0 
OO4B 


S008 
FéaO 
o0en 


AISNEG 


# 
CZERO 


CNEG 


+ 
ADNGW 


NORMAL. 


PAGE 0004 


BLZ CNEG 
SACH CHI 
SACL clo 
ZAC 
SACL CSIGN 
EB NORMAL. GO AND NORMALIZE RESULT. 
ZALH RHI nO (iB: - tAr), 
AbDS BLO SINCE A < Oo AND BQ, 
SUBS ALO 
SUBH AHI 
BZ CZERO 
BLZ CNEG 
SACH CHI 
SACL CLO 
ZAC 
SACL CSIGN 
B NORMAL G50 AND NORMALIZE RESULTS. 
ZAC HERE, ONLY IF RESULT = 0. 
SACL TEXF 
SACL CSIGN 
SACL HI 
SACL CLO 
B AROQUNEIE OUTPUT A ZERO, 
ABS HERE, IF RESULT IS NEGATIVE. 
SACH CHI 
SACL CLA 
LALK >FRFFE 
SACL CSIGN 
NORMAL Gl NORMALIZE RESULT. 
ZALH AHI IF SIGNS ARE THE SAME, IUST ADL. 
ADDS ALO 
ADS BLO 
ADTIH BHI 
SACH CHI 
SACL CLO 
BY OVFELOW DIn AN OVERFLOW OCcuR? 
BZ mZERO TS RESULT = 0 * 
NORMALIZE 
LAL CHI ONES CHI HAVE THE MSE? 
BR? LO} 
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NOSIDT 


0206 
0207 
0208 
0209 
0210 


0211 
0212 
0213 


0214 
0215 
0216 


0217 
0218 
0219 
0220 
0221 

0222 
022%. 
0224 
0225 
0226 
0227 
0228 

0229 
0230 
0231 


O232 
0233 
0234 
0235 
0234 
0237 
0238 
0239 
0240 
O741 
0242 
0242 
0244 
o245 
O244 
0247 
o24e 
O249 
O250 
0251 
O252 
O253 


0254 


0087 
0082 
0089 
008A 
OOSB 
008C 
008D 
O0O08E 
OO8F 
0070 
0091 


0092 


0093 
0094 


0095 
0094 
0097 
0093 
0099 
OO9A 
OO%R 
009C 
oOo?9n 


009E 
OO9F 
00A0 


OOA1L 
DOAZ 
00A2 
00A4 
00AS 
OORE 


OOA7 
OO0AS 
O0OA? 
O0AA 


NO ERRORS, 
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400A 
490B 
4Bi2 
CEAZ 
FF80 
OOAI 
400B LO! 
C010 
F380 
OO9E 
4Biz2 
CEA2 
FF20 
OOAL 


De ke eK eK Ke ke 


CE046 
CE19 
680A 
400B 
2009 
oo0on 
4009 
FFR0O 
OOA7 

+ 

+ 


+ 
S990 NOFLOW 
CEO4 
CE1? 


*k eK Ke OR 


7O0E OUTPUT 
420A 
400B 
2009 
100€ 
L009 


# 
S222 AROUND 


CEIF 
NO WARNINGS 


ZALH 
ADDS 
RPT 
NORM 
B 


ZALH 
LARK 
BLZ 


RFT 
NORM 
B 


CHI 
CLO 
TTEEN 
OUTPLIT 
cLo 
ARO, 14 
NOFLOW 
TTEEN 


QUTPLUT 
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IF YES, NORMALIZE RESULT. 

WILL PERFORM 14 "NORMS" 

GO OUTPUT RESULTS. 

HERE IF CLO HAS MSB. 

OFFSET EXPONENT BY 14. 

DID BIT SEARCH CAUSE OVERFLOW? 

IF NOT, NORMALIZE RESULT. 


GO OUTPUT RESULT. 


FINISHED WITH NORMALIZATION 


HERE GNLY IF OVERFLOW OCCURRED DURING ADDITION 


RSXM 
SFR 
SACH 
SACL 
LAC 
ADD 
SACL 
B 


OVERLOW 


MAR 


RSXM 
SFR 


CHI 
CLO 
CEXP 
ONE 
CEXP 
AROUND 


RESET SIGN EXTENSION TO SHIFT RIGHT. 
SHIFT RIGHT. 
STORE NORMALIZED MANTISSA. 


DECREMENT. EXPONENT. 


GO OUTPUT RESULTS. 


OCCURRED DURING BIT SEARCH 


t— 


DECREMENT EXPONENT. 
RSXM FOR LOGICAL RIGHT SHIFT. 
PERFORM RIGHT SHIFT. 


TAKE CARE OF EXPONENT & NORMALIZED MANTISSA, 
THEN OUTPUT RESULTS. 


SAR 
SACH 
SACL 
LAC 
SUB 
SACL 


LARP 
RPT 
OUT 
IDLE 
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ARO, TEMP 
CHI 

CLO 

CEXP 
TEMP 
EXP 


1 
THREE 
#+ PAO 


HERE AFTER NORMALIZATION. 
SAVE NORMALIZED MANTISSA. 


ADJUST EXPONENT. 


RESET POINTER. 


WAIT FOR INTERRUPT. 
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NOSIDT 


OOO1 
0002 
O00 
0004 
oOoos 
OOOK 
OO07 
0008 
0009 
OO1O 
oor! 
OOLZ 
OO13 
0014 
0015 
OO14 
0017 
oo18 
OO1LY 
0020 
0071 
O07? 
002% 
0024 
0025 
0026 
0027 
0028 
0029 
0030 
0031 
0032 
0033 
QO34 
03s 
0034 
0037 
0038 
0039 
0040 
0041 
0042 
0042 
0044 
0045 
0046 
0047 
0043 
0049 
0050 
0051 
0052 
OOSS 
0054 
0035 
0056 


0000 


APPENDIX B 


32020 FAMILY MACRO ASSEMBLER FCO, 7 £4,346 15:24:53 oO2-77-25 
### PRERELEASE ### 


0000 
0001 
0002 
0003 
0004 
0005 
0004 


FAGE 0001 
HERRERA EEE HEE IESE ER SEE HEE HE HEE HE HE 


THIS IS A FLOATING-POINT MULTIPLICATION ROUTINE WHICH 
IMPLEMENTS THE IEEE PROPOSED FLOATING-POINT FORMAT 
ON THE TMS32020. 


we Kk eK eK & 
mw me eK 


MHEG HEHEHE EHH HHH HHH 


INITIAL FORMAT (ALL 14-BIT WORDS) 


‘ ALL © OR 1 H ASIGN (0 OR -1) 
(Or. iS BITS H AHI (NORMALIZED) 
101 69 BITS t--O-; ALO 


ee ee re re ee ee ee eee ee 


AEXF (-127 TO 125) 


TG CORRESPOND WITH IEEE FORMAT, 
INPUT O.1F # 2 #% (E + 1) 
INSTEAD OF 1.F # 2 #*#E, AND SUBTRACT 127 FROM E. 


THE FINAL FORMAT IS THE SAME AS THE INITIAL FORMAT 
EXCEPT THAT FOR CLO WE HAVE: 


1 16 BITS 1 CLO 


ALL 14 BITS OF CLO ARE VALID. ANYTHING PAST THESE HAS 
BEEN TRUNCATED. 


| 


FETE TE FETE TEE HE HE HE SE HE SE SE SE HE SE Se Se SE SE SE HE SEE HE SE SE SE OE SE SE SE SE SE SE SE IE HE EE ESE HE HEE SE HE SE HE EE et EE EE 


# 

* WORST CASE (EXCLUDING INITIALIZATION AND I/O): 
# 7.3 MICROSECONDS. 

# THIS TIMING INCLUDES THE NORMALIZATION. 

* WORDS OF PROGRAM MEMORY: 40 

# 


we we we Ke OK 


HEHEHE EERE ERE ERE EHH 
*% 


AORG 
ASIGN EGIL 0 
AEXP ERL 1 
AHI EQU 2 
ALO En 3 
BSIGN EQU 4 
BEXP  ERRU 3 
BHI EGU & 
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261 


NOSILT 


OOR7 
OoOSe 
oo0sa? 
0040 
OO41 
00462 
OO4K3 
0044 
NO046S 
0064 
0047 
Q0488 
0069 
0070 
0071 
0072 
0073 
0074 
0075 


0076 
0077 
0O78 
0079 
0080 


0031 
0082 
0083 
0084 
0085 
0086 
0087 
0088 
0089 
0090 
OO91 
0072 
00393 
0094 
0095 
0096 
0097 
0092 
o0ags9 
0100 
0101 
o102 
0103 
0104 
0105 
0104 
0107 
0108 
0109 
0110 
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0000 
0001 
0002 
0003 
0004 
000s 
OO004 
0007 
0008 
0009 
000A 
OOOB 


Q00C 
ooon 
OOOE 


OQOOF 
0010 
OO11 
0012 
0013 


0014 
O015 


0014 
0017 


0018 
0019 
OO1A 


OO1B 
O01 
0010 


32020 FAMILY MACRO ASSEMBLER 


### FRERELEASE #%% 


0007 
0008 
0007 
OOO0A 
OOOB 
O00 
ooon 
OOOE 
OOOF 


C304 
CEO7 
soey 
D100 
0200 
CBO7 
S0A0 
C000 
55238 
Doo1 
FFFF 
6001 


2001 
0005 
6009 


3C03 
3804 
CE14 
680C 
600E 


3C02 
3807 


CE1iS 
CE1S 


4800 
490E 
6800 


3804 
CE14 
470C 


BLO 
CSIGN 
CEXF 
CHI 
CLO 
THI 
NEGONE 
TLO 
TEMP 

# 


koe OK Ok 


Ett 
Ems 
Ei 
EU 
Emu 
Emu 
Emu 
EAU 
Emu 


INITIALIZATION 


LOPK 
SSOXM 
LARP 
LRLK 


RPTE 
IN 

LARK 
LARP 
LALK 


SACL 


4 


1 
AR1, >200 


7 

#+ FAQ 
ARO, O 
Q 
>FFFF 


NEGONE 


POO.7 34, 348 


PAGE 0002 


BEGIN ON PAGE 4. 
SET SIGN EXTENSION. 


READ NUMBERS INTO BLOCK BO. 


CLEAR EXPONENT REGISTER. 


NEGONE = -1 


BEGIN FLOATING-POINT MULTIPLICATION. 


LAC 
ADD 
SACL 


LT 
MPY 
PAC 
SACH 
SACL 


LT 
MPY 


AFAC 
APAC 


ADDH 
ADDS 
SACH 


MPY 
PAC 
ADDS 


AE XP 
BE XP 
CEXP 


ALO 
BHI 


THI 
TLO 


AHI 
BLO 


THI 
TLO 
THI 
BHI 


THI 


ADL EXPONENTS. 


FIRST PRODUCT 


SECOND PRODUCT 


(ALD * BHI) 


(AHT #* BLO) 


HAS EFFECT OF (AHI # BLO + ALO *# BHI) # 2 ## -15. 


(AHI * BHI) 
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NOSIDIT 


O111 
0112 
0113 
0114 


0115 
0114 
0117 


0118 
0119 
0120 
0121 
0122 
0123 
0124 
0125 
0124 
0127 
0123 
0129 
01230 
0131 


0132 
0133 
0134 


0135 
0136 
0137 
0132 
0139 
0140 
NC ERR 
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OO1E 
OO1LF 


0020 
0021 
OO2Z2 
0023 
0024 
0025 


0026 
0027 
0028 
0029 
002A 
002B 
002c 
0020 
O02E 


OO2F 
0030 
0081 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0029 
003A 
003B 
0030 
ORS, 
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### PRERELEASE #+% 


620A 
610B 


F580 
0024 
CA0O 
6009 
FFSO 
002F 


400A OK 
490B 
CEA2 
680A 
4£00B 
7OOF 
2009 
100F 
4009 
+ 
4100 SETSIN 
4004 
F580 
0037 
CAQ0 
60028 
FF80 
0039 
200D NEG 
6008 
3587 OUTPUT 
CHO3 
EOAO 
CE1F 
NQ WARNINGS 


SACH 
SACL 


BNZ 


CHI, 1 
PLO,1 


OK 


CEXF 
SETSIN 


CHI 
CLO 


CHI 
CLO 

ARO, TEMP 
CEXP 
TEMP 
CEXP 


ASIGN 
BSIGN 
NEG 


CSIGN 
OUTPUT 


NEGONE 
CSIGN 
1 
3 
#+, PAO 


FCO.7 34.348 {Si S4253 O3-27-85 


PAGE 0003 


GET RID OF EXTRA SIGN BITS. 


IS RESULT ZERO? 


NORMALIZE AND WRAP UF. 


WHAT IS SIGN OF RESULT? 


OUTPUT RESULTS. 
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NOSIDT 


0001 
0002 
O003 
0004 
O00 
OOO 
0007 
OOOE 
0oo0oyY 
OOLO 
OO14 
OO1L2 
0O13 
0014 
0015 
OO146 
0017 
0018 
O01? 
0020 
OOF! 
OOZ2 
OOZS 
OO24 
oozE 
N02Z6 
O027 
O28 
OO29 
0030 
0031 
0032 
0033 
0034 
0035 
00346 
0037 
QO38 
0039 
0040 
0041 
0042 
004% 
0044 
0045 
0046 
0047 
0048 
0049 
0080 
OOS1 
oOosz 
0053 
O04 
oos5 
OOS 


OOOO 
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APPENDIX C 


##% PRERELEASE ### 


0000 
0001 
0002 
OO0O3 
0004 
0005 
0006 


PLO.7 84, 245 LS Shi 17 


O3-27-H5 


FAGE OOO1 


HHRHKHHHHHHHKHHAHEHHHRHHHHHHRAHHHEHHEHHHHHAHHRHHHHHHHHHHH 


we ee OK 


THIS IS A FLOATING-POINT DIVISION ROUTINE WHICH 


og 
*% 


IMPLEMENTS THE IEEE PROPOSED FLOATING-FOINT FORMAT # 


DON THE TMEs2020, 


+ 
+ 


HEHEHE RHR HHHHHHHHEHHHEHHHHHHHH EH 


mw owe me mw Ke we me Ke Ke Ke KK em KO Kea 


INITIAL FORMAT (ALL 146-BIT WORDS) 


ee ee em ee ee cee cee re ee ee ee ee ee ees ee ees 


; ALL OOR1 } 
Of. IS BITS 3 AHI 
101 9 BITS {--0-! ALO 


ee ee 


H ' AE XF 


ASSIGN (0 OR -1) 


(NORMAL IZED) 


(<127 TU: 128) 


TO CORRESPOND WITH IEEE FORMAT, 


INPUT O.1F # 2 ## (E + 1) 


INSTEAD OF 1.F * 2 ##E, AND SUBTRACT 127 FROM E. 


THE FINAL FORMAT IS THE SAME AS THE INITIAL FORMAT 


EXCEPT THAT FOR CLO WE HAVE: 


' 14 BITS ' CLO 


ee ee ee ere eee ee ee re ee ee ee ee ee oe 


ALL 14 BITS OF CLO ARE VALID. 


BEEN TRUNCATED. 


ANYTHING PAST THESE HAS 


FHL SEES EEE EEE HEE ESE EER EH Ht 


mK me Ke Ke & * 


WORST CASE (EXCLUDING INITIALIZATION AND 1/0): 


22.8 MICROSECONDS. 


THIS TIMING INCLUDES THE NORMALIZATION. 


WORDS OF PROGRAM MEMORY: %2 


wok OK oe ok 


HHH HEHEHE HHEHHEEEHHRHHEEHHHHHHHHHHHHHEH HE 


+ 


AS IGN 
AE XP 
AHI 
ALO 
BSIGN 
BE XP 
BHI 


AORG QO 
Eu 8) 
EMU 1 
Em 2 
EQU 3 
Eau 4 
EQU ra 
Eau 6 
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NOSIDVT 


ona 7 
Ooss 
OOeS 
O60 
OG41 
OO62 
O63 
OO44 
Oo6S 
O64 
QOO47 
OO48 
Ca? 
OO7G 
C07 i 
QO72 
O73 
0074 
907% 
OOS 
OO77 
OO7s 
OO7 2 
0080 
O081 
OO? 
OOM 
Oo8g 


OoOns 


oor 
OOS7 
OOS 
OO 
OOVO 


OOF 1 


Gove 


ooy7s 
oOoUg 
OCIS, 
OO A, 
Oe? 
Ove 
Oe 
O,oOo 
OVO 
OOS 
O1as 
OLo4 
o105 
MLO 
107 
Ovoes 


oOLoy 


0000 
0004 
Ooa? 
Oo003 
o004 


000s £ 
O004 F 
Q007 © 


OOO 
Q00 
OODA 
OOOB 
OoOoC: 
QOOT 
QODE 
QOOF 
OOLO 
ood 
Oot 
oo1s 
elen es 
OLS 
OOLA 
O17 
OOS 
oot 
OOLA 
OOVB 
OG1c 


a2020 FAMILY MACRO ASSEMBLER 


### FRERELEASE i+ 


O07 
OOS 
OOO 
OOO 
NOOR 
OOOr 
Oocogr 
ONOE 
OOOF 
oOo10 
OO 
OO12 
OOS 
OO14 
OO1S 
OOLE 
OOL7 
OOS 
oOo. 


Ce04 
CEO7 
S389 
D100 
O200 


Cooo 
Doo, 
FFFF 
4000 
Dao, 
{ooo 
£014 
CAQ4 
E00E 
CAG 1 
GOOLE 
CAO 
46016 
CAQE 
6017 
CALE 
A048 
CAaon 
GOL? 
CAO 


S00 


BLO 

mS TGN 
EXP 
THI 
Cha 
NEGONE 
TEMF 
FOUR 
iM 


oe 
M1000 
(NE 
THREE 
FITEEN 
THIRTY 
TTEEN 
# 


kK OR Ke x 


Emu 
EW 
Emu 
Emu 
EG 
Er! 
Er 
Eu 
EG 
EG 
Emu 
EG 
eu 
Ett 
EG 
Er 
Er 
EG 
EGiu 


ora ns a | 


~ 
—“ Sat) 
paler 


INITIALIZATION 


LOFE 
XM 
LARF 
LAL 


RETE 
IN 

LARF 
LARE: 
LALK. 


SACL 
LALK 


SACL 
LACE 
SACL 
LATE 


“SACL 


LACK 
SACL 
LACK 
SACL 
LACE 
SACL 
LACE 
SAL 
Zar. 

SACL 


4 


1 
ARL, 2200 


7 

%+ FAQ 
QO 
ARO, 0 
SFRFFF 


NEGONE 
>1000 


MIOOO 
4 
FOUR 


me 
= 


POO. ? 34, 345 1 


mh 


25217 Ot-27- 


8 ae! Neate athe 2 Ba) ew 


PAGE O002 


BEGIN ON FAGE 4. 
SET SIGN EXTENSION. 


READ NUMBERS INTO BLOCK BO. 


CLEAR EXPONENT REGISTER. 


NEGONE = -1 


M1000 = >1000 


FOUR = 4 


ONE = 1 


THREE 3 


FITEEN = 15 


THIRTY 


NV 


TTEEN = 13 


CLEAR CEXF 
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NOS IDT 


0110 
o1ii 
O112 
0113 
0114 
O115 
O114 


0117 
o112 
O119 
0120 
O121 
O12Z2 
0123 
0124 
0125 
0126 
0127 
0128 
0129 
0130 
O131 
0132 
0133 
0134 
0135 
0136 
01:37 
0138 
O139 
0140 
0141 
0142 
0143 
0144 
0145 
0146 
0147 


0148 
0149 
0150 
o1si1 
0152 
0153 
0154 
O155 
0154 
01357 
01538 
O159 


01460 
0161 
0142 
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oOo1D 
OOLE 
OO1F 
0020 
0021 
0022 


0023 
0024 
0025 
00246 


0027 
0028 
0027 
002A 
0028 
002C 
oo2D 
002E 
002F 


0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0035 
0039 
003A 
OO3B 
003r 
oo2D 
OOZE 
O03F 
0040 
0041 
0042 
0043 
0044 


0045 
0044 
0047 
0048 
0049 
0044 


32020 FAMILY MACRO ASSEMBLER 
##% PRERELEASE ##+# 


2000 
6003 
1004 
F480 
0023 
2000 


4002 
4903 
4B14 
CE1Y 


4B17 
4706 
6811 
&00F 
2F 11 
4517 
4704 
6812 
6010 


3COF 
3307 
CE14 
4B1i7 
4706 
4013 
400F 
4910 
1013 
6008 
680A 
200A 
4E14 
F480 
0041 
2015 
4009 
2001 
1005 
000? 
4009 


200A 
F430 
O004E 
400A 
490B 
4B1i9 


* 


Ok: 


NOOVF 


we Ke we 


NORMAL 


PCO.7 34.248 15225517 03-27-85 
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FINISHED WITH INITIALIZATION 


LAC ASIGN 
SACL CSIGN 
SUB BSIGN 
BZ OK 
LAC NEGONE 
SACL CSIGN 
ZALH AHI 
Anns ALO 
RPT THREE 
SFR 

RPT FITEEN 
SUBC BHI 
SACH Ri 
SACL QM 
LAC R1,15 
RPT FITEEN 
SUBC BHI 
SACH R2 
SACL GL 

LT QM 
MPY BLO 
FAC 

RPT FITEEN 
SUBC BHI 
SACL cL 
ZALH tM 
ADDS OL 
SUB CL 
SACL CLO 
SACH CHI 
LAC CHI 
AND M1000 
BZ NOQVF 
LAC ONE 
SACL CEXP 
LAC AEXP 
SUB BE XP 
ADD CEXP 
SACL CEXP 
NORMAL I ZE 

LAC CHI 
BZ LOI 
ZALH CHI 
Anns CLO 
RPT TTEEN 


CSIGN = ASIGN, IF ASIGN = BSIGN, 


ELSE, CSIGN = -1. 


SHIFT DIVIDEND TO PROTECT FROM OVERFLOW. 


QM = AHI;ALO / BHI, Ri = REMAINDER. 


HIGH ACCUMULATOR RETAINS REMAINDER. 


(Rl # 2##15) / BHI GIVES GL, AND R2. 


COMPUTES (Ri # 2##15) / BHI. 

HIGH ACCUMULATOR RETAINS REMAINDER. 
CORRECTION TERM = (QM # BLO) / BHI. 
COMPLITES (QM # BLO). 


COMPUTES (GM # BLO) / BHI. 


AMI@L - O:CL = CHI'CLO 


DID AN OVERFLOW OCCUR? 
IF NOT, GOTO NOOVF. 
ELSE, INCREMENT CEXP. 


COMPUTE RESULTING EXPONENT. 


DOES CHI HAVE THE MSB? 


IF YES, NORMALIZE RESULT. 


WILL FERFORM 14 "NORMS". 
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##% PRERELEASE ##+# 
PAGE oo04 


O16 OO4R CEAZ NORM . 
O164 0040 FFSO B QUTPLIT 60 OLITPUT RESULTS. 
O04 OG57 
O165 OO4E 400BR Lit ZALH CLO HERE, IF CLO HAS MSR, 
O144 O04F Faso BLZ NOFLOW DIO BIT SEARCH CAUSE OVERFLOW? 
ONS Ooseé 
0147 OO31 4B1% RET TTEEN IF NOT, NORMALIZE RESULT. 
O145 0082 CEA NORM 
O1469 OOF S FSO B OUT RUIT GO GUTPUT RESULT. 
OO54 OO87 
0170 
O171 
O172 
OLy 
0174 
O175 
O174 OOF 
O177 OOF 
OL178 
0179 
OLso 
OLS 
0182 
OLES 
O184 0087 480A OUTPUT SACH CHI SAVE NORMALIZED MANTISSA. 
oO1e5 o0o58 4008 SACL ceo 
N184 OO89 Shay LARF 1 RESET POINTER. 
0187 OO5A 4614 RPT THREE QUTFUT RESULTS, CSIGN, CEXP, CHI, AND CLO, 
Nise ooSB FQAO OUT H+ FAO 
O189 OOD CEIF IDLE WAIT FOR INTERRUPT. 
NO ERRORS, NO WARNINGS 


FINISHED WITH NORMALIZATION 


OVERFLOW OCCURRED DURING BIT SEARCH 


a ee ee 


at 
ui 


FEO4 NOFLOW RSXM RSXM FOR LOGICAL RIGHT SHIFT. 
4 CE19 SFR FERFORM RIGHT SHIFT. 


A 


TAKE CARE OF EXPONENT & NORMALIZED MANTIS54, 
THEN OUTPUT RESULTS. 


a a a ee ee ee) 
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Digital Signal Processing Routines 


8. Precision Digital Sine-Wave Generation with 
the TMS32010 


Domingo Garcia 
Digital Signal Processing - Semiconductor Group 
Texas Instruments 
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INTRODUCTION 


Sine-wave generators are fundamental building blocks of 
signal processing systems which are used in diverse applica- 
tions, such as communication, instrumentation, and control. 
In the past, engineers usually designed these oscillators with 
analog circuitry. Now, however, new high-speed digital 
signal processors like the TMS32010 present designers with 


an alternative that in many cases is superior. The TMS32010. 


provides the speed and accuracy to produce stable, low- 
distortion sine waves over a wide range of frequencies. 

This application report describes two different methods 
for implementing a digital sine wave generator using the 
TMS32010. The first method is a fast direct table lookup 
scheme suitable for applications not requiring extreme ac- 
curacy. The second approach, an enhancement of the first, 
includes linear interpolation to provide sine waveforms with 
a minimum of harmonic distortion. 


DIRECT TABLE LOOKUP METHOD 
The first algorithm is a simple, fast table lookup 
scheme. The sine values for N angles which are uniformly 


spaced around the unit circle are stored in a table which has 
the following format: 


OUTPUT VALUES 


ACTUAL kK—e 


N = TABLE LENGTH = 8 
DELTA = STEP SIZE = 0.6 


INDEX ANGLE SINE TABLE 
0 0 X 360°/N S[0] = sin(O0 °/N) 
1 1 X 360°/N S[1] =sin(360 °/N) 
2 2 X 360°/N S[2] = sin(720 °/N) 


S(N-2] =sin((N-2) X 360°/N) 
S[N-1] =sin((N-1) X 360°/N) 


N-2 (N-2) X 360°/N 
N-1 (N-1) X 360°/N 


A sine wave is generated by stepping through the table 
at a constant rate (in effect, moving counterclockwise around 
the unit circle), wrapping around at the end of the table 
whenever 360° is exceeded. Using the table index as the 
angle parameter and DELTA as the step size, this lookup 
method generates the sequence: 

S{mod(k X DELTA,N)] fork = 1, 2, 3, 4, ... 
remainder of the division a/b when 


this quotient is computed as an integer 
[e.g., mod(22.34,5) = 2.34)] 


where mod(a,b) = 


The ‘mod’ operator provides the wraparound at the end 
of the table. Figure 1 illustrates this algorithm. 


‘x’ DENOTES A SAMPLE POINT 


Figure 1. Direct Table Lookup 
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The sampled waveform generated is only an approx- 
imation to a sampled sinusoid. In general, the longer the table 
is the more resolution it provides, and consequently, the 
closer the approximation will be. 

The frequency, f, of the sine wave depends on two 
factors: 


(1) The time interval between successive samples, i.e., 
the sampling interval, t 


(2) The step size, DELTA 
f is given by the equation: 


f= a [Hz] where t is expressed in seconds 


Note that to satisfy the Nyquist criterion there must be 
at least two samples generated each sinusoid period. This 
requires that DELTA < N/2. 

In Figure 1, N = 8 and DELTA = 0.6. If, for in- 
stance, eight samples are generated each millisecond, then 
t = 0.000125 seconds and 


gee OO 2. és 
f= 3x0.000125 Hz = 900 Hz 
TMS32010 Implementation 


This section describes the concise TMS32010 
subroutine, given in Appendix B, which implements the table 
lookup scheme based on a sine table with 128 entries. Each 
time this subroutine is called, the next sample point is 
calculated. This subroutine uses: 


(1) 138(=128 + 10) words of program memory space 
(128 words for sine table storage and 10 words for 
program memory) 

(2) 6 words in data memory as working registers 


If this program is used as a subroutine, each sample 
can be computed in 3.0 microseconds. However, if the code 
is inserted directly in line with the code of a master program, 
avoiding the overhead of a subroutine, a sample can be com- 
- puted in 2.2 microseconds. 

The values in the sine table are all scaled. The decimal 
values, + 1.0 and — 1.0, are represented by the two’s com- 
plement hexadecimal values 4000 and C000, respectively. 
All other values are scaled and rounded to the closest hexa- 
decimal number. Rounding is used, rather than truncation, 
to avoid adding unnecessary distortion. 

The 16-bit data memory location ‘ALPHA’ serves as 
a modulo 128 counter which cycles through the sine table 
to select the sample points. ALPHA is regarded as having 
an integer and fractional part with the format: 
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QQQAVAV™|Y . AQ°9AQQWQA 
1514131211109 8 76543210 


The 16-bit data memory locaion ‘DELTA’ contains the 
step size. DELTA has the same (integer.fraction) format as 
ALPHA. Every time the sine wave subroutine is called, the 
contents of ALPHA are incremented by the contents of 
DELTA. The integer portion of ALPHA (i.e., the eight 
MSBs) is the pointer to the sine table. However, because 
the table starts at address location SINE, this pointer is off- 
set by the value for that address before the table is accessed. 
The eight most significant bits of ALPHA are masked when 
ALPHA is updated to insure that they never exceed 127. The 
routine returns the sine value in the data memory location 
‘SINA’. 

For any given sampling interval, t, the frequencies 
which can be generated must be of the form 


DELTA . 
f= TX DE (Hz) where t is expressed in seconds 


Since DELTA has a precision of eight bits to the right 
of the decimal place, any desired frequency (< 1/2t [Hz]) 
can be approximated with an error of no more than 


1/256 i 
cx 128 Hl = —aagEKT (THz! 


For example, if the sampling frequency is 8 kHz, then 
the frequency resolution is 


8000 
Harmonic Distortion 
Due to approximations made in calculating the samples 
of a sine wave of frequency f, a certain amount of the 
‘‘energy’’ of the samples’ waveform will fall into other fre- 
quencies as well. These frequencies are either: 


(1) Harmonic frequencies, nf, where n = 
2, 3, 4, ..., OF 

(2) Subharmonic frequencies, nf/m, where n and m are 
integers. 


This spurious energy results in noise which is referred 
to as ‘‘harmonic distortion.’’ It is usually measured in terms 
of Total Harmonic Distortion (THD) which is defined as the 
ratio 


THD = spurious harmonic energy 
~ total energy of the waveform 
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There are two sources of error in the table lookup TABLE LENGTH: 32 


algorithm which cause harmonic distortion: DELTA THD 
a. a 2.0 0.00000024 
(1) Quantization error is introduced by representing the 2.25 0.00300893 
“sine table values by 16-bit numbers. 2.50 0.00240751 
(2) Larger errors are introduced when points between 2.75 0.00300917 
table entries are sampled. This occurs when DELTA 3.0 0.00000024 
is not an integer. 8.25 0.00300924 
‘ ae 11.625 0.00315807 
The longer the sine table is, the less significant the 
second error source will be. Consequently, harmonic distor- TABLE LENGTH: 64 
tion decreases with increasing table length. Furthermore, DELTA THD 
when DELTA is an integer, quantization is the only error 2.00 90.00000048 
source, and THD is extremely small regardless of table size. 2.25 0.00075269 
THD is given for several table lengths and values of DELTA 2.50 0.00060219 
in Figure 2. Note that the figures in this table only repre- 2.75 0.00075239 
sent the THD in the digitized sine wave. If the sine wave 3.00 0.00000018 
is reconstructed using a digital-to-analog converter and analog 8.25 0.00075204 
filters, these analog devices will contribute additional distor- 11.625 0.00079078 
tion. (The procedure for computing THD is described in 
Appendix A.) TABLE LENGTH: 128 
DELTA THD 
LINEAR INTERPOLATION METHOD cis rapist 
2.25 0.00018859 
2.50 0.00015080 


To decrease the harmonic distortion for a given table 


size, an interpolation scheme can be used to compute the sine need 0.000.188 92 
values between table entries more accurately. Linear inter- 6 ie caleba 
polation is the simplest method to implement. This method Oe eeee 

11.625 0.00020128 


uses the values of two consecutive table entries as the end 
points of a line segment. Sample points for parameter values 
falling between table entries assume values on the line seg- 
ment between the points. This algorithm is illustrated in 
Figure 3. 


Figure 2. Total Harmonic Distortion Using 
Direct Table Lookup 


ACTUAL SINE 
WAVE VALUES 


‘Xx’ DENOTES A SAMPLE POINT oe 


INTERPOLATED 
ouTPUT 
VALUES y vf, 
/, | 
4.8 5.6 6.4 7.2 y} | 
i l l Pe 
) 14 , 2 3 , 4 5 | 6 | 7 | j I 40 
| I 
| | | I / 
Yt | Y 
N = TABLE LENGTH = 8 \ | | l ’) 
DELTA = STEP SIZE = 0.8 | | 
! ] 
| 
”” 
~ =, ef 


Figure 3. Linear Interpolation 
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This algorithm is based on the linear approximation 


sin(360 °(1+D)/N) = sin(360°I/N) 
+ Dx {sin(360°( + 1)/N) 
— sin(360°I/N)} 


= SIU + DX { Sil+1) — SI} 

where N is the sine table length, 
I is an integer such that 0 < I < N-1, and 
D is a decimal number such that 0 < D < 1.0 


The value, S{I+ 1] — SII], is the slope of the line seg- 
ment between the two sample points which bracket the value 
I+D(.e., 1 s 1+D < I+). 

All the values required for this interpolation scheme 
are stored in the following two tables: 


INDEX ANGLE SINE TABLE SLOPE TABLE 

Oo 0 X 360°/N S{0] = sin(O0°/N) St1}- S[0} 

1 1 X 360°/N Si1) =sin(360 °/N) S{2)-S{1} 

2 2 X 360°/N S{2] = sin(720°/N) S13] - S{2} 
N-2 (N-2) X 360°/N SIN - 2] = sin(360 °(N — 2)/N) SIN- 1]-SIN- 2] 
N-1 (N-1) X 360°/N SIN — 1] = s8in(360 °(N — 1)/N) S(O) - SIN- 1} 


TMS32010 Implementation 

The sample TMS32010 implementation of this linear 
interpolation scheme, given in Appendix C, is an enhance- 
ment of the table lookup method. This subroutine is based 
on 128-entry sine and slope tables. Each time this subroutine 
is called, the next sample point is calculated. This subroutine 
uses: 


(1) 276 (=128 + 128 + 20) words of program memory 
space 
(128 words for sine table storage, 
128 words for slope table storage, and 
20 words for program memory) 
(2) 9 words in data memory as working registers 


If this program is used as a subroutine, each sample 
can be computed in 5.4 microseconds. However, if the code 
is inserted directly in line with the code of a master program, 
avoiding the overhead of a subroutine, a sample can be com- 
puted in only 4.6 microseconds. 

Just as in the table lookup algorithm, a sine wave is 
generated by stepping through the sine table at a constant 
rate, wrapping around at the end of the table whenever 360° 
is exceeded. The table index is used as the angle parameter, 
denoted by ALPHA. 

DELTA denotes the step size for this routine also. In 
this case, however, sample points falling between the samples 
in the sine table are evaluated using the linear approxima- 
tion formula given above. 

The values in both the sine and slope tables are 
calculated in the same way as they were for the table lookup 
program. The decimal values, +1.0 and —1.0, are 
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represented by the two’s complement hexadecimal values 
4000 and C000, respectively. All hexadecimal values are 
rounded rather than truncated to the closest 16-bit represen- 
tations to reduce quantization noise. . 

Because the method to compute the step size is the same 
as that used in the table lookup scheme, the frequency resolu- 
tion will also be the same. However, because of the linear 
interpolation between table entries, sine values are no longer 
limited to the values stored in the table. This allows the 
error between the computed value and the actual value to 
be less. 


Harmonic Distortion 

Figure 4 lists the distortion of several sine waves 
generated using the TMS32010 linear interpolation routine 
for various table lengths and step sizes. These results clear- 
ly show that the distortion for a particular fractional step size 
decreases if the size of the table is increased just as in the 
direct table lookup case. However, for the same non-integer 
step size and the same table length, the distortion for the 
linear distortion method is much lower than that of direct 
table lookup. . 

These values were experimentally determined and the 
method used to compute them is given in Appendix A. 


TABLE LENTH: 32 


DELTA THD 
2.0 0.00000024 
2.25 0.00169343 
2.50 0.00135476 
2.75 0.00169379 
3.0 0.00000024 
8.25 0.00169361 

11.625 0.00177808 


TABLE LENGTH: 64 


DELTA THD 
2.00 0.00000048 
2.25 0.00018884 
2.50 0.00015055 
2.75 0.00018771 
3.00 0.00000018 
8.25 0.00018806 

11.625 0.00019815 


TABLE LENGTH: 128 


DELTA THD 
2.00 0.00000054 
2.25 0.00000054 
2.50 0.000000 12 
2.75 0.00000101 
3.00 0.00000012 
8.25 0.00000006 

11.625 0.00000155 


Figure 4. Harmonic Distortion Using Linear Interpolation 
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IMPLEMENTATION TRADE-OFFS 


There are three trade-offs that must be considered when 
implementing the algorithms described above. They are 
speed, accuracy, and the size of the table ROM. 

The direct table lookup method is the fastest implemen- 
tation. Using a table that ranges from 0° to 360°, the routine 
needs only to address the table and compute the next angle. 
However, the table occupies more program memory space 
than is absolutely required. 

To minimize the amount of program memory required 
for the sine table, one can take advantage of the symmetry 
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of the sine function. By keeping track of the quadrant as 
ALPHA is increased, a table that ranges from 0° to 90° will 
be sufficient. This decreases the size of the table by three- 
fourths. However, the extra code necessary to keep track 
of the quadrant will increase the execution time of the routine. 

If harmonic distortion is important, then some form of 
interpolation is needed. One can use the linear interpolation 
method of the second example or other approximations such 
as a Taylor Series or Maclaurin Series expansions carried 
out to the second- or third-order term, or beyond. These 
schemes will, however, also increase the amount of code as 
well as the execution time. 
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APPENDIX A: COMPUTATION OF TOTAL HARMONIC DISTORTION 


To determine the Total Harmonic Distortion (THD) of a sampled data sine wave, the amount of energy due to frequency 
components other than the fundamental is divided by the total energy of the wave. This is computed from the formula: 


THD = [E(total) — E(fundamental)] / E(total) 


For the most accurate results, these energy terms should be calculated over a full cycle of the signal. In the case of a sine 
wave generated by either of the two methods, a full cycle may actually consist of several sinusoid periods. For instance, 
If N = table length = 128 and if DELTA = step size = 1.5, a cycle will only be completed for the smallest n for which 
n x 1.5 is evenly divisible by 128. This occurs for n = 256 which marks the end of the second sinusoid period. 


In general, if DELTA = A/B where A and B are relatively prime integers, and N = table length, then the sequence x(n), 
n = 1,2,3, ... of sine-wave samples will cycle after no more than B x N points. 


The amount of total ‘‘energy’’ in a cycle of this length is 


BN - 1 


E(total) = > x2(n) 


n=0 


The amount of ‘‘energy’’ in the fundamental frequency over this period is 


lI 


E(fundamental) = 1/BN (|X(A)|2 + |X(BN-—A)|2) 


2/BN |X(A)|2 for a real sequence 


where the X(k) terms are terms of the Discrete Fourier Transform defined by the equation 
BN - 1 
X(k) = > x(n)exp( — j(24/N)nk) 
n=0 | 


The values given in Figures 2 and 4 are based on actual values computed by the TMS32010 for the two sample sine-wave 
generator programs. The computation of THD was carried out on a VAX 11/780 using the above formulas with double- 
precision floating-point arithmetic. 


8. Precision Digital Sine-Wave Generation with the TMS32010 277 


278 8. Precision Digital Sine-Wave Generation with the TMS32010 


GENER1 


0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0003 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
00193 


0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 
0050 
0051 
0052 
0053 


0000 
0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
OO0A 
000B 
000C 
000D 
000E 
OOOF 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
OO1A 
001B 
001C 
001D 
OO1E 
001F 
0020 
0021 
0022 
0023 
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F900 
0083' 
0000 
0324 
0646 
0964 
0C7C 
OF8D 
1294 
1590 
187E 
1B5D 
1E2B 
20E7 
238E 
2620 
289A 
2AFB 
2D41 
2F6C 
3179 
3368 
3537 
36E5 
3871 
39DB 
3B21 
3C42 
3D3F 
3E15 
3EC5 
3F4F 
3FB1 
3FEC 
4000 
3FEC 


IDT 


*GENER1' 


PAGE 0001 


rEeESESEESLELESESESESESSLESLE SESE LE SELES SESE SE SESE SESS SEE LE SES ES 


x 
x 
x 
* 
* 
x 
x 
* 
x 
* 
* 
x 
x 
® 
x 
* 


(15-i) FRACTIONAL BITS. 


B 


SINE DATA 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


NOTE: 


SINE WAVE GENERATOR 
DIRECT TABLE LOOKUP METHOD 

THIS PROGRAM USES A LOOKUP TABLE OF SINE VALUES TO 
COMPUTE THE SAMPLES OF THE WAVE. THE FREQUENCY IS 
DETERMINED BY THE SIZE BY WHICH ONE STEPS THROUGH THE 
TABLE. THE TABLE CONSISTS OF 128 ENTRIES THAT CORRESPOND 
TO EQUALLY SPACED ANGLES BETWEEN 0 AND 360 DEGREES. 
HKEKRKERREK EKER KEKE IRE RRR ERE KIER EKEREKREEKEREEREKEKRKA 
Q NOTATION 
THE TMS32010 USES FIXED-POINT TWO'S COMPLEMENT NUMBERS. 
EACH 16-BIT NUMBER HAS A SIGN BIT, i INTEGER BITS, AND 
THE VALUE AFTER THE LETTER Q 
REFERS TO THE NUMBER OF FRACTIONAL BITS THAT ARE 
REPRESENTED BY THAT NUMBER, 


CONSIDERED TO HAVE 14 FRACTIONAL BITS. 
KHER KERR KRERERKEKRERERKEREREEEKREEEEEREREREEKREEERARIN 


START 


>0 
>324 
>646 
>964 
>C7C 
>F8D 
>1294 
>1590 
>187E 
>1B5D 
>1E2B 
>20E7 
>238E 
>2620 
>289A 
>2AFB 
>2D41 
>2F6C 
>3179 
>3368 
>3537 
>36E5 
>3871 
>39DB 
>3B21 
>3C42 
>3D3F 
>3E15 
>3EC5 
>3F4F 
>3FB1 
>3FEC 
>4000 
>3FEC 
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i.e., A Q1l4 NUMBER IS 


e€eeeene eee + H+ © 4H ££ HH F 


*THE SINE TABLE 

*VALUES ARE REPRESENTED IN 
*Q14 FORMAT, i.e., THERE 
*ARE 14 BITS AFTER THE 
*BINARY POINT. 
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GENER1 


0054 
0055 
0056 
0057 
0058 
0059 
0060 
0061 
0062 
0063 
0064. 
0065 
0066 
0067 
0068 
0069 
0070 
0071 
0072 
0073 
0074 
0075 
0076 
0077 
0078 
0079 
0080 
0081 
0082 
0083 
0084 
0085 
0086 
0087 
0088 
0089 
0090 
0091 
0092 
0093 
0094 
0095 
0096 
0097 
0098 
0099 
0100 
0101 
0102 
0103 
0104 
0105 
0106 
0107 


280 


0024 
0025 
0026 
0027 
0028 
0029 
002A 
002B 
002C 
002D 
002E 
002F 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
003A 
003B 
003C 
003D 
003E 
003F 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 
004A 
004B 
004c 
004D 
004E 
004F 
0050 
0051 
0052 
0053 
0054 
0055 
0056 
0057 
0058 
0059 
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3FB1 
3F4F 
3EC5 
3E15 
3D3F 
3C42 
3B21 
39DB 
3871 
36E5 
3537 
3368 
3179 
2F6C 
2D41 
2AFB 
289A 
2620 
238E 
20E7 
1E2B 
1B5D 
187E 
1590 
1294 
OF8D 
0C7C 
0964 
0646 
0324 
0000 
FCDC 
FOBA 
F69C 
F384 
F073 
ED6C 
EA70 
E782 
E4A3 
E1D5 
DF19 
DC72 
DIEO 
D766 
D505 
D2BF 
D094 
CE87 
ccg98 
CAC9 
C91B 
C78F 
C625 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


>3FB1l 
>3F4F 
>3EC5 
>3E15 
>3D3F 
>3C42 
>3B21 
>39DB 
>3871 
>36E5 
>3537 
>3368 
>3179 
>2F6C 
>2D41 
>2AFB 
>289A 
>2620 
>238E 
>20E7 
>1lE2B 
>1B5D 
>187E 
>1590 
>1294 
>F8D 
>C7C 
>964 
>646 
>324 
>0 
>FCDC 
>FOBA 
>F69C 
>F384 
>FO73 
>ED6C 
>EA70 
>E782 
>E4A3 
>E1D5 
>DF19 
>DC72 
>D9SEO 
>D766 
>D505 
>D2BF 
>D094 
>CE87 
>CC98 
>CAC9 
>C91B 
>C78F 
>C625 


PAGE 0002 
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GENER1 


0108 
0109 
0110 
0111 
0112 
0113 
0114 
0115 
0116 
0117 
0118 
0119 
0120 
0121 
0122 
0123 
0124 
0125 
0126 
0127 
0128 
0129 
0130 
0131 
0132 
0133 
0134 
0135 
0136 
0137 
0138 
0139 
0140 
0141 
0142 
0143 
0144 
0145 
0146 
0147 
0148 
0149 
0150 
0151 
0152 
0153 
0154 
0155 
0156 
0157 
0158 
0159 
0160 


005A 
005B 
005C 
005D 
005E 
005F 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 
0068 
0069 
006A 
006B 
006C 
006D 
006E 
006F 
0070 
0071 
0072 
0073 
0074 
0075 
0076 
0077 
0078 
0079 
007A 
007B 
007C 
007D 
007E 
007F 
0080 
0081 
0082 
0082 
0083 


0083 
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C4DF DATA >C4DF 

C3BE DATA >C3BE 

C2cl DATA >C2Cl 

C1EB DATA >C1EB 

C13B DATA >C13B 

COB1 DATA >C0B1l 

CO4F DATA >CO4F 

c014 DATA >C014 

C000 DATA >C000 

C014 DATA >c014 

CO4F DATA >CO4F 

COB1 DATA >COB1l 

C13B DATA >C13B 

C1EB DATA >C1EB 

C2cl DATA >C2Cl 

C3BE DATA >C3BE 

C4DF DATA >C4DF 

C625 DATA >C625 

C78F DATA >C78F 

C91B DATA >C91B 

CAC9 DATA >CAC9 

CCc98 DATA >CC98 

CE87 DATA >CE87 

D094 DATA >D094 

D2BF DATA >D2BF 

D505 DATA >D505 

D766 DATA >D766 

DSEO DATA >DIYEO 

DC72 DATA >DC72 

DF19 DATA >DF19 

E1DS DATA >E1D5 

E4A3 DATA >E4A3 

E782 DATA >E782 

EA70 DATA >EA70 

ED6C DATA >ED6C 

F073 DATA >F073 

F384 DATA >F384 

F69C DATA >F69C 

FOBA DATA >FOBA 

FCDC DATA >FCDC 

7FFF Ml DATA >7FFF 
RAKE KKKKKEKKKEKKKKKKKKKKKKR RK KEK 
*DATA MEMORY LOCATIONS USED* 
KkeKRKKKKKRKKKRRKK KKK KK KKKKKEK 

0000 DELTA EQU 0 

0001 ALPHA EQU 1 

0002 SINA EQU 2 

0003 TEMP EQU 3 *WORKSPACE REGISTER 

0004 MASK EQU 4 

0005 OFSET EQU 5 
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GENER1 


0161 
0162 
0163 
0164 
0165 
0166 
0167 
0168 
0169 
0170 
0171 
0172 
0173 
0174 
0175 
0176 
0177 


0178 
0179 
0180 
0181 


0182 
0183 
0184 
0185 
0186 
0187 
0188 
0189 
0190 
0191 
0192 
0193 
0194 
0195 
0196 
0197 
0198 
0199 
0200 
0201 


0083 
0084 
0085 
0086 
0087 
0088 
0089 
OO8A 
008B 
008C 


008D 
008E 
008F 


0O08F 
0090 
0091 
0092 
0093 
0094 
0095 
0096 
0097 
0098 


NO ERRORS, 
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PAGE 0004 

KKK KKKERE KK EK KEKE RR KEE KEKE RE KEK ERE KR ERE RHEE KREEKEKKKK 
NECESSARY INITIALIZATIONS: * 
MASK INITIALIZED TO >7FFF FOR 128 POINT TABLE % 
OFSET INITIALIZED TO THE ADDRESS AT THE BEGINNING * 
OF TABLE. * 

ALPHA INITIALLY CLEARED * 


*e % 


* 


7E82 
6704 
7EO2 
5005 
7F89 
5001 
4100 
F800 Ll 
OO08F' 


DELTA INITIALIZED TO INCREMENT VALUE USING Q8 FORMAT * 
HIKE IIR KKK IIE HIKE KKK EEE KARE EKER EEE KKK KKK 


6F00 START LDP 


0 * SET DATA PAGE POINTER 


ALPHA 
DELTA, PA1l * IN THIS EXAMPLE, 
SWAVE1 * DELTA IS INPUT 


KRKKKKKKEKEKKREKEREKKEEKKEKREKKEKKKAKK 
REST OF PROGRAM * 


RHKKEKKKEKKKKRKKKKKKKKEKKRKKKKKRKRKKKK 


* 


F900 
008B' 


B 


Ll 


KREEKEKKEKEKKKEEKKKKRKRKRKEKKKRKKKKKRKRKRKKKKKRKKKKKKKKRKKKKRKKRKKKKKRKKRKK 


SINE WAVE SUBROUTINE: 


+ 6 + H 


* 


"SINA’. 


TO EXECUTE. 


2801 SWAVE1 LAC 


5803 
2003 
0005 
6702 
2001 
0000 
7904 


SACH 


* 
THIS ROUTINE EXTRACTS THE SINE OF AN ANGLE FROM THE * 
TABLE AND RETURNS THE VALUE IN THE DATA LOCATION * 
IT USES A FRACTIONAL STEP SIZE TO COMPUTE * 
THE NEXT POINT OF THE WAVE. IT TAKES 2.6 microseconds * 
* 
eR EERE SE SLES RELE SEES SELAR SEES EERE SESE SEES ESE SESS ESE SESS SE SE SY 
ALPHA, 8 
TEMP *ISOLATE INTEGER PORTION 
TEMP 
OFSET 
SINA *SINE VALUE FROM TABLE (Q14) 
ALPHA 
DELTA *COMPUTE NEXT ADDRRESS 
MASK *MODULO 128 MASK = >7FFF 
ALPHA *SAVE NEXT ADDRESS 


5001 
7F8D 


NO WARNI 


NGS 


*RETURN TO MAIN PROGRAM 
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GENER2 


0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 


0029 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 
0050 
0051 
0052 
0053 


0000 
0000 
0001 


0002 
0003 


0004 
0005 
0006 
0007 
0008 
0009 
OOOA 
000B 
000C 
000D 
OO0E 
OOOF 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 


APPENDIX C: TMS32010 LINEAR INTERPOLATION ROUTINE 
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F900 
0104' 


7FFF 
OFFF 


0000 
0324 
0646 
0964 
0Cc7C 
OF8D 
1294 
1590 
187E 
1B5D 
1E2B 
20E7 
238E 
2620 
289A 
2AFB 
2D41 
2F6C 
3179 
3368 
3537 


PAGE 0001 


IDT 'GENER2' 

KKK KKKHKKKKRKKKKKKKKRKRKKRKKRKKKKKKRKK 
x SINE WAVE GENERATOR 

* LINEAR INTERPOLATION METHOD 

* THIS PROGRAM USES A LOOKUP TABLE OF SINE VALUES TO 

* COMPUTE THE SAMPLES OF THE WAVE. THE FREQUENCY IS 

* DETERMINED BY THE SIZE BY WHICH ONE STEPS THROUGH 

* THE TABLE. THE TABLE CONSISTS OF 128 ENTRIES THAT 

* CORRESPOND TO EQUALLY SPACED ANGLES BETWEEN 0 AND 

* 360 DEGREES. POINTS BETWEEN THE TABLE ENTRIES ARE 

* APPROXIMATED USING A LINEAR APPROXIMATION, 

* sin(A) ~= sin(INT[A]) 

* + {sin(INT([A]+1)-sin(INT([A])}xFRACT[A] 
* ALL THE POSSIBLE SLOPES BETWEEN ANY TWO CONSECUTIVE* 
* SINE TABLE ENTRIES ARE STORED IN A SEPARATE TABLE. * 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 


+ + + + + FH HF HF HF HF 


THE SLOPE TABLE IS ALSO 128 ENTRIES LONG. * 
ree See eeeeeceeeeeseeece see ee eee eeeeeeceeeeeeecee ee eee ee ee & 
NOTE: Q NOTATION * 

THE TMS32010 USES FIXED-POINT TWO'S COMPLEMENT * 

NUMBERS. EACH 16-BIT NUMBER HAS A SIGN BIT, i * 

INTEGER BITS, AND (15-i) FRACTIONAL BITS. THE * 

VALUE AFTER THE LETTER Q REFERS TO THE NUMBER OF * 

FRACTIONAL BITS THAT ARE REPRESENTED BY THAT * 

NUMBER, i.e., A Ql14 NUMBER IS CONSIDERED TO HAVE * 

14 FRACTIONAL BITS. * 

* 


KHEKEKKEKKEKKEKEKEKKEKKEKKEKEKKEKKKKKKEKKKEKKEKEKKKKKKKKKKKEK 


B START 
x 
M1 DATA >7FFF *MASK VALUES 
M2 DATA >OFFF 
x 
SINE DATA >0 *THE SINE TABLE 
DATA >324 *VALUES ARE REPRESENTED 
DATA >646 *IN Q14 FORMAT, i.e., 
DATA >964 *THERE ARE 14 BITS AFTER 
DATA >C7C *THE BINARY POINT. 
DATA >F8D 
DATA >1294 
DATA >1590 
DATA >187E 
DATA >1B5D 
DATA >1E2B 


DATA >20E7 
DATA >238E 
DATA >2620 
DATA >289A 
DATA >2AFB 
DATA >2D41 
DATA >2F6C 
DATA >3179 
DATA >3368 
DATA >3537 
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0054 
0055 
0056 
0057 
0058 
0059 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 
0068 
0069 
0070 
0071 
0072 
0073 
0074 
0075 
0076 
0077 
0078 
0079 
0080 
0081 
0082 
0083 
0084 
0085 
0086 
0087 
0088 
0089 
0090 
0091 
0092 
0093 
0094 
0095 
0096 
0097 
0098 
0099 
0100 
0101 
0102 
0103 
0104 
0105 
0106 
0107 
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0019 
001A 
001B 
001C 
001D 
OO1E 
OO1F 
0020 
0021 
0022 
0023 
0024 
0025 
0026 
0027 
0028 
0029 
002A 
002B 
002C 
002D 
002E 
002F 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
0038 
0039 
OO3A 
003B 
003C 
003D 
003E 
003F 
0040 
0041 
0042 
0043 
0044 
0045 
0046 
0047 
0048 
0049 
004A 
004B 
004C 
004D 
004E 
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36E5 
3871 
39DB 
3B21 
3C42 
3D3F 
3E15 
3EC5 
3F4F 
3FB1 
3FEC 
4000 
3FEC 
3FB1 
3F4F 
3EC5 
3E15 
3D3F 
3C42 
3B21 
39DB 
3871 
36E5 
3537 
3368 
3179 
2F6C 
2D41 
2AFB 
289A 
2620 
238E 
20E7 
1E2B 
1B5D 
187E 
1590 
1294 
OF8D 
0Cc7C 
0964 
0646 
0324 
0000 
FCDC 
FOBA 
F69C 
F384 
F073 
ED6C 
EA70 
E782 
E4A3 
E1D5 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 

DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


->36E5 


>3871 
>39DB 
>3B21 
>3C42 
>3D3F 
>3E15 
>3EC5 
>3F4F 
>3FB1 
>3FEC 
>4000 
>3FEC 
>3FB1 
>3F4F 
>3EC5 
>3E15 
>3D3F 
>3C42 
>3B21 
>39DB 
>3871 
>36E5 
>3537 
>3368 
>3179 
>2F6C 
>2D41 
>2AFB 
>289A 
>2620 
>238E 
>20E7 
>1E2B 
>1B5D 
>187E 
>1590 
>1294 
>F8D 
>C7C 
>964 
>646 
>324 
>0 
>FCDC 
>FOBA 
>F69C 
>F384 
>F073 
>ED6C 
>EA70 
>E782 
>E4A3 
>E1D5 
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0108 
0109 
0110 
0111 
0112 
0113 
0114 
0115 
0116 
0117 
0118 
0119 
0120 
0121 
0122 
0123 
0124 
0125 
0126 
0127 
0128 
0129 
0130 
0131 
0132 
0133 
0134 
0135 
0136 
0137 
0138 
0139 
0140 
0141 
0142 
0143 
0144 
0145 
0146 
0147 
0148 
0149 
0150 
0151 
0152 
0153 
0154 
0155 
0156 
0157 
0158 
0159 
0160 
0161 


004F 
0050 
0051 
0052 
0053 
0054 
0055 
0056 
0057 
0058 
0059 
OO5SA 
0O5B 
005C 
005D 
005E 
OOSF 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 
0068 
0069 
006A 
006B 
006C 
006D 
O06E 
OO6F 
0070 
0071 
0072 
0073 
0074 
0075 
0076 
0077 
0078 
0079 
007A 
007B 
007C 
007D 
OO7E 
OO7F 
0080 
0081 
0082 
0083 
0084 
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DF19 DATA >DF19 
DC72 DATA >DC72 
D9IJEO DATA >D9EN 
D766 DATA >D766 
D505 DATA >D505 
D2BF DATA >D2BF 
D094 DATA >D094 
CE87 DATA >CE87 
ccg98 DATA >CC98 
CAC9 DATA >CAC9I 
C91B DATA >C91B 
C78F DATA >C78F 
C625 DATA >C625 
C4DF DATA >C4DF 
C3BE DATA >C3BE 
C2Ccl DATA >C2C1l 
ClEB DATA >C1EB 
C13B DATA >C13B 
COB1 DATA >COB1 
CO4F DATA >CO4F 
C014 DATA >C014 
co00 DATA >C000 
C014 DATA >C014 
CO4F DATA >CO4F 
COB1 DATA >COB1 
C13B DATA >C13B 
C1EB DATA >C1EB 
C2cl DATA >C2C1 
C3BE DATA >C3BE 
C4DF DATA >C4DF 
C625 DATA >C625 
C78F DATA >C78F 
C91B DATA >C91B 
CAC9 DATA >CAC9 
ccg98g DATA >CC98 
CE87 - DATA >CE87 
D094 DATA >DO094 
D2BF DATA >D2BF 
D505 DATA >D505 
D766 DATA >D766 
D9EO DATA >DIEO 
DC72 DATA >DC72 
DF19 DATA >DF19 
E1D5 DATA >E1D5 
E4A3 DATA >E4A3 
E782 DATA >E782 
EA70 DATA >EA70 
ED6C DATA >ED6C 
F073 DATA >F0O73 
F384 DATA >F384 
F69C DATA >F69C 
FOBA DATA >FIBA 
FCDC DATA >FCDC 
0324 TSLOPE DATA >324 *SLOPE BETWEEN TWO 
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0162 
0163 
0164 
0165 
0166 
0167 
0168 
0169 
0170 
0171 
0172 
0173 
0174 
0175 
0176 
0177 
0178 
0179 
0180 
0181 
0182 
0183 
0184 
0185 
0186 
0187 
0188 
0189 
0190 
0191 
0192 
0193 
0194 
0195 
0196 
0197 
0198 
0199 
0200 
0201 
0202 
0203 
0204 
0205 
0206 
0207 
0208 
0209 
0210 
0211 
0212 
0213 
0214 
0215 
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0085 
0086 
0087 
0088 
0089 
008A 
008B 
008C 
008D 
008E 
008F 
0090 
0091 
0092 
0093 
0094 
0095 
0096 
0097 
0098 
0099 
OOSA 
009B 
009C 
009D 
O09E 
009F 
0OA0 
OOA1 
O0A2 
00A3 
O0A4 
OOA5 
O0A6 
00A7 
OOA8 
O0A9 
OOAA 
OOAB 
OOAC 
OOAD 
OOAE 
OOAF 
00B0 
OOB1 
00B2 
00B3 
00B4 
00B5 
00B6 
00B7 
00B8 
0OB9 
OOBA 
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0322 
O31E 
0318 
0311 
0307 
O2FC 
02EE 
O2DF 
02CE 
02BC 
02A7 
0291 
027A 
0261 
0246 
022B 
020D 
O1EF 
01CF 
O1AE 
018C 
016A 
0146 
0121 
OOFC 
00D6 
00B0 
0089 
0062 
003B 
0014 
FFEC 
FFC5 
FF9E 
FF77 
FF50 
FF2A 
FFO4 
FEDF 
FEBA 
FE96 
FE74 
FE52 
FE31 
FE11 
FDF3 
FDD5 
FDBA 
FD9OF 
FD86 
FD6F 
FD59 
FD44 
FD32 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


>322 
>31E 
>318 
>311 
>307 
>2FC 
>2EE 
>2DF 
>2CE 
>2BC 
>2A7 
>291 
>27A 
>261 
>246 
>22B 
>20D 
>1lEF 
>1CF 
>1lAE 
>18C 
>16A 
>146 
>121 
>FC 
>D6 
>BO 
>89 
>62 
>3B 
>14 
>FFEC 
>FFC5 
>FFIE 
>FF77 
>FF50 
>FF2A 
>FFO4 
>FEDF 
>FEBA 
>FEI6 
>FE74 
>FE52 
>FE31 
>FE11 
>FDF3 
>FDD5 
>FDBA 
>FDIF 
>FD86 
>FD6F 
>FD59 
>FD44 
>FD32 
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*SINE ENTRIES (Q14) 
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0216 
0217 
0218 
0219 
0220 
0221 
0222 
0223 
0224 
0225 
0226 
0227 
0228 
0229 
0230 
0231 
0232 
0233 
0234 
0235 
0236 
0237 
0238 
0239 
0240 
0241 
0242 
0243 
0244 
0245 
0246 
0247 
0248 
0249 
0250 
0251 
0252 
0253 
0254 
0255 
0256 
0257 
0258 
0259 
0260 
0261 
0262 
0263 
0264 
0265 
0266 
0267 
0268 
0269 


00BB 
OOBC 
OOBD 
OOBE 
OOBF 
00cO 
00C1 
00C2 
00C3 
00C4 
00Cc5 
00C6 
00C7 
00c8 
00Cc9 
00CA 
O0OCB 
00CcCc 
00CD 
OOCE 
OOCF 
00D0 
00D1 
00D2 
00D3 
00D4 
00D5 
00D6 
00D7 
00D8 
00D9 
OODA 
OODB 
0ODC 
0ODD 
OODE 
OODF 
00EO 
00E1 
O0E2 
00E3 
OOE4 
OOES5 
OOE6 
00E7 
00E8 
00E9 
OOEA 
OOEB 
00EC 
OOED 
OOEE 
OOEF 
0OFO 
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FD21 
FD12 
FD04 
FCF9 
FCEF 
FCE8 
FCE2 
FCDE 
FCDC 
FCDC 
FCDE 
FCE2 
FCE8 
FCEF 
FCF9 
FD04 
FD12 
FD21 
FD32 
FD44 
FD59 
FD6F 
FD86 
FDSF 
FDBA 
FDD5 
FDF3 
FE11 
FE31 
FE52 
FE74 
FE96 
FEBA 
FEDF 
FFO4 
FF2A 
FF50 
FF77 
FF9E 
FFC5 
FFEC 
0014 
003B 
0062 
0089 
00BO 
00D6 
OOFC 
0121 
0146 
016A 
018C 
01AE 
01CF 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


>FD21 
>FD12 
>FDO4 
>FCF9 
>FCEF 
>FCE8 
>FCE2 
>FCDE 
>FCDC 
>FCDC 
>FCDE 
>FCE2 
>FCE8 
>FCEF 
>FCF9 
>FDO4 
>FD12 
>FD21 
>FD32 
>FD44 
>FD59 
>FD6F 
>FD86 
>FDIF 
>FDBA 
>FDD5 
>FDF3 
>FE11 
>FE31 
>FE52 
>FE74 
>FE96 
>FEBA 
>FEDF 
>FFO4 
>FF2A 
>FF50 
>FF77 
>FFIE 
>FFCS 
>FFEC 
>14 
>3B 
>62 
>89 
>BO 
>D6 
>FC 
>121 
>146 
>16A 
>18C 
>1lLAE 
>1CF 
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0270 
0271 
0272 
0273 
0274 
0275 
0276 
0277 
0278 
0279 
0280 
0281 
0282 
0283 
0284 
0285 
0286 
0287 
0288 
0289 
0290 
0291 
0292 
0293 
0294 
0295 
0296 
0297 
0298 
0299 
0300 
0301 
0302 
0303 
0304 
0305 
0306 
0307 
0308 
0309 
0310 
0311 
0312 
0313 
0314 
0315 
0316 
0317 
0318 
0319 
0320 
0321 
0322 
0323 
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OOF1 
OOF2 
00OF3 
OOF4 
OOFS 
OOF6 
OOF7 
OOF8 
0OFS 
OOFA 
OOFB 
OOFC 
OOFD 
OOFE 
OOFF 
0100 
0101 
0102 
0103 
0104 


0104 
0104 
0105 
0106 
0107 
0108 
0109 
O10A 
010B 
010C 
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O1EF DATA >1EF 
020D DATA >20D 
022B DATA >22B 
0246 DATA >246 
0261 DATA >261 
O27A DATA >27A 
0291 DATA >291 
02A7 DATA >2A7 
02BC DATA >2BC 
02CE DATA >2CE 
02DF DATA >2DF 
02EE DATA >2EE 
O2FC DATA >2FC 
0307 DATA >307 
0311 DATA >311 
0318 DATA >318 
O31E DATA >31E 
0322 DATA >322 
0324 DATA >324 


0000 
0001 
0002 
0003 
0004 
0005 
0006 
0007 
0008 


6F00 
7E04 
5007 
7E84 
1007 
5008 
7E02 
6705 
7E03 


KREEKREKKKEKEKKEKRKEKKKKKKKKKKRKRKEKEK 


*DATA MEMORY LOCATIONS USED * 
KKK KKEEKRREKKEREKRREKEKREAKKKKKEKK 


DELTA EQU 
ALPHA EQU 
SLOPE EQU 
SINA EQU 
TEMP EQU 
MASK1 EQU 
MASK2 EQU 
OFSET1 EQU 
OFSET2 EQU 


NO Of WN FE © 


8 


*SCRATCH PAD LOCATION 

*MASK FOR MODULO 128 POINTER 
*ISOLATE FRACTIONAL PART 
*ADDRESS OF SINE TABLE 
*ADDRESS OF SLOPE TABLE 


REKKKKKKEKKEKEEEKEKKEKKKKKKEKKEKKKEKKKEEKKEKKKKKKKRKKKKKKEKKKKK 


* NECESSARY INITIALIZATIONS: * 
* MASK1 INITIALIZED TO >7FFF FOR 128 POINT TABLE * 
* MASK2 INITIALIZED TO >OFFF * 
* OFSET1 SET TO THE ADDRESS AT THE BEGINNING OF 
* SINE TABLE * 
* OFSET2 SET TO THE ADDRESS AT THE BEGINNING OF * 
* SLOPE TABLE WITH RESPECT TO SINE TABLE * 
* ALPHA INITIALLY CLEARED * 
* DELTA INITIALIZED TO INCREMENT VALUE USING * 
* Q8 FORMAT * 
RHEKEKEKKEKKEKKKKKKKKKEKKKKKKKKKKKKKKRKKKKKKKKKRKKKKKKKRKKKKKKKK 
START LDP 0 *SET DATA PAGE POINTER. 

LACK SINE *SINE TABLE ADDRESS. 

SACL OFSET1 

LACK TSLOPE *SLOPE TABLE ADDRESS. 

SUB OFSET1 

SACL OFSET2 

LACK M1 *RETRIEVE MASK1. 

TBLR = MASK1 

LACK M2 *RETRIEVE MASK2. 
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PAGE 0007 

0324 010D 6706 TBLR MASK2 
0325 O10OE 7F89 ZAC 
0326 O1OF 5001 SACL ALPHA *START ANGLE AT ZERO. 
0327 0110 4100 IN DELTA, PA1 *FOR THIS EXAMPLE, 
0328 0111 F800 Lil CALL SWAVE2 *DELTA IS INPUT. 

0112 0115' 
0329 KREAEKRKRKKKRKKKKKKRKKRKKRKKKKRKKRKRKKK 
0330 * REST OF PROGRAM * 
0331 KEKE KKEKEKEKEKKKRKK 
0332 0113 F900 B Ll 

0114 0111’ 
0333 0115 
0334 0115 
0335 PES SESERESESLELSEE LESSEE SESE LESSEE SERA SA SS ESESSESS ESS ELSES SE 
0336 * SINE WAVE SUBROUTINE: * 
0337 * THIS ROUTINE COMPUTES THE SINE OF AN ANGLE AND * 
0338 * RETURNS THE VALUE IN DATA LOCATION 'SINA'‘., IT USES * 
0339 * A FRACTIONAL STEP SIZE TO AUTOMATICALLY COMPUTE THE * 
0340 * ADDRESS OF THE NEXT POINT OF THE SINE WAVE. IT * 
0341 * TAKES 5.0 microseconds TO EXECUTE. * 
0342 KREKRKREKRKKEKRKKRKERKEKRKKEKEKKREKRKKRKEKKEKRKKEKKRKKRKKKKKRKKRKKREK 
0343 0115 2801 SWAVE2 LAC ALPHA, 8 
0344 0116 5804 SACH TEMP * ISOLATE INTEGER PORTION 
0345 0117 2004 LAC TEMP 
0346 0118 0007 ADD OFSET1 
0347 0119 6703 TBLR SINA *GET CLOSEST LOWER SINE- 
0348 011A 0008 ADD OFSET2 *TABLE ENTRY (b). 
0349 011B 6702 TBLR SLOPE *SLOPE FOR CORRESPONDING INTERVA 
0350 011C 2401 LAC ALPHA, 4 * ISOLATE FRACTIONAL PORTION 
0351 011D 7906 AND MASK2 *AND MAKE Q8 NUMBER INTO Q12. 
0352 O11E 5004 SACL TEMP 
0353 O11F 6A04 LT TEMP 
0354 0120 6D02 MPY SLOPE * mx 
0355 0121 7F8E PAC 
0356 0122 0C03 ADD SINA,12 
0357 0123 5C03 SACH SINA,4 * y = mx + b. 
0358 0124 2001 LAC ALPHA *COMPUTE ADDRESS OF NEXT POINT. 
0359 0125 0000 ADD DELTA *OF WAVE. 
0360 0126 7905 AND MASK1 *CLEAR MOST SIGNIFICANT BIT. 
0361 0127 5001 SACL ALPHA *SAVE NEXT ADDRESS. 
0362 0128 7F8D RET *RETURN TO MAIN PROGRAM. 
0363 END 


NO ERRORS, NO WARNINGS 
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INTRODUCTION 


Matrix multiplication is useful in applications such as 
graphics, numerical analysis, or high-speed control. The 
purpose of this application report is to illustrate matrix 
multiplication on two digital signal processors, the 
TMS32010 and TMS32020. 

Both the TMS32010 and TMS32020 can multiply any 
two matrices of size M x N and N xP. The programs for 
the TMS32010 and TMS32020, included in the appendices, 
can multiply large matrices and are only limited by the 
amount of internal data RAM available. Assuming a 200-ns 
cycle time, the TMS32010 and TMS32020 can calculate 
[1 x3] x [3x3] in 5.4 microseconds. 

Before discussing the two versions of implementing a 
matrix multiplication algorithm, a brief review of matrix 
multiplication is presented along with three examples of 
graphics applications. 


MATRIX MULTIPLICATION 


The size of a matrix is defined by the number of rows 
and columns it contains. For example, the following is a 5 x3 
matrix since it contains five rows and three columns. 


all aj? a}3 
a2] a2? a23 
A= a3} a32 a33 
a4l a42 a43 
a5] as2 a53 


Any two matrices can be multiplied together as long 
as the second matrix has the same number of rows as the 
first has of columns. This condition is called conformability. 
For example, if a matrix A is an M XN matix and a matrix 
B is an N x P matrix, then the two can be multiplied together 
with the resulting matrix being of size M xP. 


[3] [8 


MXN = 2x2 NxP = 2x1 MxP = 2x1 


Example: (3)(4) + (4)(6) = 36 


Given the two conformable matrices A and B, the 
elements of C = A XB are given by: 


N 


yD aik X Dk; 


Cc; = 
S k=] 


fori = 1,....M andj = 1,...,P 
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Q12 FORMAT 


Applications often require multiplication of mixed 
numbers. Since the TMS32010 and TMS32020 implement 
fixed-point arithmetic, the programs in the appendices assume 
a Q12 format, i.e., 12 bits follow an assumed binary point. 
The bits to the right of the assumed binary point represent 
the fractional part of the number and the four bits to the left 
represent the integer part of the number. An example of Q12 
format is as follows: 


0001.110111100000 = 1.866 
ASSUMED BINARY POINT 


0000.110111100000 = 0.866 in Q12 
x 0000.100000000000 = 0.5 in Q12 


00000000 .011011110000000000000000 = 0.433 in Q24 


The result of a Q12 by Q12 multiplication is a number 
in a Q24 format that can easily be converted to Q12 by a 
logical left-shift of four. The first four bits will be lost as 
well as the last twelve, but these bits are insignificant for 
Q12. Note that the programs in the appendices provide no 
protection against overflow; therefore, the design engineer 
should implement a format that best fits the application. 


GRAPHICS APPLICATIONS 


Operations in graphics applications, such as translation, 
scaling, or rotation, require matrix manipulations to be 
performed in a limited amount of time. Therefore, the 
TMS32010 and TMS32020 processors are ideal for these 
applications. Graphics applications, such as scaling and 
rotation of points in a coordinate system, require 
multiplication of matrices. Translation is typically 
implemented by addition of two matrices. However, when 
points are represented in a homogeneous coordinate system, 
translation can be implemented by multiplication. In a 
homogeneous coordinate system, a point P(x,y) is 
represented as P(X,Y,1). This type of coordinate system is 
desirable since it relates translation with scaling and rotation. 

Translation can be defined as the moving of a point 
or points in a coordinate system from one location to another 
without rotating. This is accomplished by adding a 
displacement value Dx to the X coordinate of a point and 
adding a displacement value Dy to the Y coordinate, thus 
moving the point from one location to another. Figure 1 
shows both addition and multiplication methods of translation 
and an example of each. 

Similar to translation, scaling can be implemented by 
matrix multiplication. Points can be scaled by multiplying 
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Rotation of the coordinates of a point (or points) about 
an angle theta can also be accomplished by a matrix 
multiplication. The following set of equations results with 
the matrix multiplication required to rotate an object about 


any angle. 
y 
(Xnew, Ynew) 
ADDITION METHOD 
(Xwew Ynew! = [Xoo Yorn! + (Dx Dy] 
where D, = 5 and D, = 1 (Xoup, Youn) 
MULTIPLICATION METHOD 
1 0 0 
(Xwew Ynew 1] = [Xoin Yoo 1) * | O 61 z 
D, Dy 1 Xo_p = Fr cos¢ 
where D, = 5 and D, = 1 Youtp = ' sing 
Xnew = © cos (@+¢) = r cos¢ cosO — r sing sind 
Figure 1. Translation of Coordinates Ynew = rsin (0+¢) = r cos¢ sin@ + r sind cosO 


Xnew = Xoip cos®8 — Yoip sind 
YNew = Xoipb sinO + Youo cosO 


each coordinate of a point (or points) by a scaling value Sx 


and Sy. Scaling an object is similar to stretching or shrinking oR 

an object. The coordinates of each point that makes up the 

object are multiplied by a scaling value which scales the cosQ—s sin@—s«OO 
object to a larger or smaller scale. Figure 2 shows the scaling (Xnew Ynew 1] = [Xotn Youn 1] ¢ |-sin@ cos6 0 
of an object from-one size to another. 2 . 4 

BEFORE SCALING AFTER SCALING 
y y 
x x 


Let the scaling factors S, and S, = 0.5 


Ss, 0 oO 

[Xwew Ynew 1] = (Xoip Youn 1) 2° | 9 Sy O 
o oOo 1 

05 Oo 0 

IXY 1] = [441] e} 0 05 O 
o o 1 


IX Y' 1] = (22 1] 
Figure 2. Scaling From One Size To Another 
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Figure 3 shows an implementation of these equations 
to rotate an object 30 degrees about the origin. 

Figures 4 and 5 show a segment of straight-line 
TMS32010 and TMS32020 code, respectively. These 
programs calculate the coordinate rotation example using a 
Q12 format. Note that once the matrices are loaded into 
memory, the procssors can calculate the results in 5.4 
microseconds. The segment of TMS32020 code in Figure 5 
implements the MAC instruction. For small matrices, the 
MAC instruction in conjunction with the RPT instruction 
gains little due to the overhead timing of the MAC 
instruction. However, for larger matrices, this method is 
most efficient since the MAC instruction becomes single- 
cycle in the repeat mode. For applications that only require 
translation, scaling, or rotation of coordinates, straight-line 
code as in Figures 4 and 5 is more efficient than the larger 
programs in the appendices. 


P’ (5,5.2) 


x 
0.866 05 O 
[IX ¥ 1] = [7 2 1] °e| -0.5 0.866 0 
0 0 1 


[X' ¥’ 1] = [5.0 5.2 1] 


Figure 3. Implementation of Rotation Matrix 
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Figure 4. TMS32010 Code for Rotation 
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vA hd 
71a? 
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4640AQ 
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ANS, FAQ 


# CALCULATE NEW Y COORDINATES. 


# TONVERT TO mice AND OUTPUT RESULT. 


* FINISH HOMOGENEOUS MATRIX. 


Figure 4. TMS32010 Code for Rotation (Concluded) 
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APAC 
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APAC 
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MPYK 
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RPTK 
MAC 


APAC 
SACH 
OUT 
RET 
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AR], 2300 
5 

e+, PAG 
AR1L, #200 


*+,PA0 


=O 
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»FEFOO ’ a+ 
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20 


sFFOO, *+ 


ANS, 4 
ANS, PAO 
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> FEOO, #+ 


ANS, 4 
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& 
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USE AUXILIARY REGISTER 1. 


INITIALIZE ACCUMULATOR. 
LOAD ROTATION MATRIX INTO Bl. 


LOAD COORDINATES INTO BLOCK &O. 


* CONFIGURE BO AS PROGRAM MEMORY. 


CLEAR P REGISTER. 


CALCULATE THE NEW X COORDINATE. 


OUTPUT NEW X COORDINATE. 


* CLEAR P REGISTER. 


CALCULATE NEW Y COORDINATE. 


* OUTPUT NEW Y COORDINATE. 
* CLEAR P REGISTER. 


FINISH HOMOGENEOUS MATRIX. 


Figure 5. TMS32020 Code for Rotation 
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To combine translation, scaling, and rotation, a more 
general matrix can be implemented. 


GENERAL MATRIX FOR 
TWO-DIMENSIONAL SYSTEMS 


Tl] rj2 0 
r2] r22 0 


The upper 2 x 2 matrix is a combination rotation matrix 
and scaling matrix. The tx and ty values are the translation 
values. A three-dimensional general matrix can be developed 
similar to the two-dimensional translation, scaling, and 
rotation matrix. 


INITIALIZATION 
INPUT M, N, AND P. 


CALCULATE SIZE OF MATRIX A 
AND B. AIS MxN, 
AND B iS N xP. 


INPUT THE A MATRIX BY ROWS. STORE 
THESE VALUES IN MEMORY IMMEDIATELY 
AFTER THE INITIALIZATION VALUES. 


INPUT THE B MATRIX BY COLUMNS. 
STORE THESE VALUES IN MEMORY 
FOLLOWING THE A MATRIX VALUES. 


MCOUNT = 


INCREMENT MCOUNT AND DEFINE 
THE BEGINNING OF THE FIRST 
UNPROCESSED ROW AS BDIS. SET 
POINTER2 POINTING AT THE 
BEGINNING OF THE B MATRIX. 
SET PCOUNT = 0. 


GENERAL MATRIX FOR 
THREE-DIMENSIONAL SYSTEMS 


T1] T12 113 0 
12] r22 123 0 
T3] 132 133 0 
ty ty ty 1 


IMPLEMENTATION OF THE MATRIX 
MULTIPLICATION ALGORITHM 
FOR THE TMS32010 


The implementation of the algorithm for the TMS32010 
shown in Figure 6 assumes that the two matrices to be 
multiplied together are of size M x N and N x P. Three major 


INCREMENT PCOUNT. LOAD POINTER1 
WITH BDIS (FIRST UNPROCESSED ROW). 
SET NCOUNT = 0. CLEAR ACCUMULATOR. 


INCREMENT NCOUNT. MULTIPLY VALUE 
POINTED AT BY POINTER1 WITH 
VALUE POINTED AT BY POINTER2 
AND ACCUMULATE. INCREMENT 
BOTH POINTERS. 


YES 


Figure 6. TMS32010 Flowchart 
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INITIALIZATION 
INPUT M, N, AND P. 


READ B MATRIX 
INTO BLOCK B1. 


LAST ROW 
BEEN ENTERED 


YET? 


INPUT A ROW 
OF THE A MATRIX. 


CLEAR ACCUMULATOR 
AND P REGISTER. 


MULTIPLY THE ROW OF THE 
A MATRIX BY A COLUMN IN 
THE B MATRIX. 


OUTPUT RESULT. 


Figure 7. TMS32020 Flowchart 


loops are included to multiply the two matrices. The outside 
loop control is labeled MCOUNT since it controls which row 
in the A matrix is being referenced during the multiplication. 
The secondary loop control is labeled PCOUNT because it 
counts how many columns in the B matrix have been 
processed. The inside loop control is labeled NCOUNT since 
it controls the multiplication of the values in the A matrix 
with the values in the B matrix. 


IMPLEMENTATION OF THE MATRIX 
MULTIPLICATION ALGORITHM 
FOR THE TMS32020 


The implementation of the algorithm for the TMS32020 
is somewhat different since its advanced instruction set allows 
for a more efficient method of computing matrix 
multiplication. The TMS32020 version in Figure 7 also 
assumes that the two matrices to be multiplied are of size 
M XN and N xP. This program takes a row of the A matrix, 
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loads it into block BO of data memory, and then multiplies 
this row by all columns in the B matrix. The TMS32020 
continues this process until all the rows in the A matrix have 
been multiplied by all the columns in the B matrix. The 
TMS32020 version is similar to the TMS32010 in that the 
A matrix must be entered by rows and the B matrix by 
columns. This allows for a faster execution time. Figure 7 
shows the basic implementation of the matrix multiplication 
algorithm that the TMS32020 uses to multiply two matrices. 

Since the programs in the appendices treat the matrices 
differently, a memory map is included to help in 
understanding the two versions. Figure 8 shows how the 
matrices should look in memory after they have been entered. 
Note that for the TMS32020 version, the A matrix values 
reside in program memory since the CNFP (configure as 
program memory) instruction was implemented. Note also 
that only one row of the A matrix is in this block since the 
program enters one row at a time. 
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For the following matrices, 


A=} a a2] pe] bu biz bis 
a2] a2? b21 b22 b23 


the memory would be configured in this manner for the TMS32010 and TMS32020. 


TMS32020 
DATA MEMORY PROGRAM MEMORY 
LOCATION VALUE LOCATION VALUE 
(IN HEX) (IN HEX) 
>308 by 1 >FFOO aj 
>309 b21 >FFO1 ai2 
>30A b, 2 
>30B b22 
>30C by 3 
>30D b23 


Figure 8. Memory Maps 


TMS32010 
DATA MEMORY 
LOCATION VALUE 
(IN HEX) 

> OOF a4] 
>010 a1? 
>011 a21 
>012 a22 
>013 b+ 
>014 b4 
>015 b12 
>016 b92 
>017 b13 
>018 b23 

SUMMARY 


The TMS32010 and TMS32020 processors can be used 
to multiply large matrices efficiently. A brief review of 
matrix multiplication has been given to assist in the 
understanding of fundamental matrix multiplication. Three 
examples of graphics applications have been presented since 
these applications often require multiplication of matrices. 

The TMS320 family has the power and flexibility to 
cost-effectively implement a wide range of high-speed 
graphics, numerical analysis, digital signal processing, and 
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control applications. Since the TMS32010 and TMS32020 
combine the flexibility of a high-speed controller with the 
numerical capability of an array processor, a new approach 
to applications such as graphics can now be considered. 
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INTRODUCTION 


Interrupt (INT), reset (RS), and branch on I/O (BIO) 
are inputs to the TMS32010 microprocessor that are typically 
provided from asynchronous sources within a system. These 
three inputs are subject to certain considerations in addition 
to those relevant to signals that are synchronized to 
TMS32010 operation. Observing these considerations is 
important to insure reliable operation of these three input 
functions. 


INTERRUPT AND BIO INPUTS 


Interrupt (INT) and BIO are the two inputs on which 
synchronization is most important. 

The BIO input provides a convenient approach to 
implementing polled I/O on the TMS32010. BIO, unlike 
INT, is not latched internally, so the state of the BIO input 
must be maintained while the BIOZ instruction is executing. 
The previous state of BIO is irrelevant. When properly 
synchronized to CLKOUT, BIO is recognized by the 
TMS32010 on the next CLKOUT cycle. It should also be 
noted that the cycle during which BIO is sampled by the 
BIOZ instruction is the first of the two cycles that it takes 
BIOZ to execute. 

Several aspects of interrupt operation are important for 
proper implementation of the interrupt function within a 
system. These fall into three basic categories: 


1. Interrupt and input synchronization 
2. Internal interrupt control logic 
3. Interrupt programming. 


Interrupt and BIO Input Synchronization 

Synchronization of the interrupt and BIO inputs is 
absolutely critical in maintaining proper operation of these 
functions. This must be accomplished externally since the 
TMS32010 does not contain the necessary synchronizing 
logic. This requirement results from the fact that these inputs 
are sampled within the TMS32010 on the falling edge of 
CLKOUT. If the input level is changing at this point, the 
internal circuitry may receive an invalid logic level, causing 
unpredictable operation. Specifically, the input must be at 
a stable, valid logic level at least 50 ns before the falling 
edge of the CLKOUT signal, and must remain stable for at 
least one full CLKOUT cycle (200 ns for a 20-MHz 
TMS32010). These timing relationships are shown 
graphically in Figure 1. 

The issue of synchronization has been addressed at 
length in numerous publications,! so details of the subject 
are not presented here. It can be shown that adequate 
synchronization for the TMS32010 may be accomplished by 
simply passing the interrupt input through a sufficiently fast 
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jo} 200 ns MIN 


NOTE: Timing measurements are referenced to and from a low 
voltage of 0.8 V and a high voltage of 2.0 V. 


Figure 1. Input Timing Relationships 


positive edge-triggered flip-flop clocked by CLKOUT. The 
flip-flop used must have a clock-rising to output-valid delay 
of no more than 15 ns. Examples of circuits that accomplish 
input synchronization using a flip-flop are shown in Figure 2. 
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Figure 2. Sample Input Synchronization Circuits 


All further references to the interrupt or BIO functions 
in this application report assume that this synchronization 
has. been provided. These functions should never be 
implemented without synchronization; behavior of 
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unsynchronized inputs is unpredictable and not guaranteed. 
For example, the interrupt input signal travels through two 
different paths with unequal propagation delays. If 
asynchronous interrupts are allowed, interrupts may become 
disabled before an interrupt can be accepted. Since the 
pending interrupt is not accepted, the interrupt service routine 
is not executed. Therefore, probably no EINT instruction 
will be executed, since this instruction is unlikely to be 
included in normal program flow and interrupts will remain 
disabled unless the system is reset. 


Internal Interrupt Control Logic 

The interrupt input on the TMS32010 is both edge and 
level triggered, i.e., a low-going pulse on the interrupt in- 
put can generate an interrupt (provided the pulse is at least 
as long as one CLKOUT cycle). However, if the interrupt 
input goes low and remains low, the interrupt will remain 
pending until the input goes high and the interrupt is cleared 


Q 
INTERRUPT 


D 


INT 


= 0 interrupts enabled 
Q = 1 interrupts disabled 


SET 


INTERRUPT | | 
MASK INTERRUPT 
REGISTER* | PROCESSOR | 
Of} nT | | 
INTERRUPT | | 
REQUEST | | 
(IREQ) 
| INTERRUPT | 
ACTIVE 


(see Figure 3). This results in several distinct situations which 
can occur with respect to how interrupts are serviced. Three 
specific cases are presented for illustration. 

In the first case, shown in Figure 4, a low pulse oc- 
curs on the interrupt input. The falling edge of this pulse 
causes IFLAG to be set to one, resulting in IREQ and IACT 
going high (since INTM is high). Then, when the INT input 
returns to a one, IFLAG (and therefore, IREQ and IACT) 
remain high until the interrupt is accepted (according to the 
rules described in the next section). When the interrupt is 
accepted, the interrupt service routine is entered, and the 
IACK signal is pulsed, which disables the interrupt by set- 
ting the INTM bit. At the end of the interrupt service routine, 
interrupts are enabled using the EINT instruction, which 
clears the INTM bit. The important point in this case is that, 
once the interrupt input goes high and the IFLAG flip-flop 
is cleared, control returns directly to the mainline program 
following the interrupt service routine. 


INTERRUPT 
ACKNOWLEDGE 
(IACK) 


cLR a 


INTERNAL 


OO oes 


Figure 3. Internal Interrupt Control Logic 
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Figure 4. Pulse-Triggered Interrupt 


In the second case, INT goes low and stays low for The result of this is that, when interrupts are reenabled at 
an indefinite period (see Figure 5). Here IFLAG is set to _ the end of the interrupt service routine, the interrupt will be 
one, and IREQ and IACT go high as they did in the case accepted again, and the interrupt routine will be reentered 
of a pulsed interrupt. However, when the interrupt service without returning to the mainline program. This will con- 
routine is entered and the [ACK pulse is generated, IFLAG __ tinue until the interrupt is removed. 
is cleared, but IREQ remains high since INT is still low. 
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Figure 5. Level-Triggered Interrupt 
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The third case is a situation where, after a pulse has IACK pulse, another INT pulse may cause a second inter- 
occurred on INT and the normal interrupt processing has rupt to become pending. This interrupt will be serviced im- 
begun, another interrupt occurs before servicing of the first mediately following the return from the first interrupt ser- 
interrupt is complete (see Figure 6). It is evident from the vice routine (the mainline program will not be reentered un- 
timing diagram that, once the first interrupt is cleared by the til the second interrupt is serviced). 
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Figure 6. Multiple-Pulse Interrupts 
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The following general rule describes all of these situa- 
tions: An interrupt may be generated either by the falling 
edge of a low-going pulse or by a low level on the INT in- 
put. It should be noted that this is distinctly different from 
many other types of systems in which an actual edge is re- 
quired to produce an interrupt, and a level alone has no ef- 
fect. Level triggering is provided on the TMS32010 in order 
to allow multiple external devices to interrupt the processor 
and be handled in a polled fashion. 


Interrupt Programming 
From a software standpoint, operation of interrupts on 


the TMS32010 is quite straightforward and very similar to 
interrupts on most other microprocessors, with only two 
exceptions. 

Interrupt operation in general can be characterized by 
the following rules: 


1. If an interrupt occurs while interrupts are enabl- 
ed, it is accepted following completion of the in- 
struction currently executing, even if the current 
instruction is a multicycle instruction. 

2.  If.an interrupt occurs while interrupts are disabl- 
ed, the interrupt remains pending and will be ac- 
cepted as soon as interrupts are enabled, unless 
cleared in the meantime. —_ 

3. Pending interrupts are cleared by RS or by en- 
try into the interrupt service routine unless INT 
has remained low. 


Table I shows a summary of interrupt operation depen- 
ding upon the circumstances under which the interrupt oc- 
curred. It should be emphasized that, although it may not 
be immediately obvious, most of these cases are a direct con- 
sequence of standard interrupt processing. The two excep- 
tions to the general rules are those marked with an asterisk. 


As an example of standard interrupt processing, con- 
sider the situation in which interrupts are enabled, and an 
interrupt occurs during the execution of a DINT (disable in- 
terrupts) instruction. Even though interrupts are enabled, the 
general rules state that an interrupt will not be accepted 
until the execution of the current instruction is completed. 
In this case, since the completion of this instruction results 
in interrupts being disabled, the interrupt will not be accepted. 

Also, it should be noted that, even though some instruc- 
tions take more than one cycle to execute, interrupt accep- 
tance will always be delayed until the instruction has fully 
completed execution. 

The first of the two exceptions to the general rules is 
the situation in which interrupts are enabled and an inter- 
rupt occurs during an MPY (or an MPYK) instruction. In 
this case, acceptance of the interrupt is delayed one more 
instruction cycle, until after the instruction following the 
MPY(K) has been executed. The reason for this is that the 
contents of the P Register (the results of the multiply) are 
very difficult to recreate. Delaying acceptance of interrupts 
an additional cycle guarantees that the program may safely 
store the results of the operation before trapping into the 
interrupt service routine, which might altér the contents of 
the P Register. (Thus, the user would normally code an 
instruction which stores the P Register into the accumulator 
immediately following a multiply instruction if interrupts are 
present in the system). However, if the interrupt service 
routine does not require the use of the P Register, it is not 
necessary that its contents be stored. 

The second exception to the general rules occurs in the 
situation where interrupts are disabled (such as in an inter- 
rupt service routine), and an interrupt occurs during the ex- 
ecution of an EINT (enable interrupts) instruction. Here also, 
acceptance of the interrupt is delayed one additional instruc- 
tion cycle, until after the instruction following the EINT has 
been executed. The reason for this is that, if the as 


Table 1. Interrupt Operation in Software 


And Interrupts Are: 


if | : 
Disebled 


During DINT 


During EINT 
executed 


During MPY or MPYK 


The interrupt is ignored (but remains pending 
until cleared or an EINT is executed) 
The interrupt is accepted after the EINT is 


*The instruction following the MPY(K) is com- 
pleted and then the interrupt is accepted 


The interrupt is ignored (but remains pending 
until cleared or an EINT is executed) 

* The instruction following the EINT is com- 
pleted and then the interrupt is accepted 


(unless the instruction is a DINT) 
The interrupt is ignored (but remains pending 
until cleared or an EINT is executed) 


(unless the instruction is a DINT) 


During any other instruction 


"Exception to standard interrupt servicing 
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The interrupt is accepted after the instruction 
executes (even if the instruction is multicycle) 


The interrupt is ignored (but remains pending 
until cleared or an EINT is executed) 
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instruction normally following an EINT in an interrupt 


service routine is not allowed to execute before another in-- 


terrupt is accepted, a stack overflow can eventually result, 
destroying return linkage to the mainline program. 

In these two exceptions to the general rules, note that 
if the instruction following an EINT (or MPY or MPYK) 
instruction is a DINT instruction, the interrupt will not be 
accepted. 

The following sequence of instructions illustrates a 
combination of several of these concepts: 


DINT 

EINT 

MPY VALI 
MPY VAL2 
MPYK CONST 
DINT 


An interrupt occurring at any point during this sequence 
of instructions would never be accepted (at least during this 
sequence). It would, however, remain pending until cleared 
or accepted in a later section of code following an EINT. 
The reasons for this are as follows: if the interrupt occurred 
during the DINT, it would wait until the instruction had 
executed before being considered for acceptance. Since the 
completion of this instruction results in interrupts being 
disabled, it would not be accepted at this point. If the inter- 
rupt occurred during the EINT or was still pending at this 


time, interrupts would still be disabled, and the interrupt 
would be ignored. After the EINT had executed (because 
of the two exceptions to the general rules), all of the 
multiplies would be executed. Then finally, after the MPYK, 
the DINT would also be executed (since it follows a multip- 
ly instruction) and would disable interrupts again. 


RS INPUT 


RS serves as the master reset for the TMS32010. Asyn- 
chronous inputs may be used on RS since this input is pro- 
vided with an internal synchronizer circuit. However, if it 
is necessary to maintain synchronization of program execu- 
tion to the reset input, the 50-ns setup time between RS and 
CLKOUT must be observed. This may be required, for 
example, when multiple TMS32010s are used concurrently 
in a system. Figure 7 shows the timing of the RS input. 

It should be noted that reset should only be used to per- 
form system initialization-type functions (such as would be 
used after powerup). Reset cannot be used to perform hold 
or interrupt functions, since it does not preserve any linkage 
to the previous program environment. Also, the state of the 
internal logic (including data RAM, status and general- 
purpose registers, accumulator contents, etc.) 1s not predict- 
able after reset; once RS is driven low and then high, the 
results are simply that interrupts are disabled, the interrupt 
flag is cleared, and program execution is forced to location 
zero. 


Jo-e>}— 50 ns MIN 


—e] le 50 ns MIN 


¥s —\ , 


~-——— 1 us MIN 


NOTE: Timing measurements are referenced to and from a low voltage of 0.8 V and a high voltage of 2.0 V. 


Figure 7. RS Input Timing 


SUMMARY 


Adherence to the basic considerations described in this 
application report regarding these three inputs provides 
efficient and reliable system operation. Briefly restated, these 
considerations are as follows: 


1. The interrupt and BIO inputs must be synchronized. 
Interrupts are either level or edge triggered, and 
function according to standard interrupt processing, 
except for the special cases of the EINT, MPY, and 
MPYK instructions. 

3. On the RS input, no synchronization is required, 
except in cases where precise timing immediately 
following reset is necessary. 
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INTRODUCTION 


External ROM or RAM can be interfaced to the memory 
bus of the TMS32010 microprocessor in applications that 
require additional RAM or program memory. 

The purpose of this application report is to describe 
two basic low-cost methods for expanding the TMS32010’s 
memory configuration: 

1. Direct expansion, utilizing a standard memory 
cycle for memory access 

2. Extended memory interface, utilizing an address 
(latched using a standard memory cycle) that 
automatically increments or decrements after 
each access. 

Of the two methods, the first method is very efficient 
for program and small data memory expansion, whereas the 
second method is more useful for large data memory 
expansion. 

The design techniques presented here can easily be 
extended to encompass interface of other devices to the 
TMS32010. Each of the circuits discussed in this application 
report has been built and tested to verify its operation. 


RAM/ROM PROGRAM MEMORY 
EXPANSION 


For systems requiring additional program memory or 
smal] amounts of external data memory, the direct expansion 
circuit described in this section provides a straightforward 
approach. 

The TMS32010 program memory can be expanded 
beyond the 1.5K-word internal capacity in two ways: 

1. Implementing an additional 2.5K words 
externally (MC/\MP = 1), or 

2. Implementing the full 4K-word program memory 
space externally (MC/MP = 0). 

In either case, the memory is accessed in a single 
memory cycle and appears no different to the TMS32010 
than internal program memory. The circuit described in this 
section uses the full 4K-word program memory space 
implemented externally using MC/MP = 0. This 
configuration is useful in applications where, perhaps for cost 
reasons, using a masked ROM version of the TMS32010 is 
impractical. 


Design Considerations 

An important consideration in the design of the direct 
expansion circuit is the desire to minimize chip count in order 
to reduce cost. This is an important factor in digital signal 
processing (DSP) systems since their cost must compete with 
that of analog approaches. In this circuit, as little additional 
logic as possible is used without sacrificing performance. 
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The memories used in the circuit are chosen to provide 
minimum chip count using currently available devices. The 
circuit is configured with half of the address space 
implemented as RAM and the other half as PROM. The 
RAMs used are Advanced Micro Devices Am9128-70 
2K x 8 static NMOS RAMs, and the PROMs used are Texas 
Instruments TBP28S166 2K x 8 TTL PROMs. This memory 
configuration results ina minimum chip count and provides 
an even mix between RAM and PROM; however, other 
RAM/ROM mixes may be used. Note that if PROMs or 
ROMs are the only external memory required in the system, 
no additional logic is needed ‘since the interface to most of 
these devices allows direct connection to the TMS32010. 

Using RAM as program memory allows downloading 
into the address space from slower (possibly EPROM) 
memory or a host system if required, and also allows for 
communication between program and data memory spaces 
using the Table Read (TBLR) and Table Write (TBLW) 
instructions. If internal program ROM and external program 
RAM are required, the same external memory configuration 
may be used with MC/MP set to 1. Note that in this case, 
external RAM located at addresses coincident with those of 
the internal program ROM cannot be used without some 
modification of the address decoding scheme. 


Functional Description 

The direct memory expansion circuit, shown in Figure 
1, consists of the four memory chips, a set of address bus 
buffers, and logic that controls the address bus buffers and 
enables the memories’ three-state outputs. 

The 2K words of PROM are located at addresses 
0-7FF, preventing any conflict with I/O ports if present in 
the system, and RAM is located at addresses 800-FFF. 
PROM or RAM is selected on the basis of the most significant 
address line using a single inverter (U8:8,9). 


Circuit Operation 

Read operations are performed from the PROM/RAM 
memory space either during opcode or operand fetches or 
during TBLR instructions. Write operations occur to RAM 
only during TBLW instructions; write operations to PROM 
have no effect on the circuit. 

The TMS32010 initiates a read operation from either 
the RAM (U2,3) or the PROM (U4,5) by presenting an 
address and driving MEN low (see Figure 2). Since the 
control line to the address latches (buffered WE) is high, 
these latches (U6,7) are transparent, and the address is 
presented to the memories after a short propagation delay. 
After the memory-access time delay, data is available 
internally but is not driven out of the chip since the memories’ 
three-state outputs (controlled by RDEN) are not enabled. 
RDEN is generated so that the memories’ outputs are not 
enabled on a read operation until CLKOUT goes high. This 
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Figure 1. Direct Memory Expansion Circuit 
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ensures that with fast memories, if MEN occurs early, a data 
bus conflict will not occur when the read follows a write 
operation. MEN can be used directly to control the output 
drivers of the memories if it can be guaranteed that the 
memory will not drive the data bus immediately following 
a write operation or if buffers are used on the data bus. 
Buffers are not used in this design in order to minimize chip 
count. 

After CLKOUT goes high, the memory outputs are 
enabled. The TMS32010 processor’s 50-ns read data setup 
time to CLKOUT falling is met from CLKOUT rising by 
an 8-ns propagation delay through the 74AS00 NAND gate 


CLKOUT 


TMS32010 
ADDRESS 


i: arereee i. ae 
pant 
ON eset or ef 
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(U9) and the memories’ 40-ns maximum output turn-on delay 
(with a 100-ns high portion of CLKOUT). At the end of the 
cycle, the TMS32010 requires that read data be held at least 
until CLKOUT falls or MEN goes high, whichever occurs 
first. This is guaranteed since RDEN will not go high until 
at least one of these signals changes state. 

Write operations begin in a similar manner to read 
operations, with the exception that the WE signal is active 
instead of MEN (see Figure 3). Since the address buffers 
are controlled by WEB (buffered WE), which is high for 
the first half of the cycle, the buffers are transparent when 
the address becomes valid. Then, when WEB goes low, the 
address remains latched until after WE goes high. 


Figure 2. Memory Read Timing 
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Figure 3. Memory Write Timing 
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The address bus buffers provide the 5-ns address hold 
time required by the RAMs following WE going high. This 
hold time is provided by the propagation delay of WE 
resulting from the two 74ALS04s (U8, 1-4) and the buffer 
propagation delay. While it is not generally good design 
practice to rely on propagation delays for timing, in this case 
the technique can be used to eliminate the need for more 


cumbersome design approaches, such as delay lines, since - 


the minimum propagation delays for these ALS devices are 
specified. Note that the address bus buffers are not required 
if memories with O-ns write-address hold times (such as the 
INMOS 4K x 4 devices) are used. These devices are not used 
here because their organization does not suit the desired 
configuration of this design. 

The remaining memory bus timing requirements for 
write operations are also easily met by this circuit design. 
The RAMs require a 30-ns data setup time with respect to 
WE rising, of which the TMS32010 provides about 80 ns. 
The data hold time required with respect to WE going high 
is 5 ns, of which the TMS32010 provides a minimum of 
20 ns. 


EXTENDED MEMORY INTERFACE 


If large program memory expansion is required, bank 
switching techniques can be employed with the direct 
expansion scheme to allow greater program memory space, 
~ some of which can still be used for small segments of data. 
These segments of data, however, can only be accessed using 
TBLR/TBLW instructions. For this reason, the direct 
expansion scheme is quite inefficient when large amounts 
of external data storage are required. 

To implement large data memory expansion, the 
extended memory interface can be used. With this approach, 
memory can be accessed in two cycles once an address has 
been loaded, making this technique preferable to the direct 
memory expansion scheme for data storage. Note that the 
primary savings in cycles required to access the memory 
result from loading the address only once and having this 
address increment or decrement with each access. Thus, for 
the most efficient use of this memory, data should be stored 
sequentially to avoid having to reload an address for each 
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access. If data is not saved sequentially, four cycles are 
required for each access, making the direct expansion scheme 
the preferred approach. 

The extended memory interface is more efficient for 
data storage, but may be used to access instructions even 
though they cannot be executed directly. Instructions are 
accessed by using an IN instruction, followed by a TBLW 
instruction, thereby placing the instruction in program 
memory. Because the transfer requires a minimum of five 
cycles, this technique should only be used to store instructions 
that need to be accessed infrequently. This feature is useful, 
for example, for implementing downloads from slow 
memory or a host system. 

The extended memory expansion approach may also 
be used in conjunction with one of the direct memory 
expansion schemes to expand both program and data memory 
efficiently. 


Design Considerations 
A primary consideration of the extended memory 
expansion design is to implement an efficient interface to 
large amounts of data memory. The program interface to this 
memory uses the I/O ports. These ports are accessed in two 
cycles, whereas three cycles are required to access program 
memory via the TBLR/TBLW instructions. 
This interface is mapped into three port locations: 
1. Port 0, which receives the starting address for 
the memory access 
2. Port 1, which decrements the address following 
each access 
3. Port 2, which increments the address following 
each access. 


Functional Description 

The extended memory interface circuit, shown in 
Figure 4, contains the minimum amount of logic required 
to efficiently communicate with larger amounts of memory 
at relatively high speeds. Due to the nature of the interface, 
the devices used for the memory space are not required to 
be as fast as those used in the direct expansion circuit. The 
devices used are Synertek SY2128-1 4K x 4 NMOS static 
RAMs, chosen on the basis of their organization and 
availability. The RAM organization provides a 4K x 16 
memory space using only four chips. 
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Figure 4. Extended Memory Interface Circuit (page 1 of 2) 
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Figure 4. Extended Memory Interface Circuit (page 2 of 2) 


The address used to access these RAMs is derived from 
a 12-bit up/down counter, implemented using three 
74LS169A 4-bit counters (U6-8) cascaded together. An 
address is loaded into the counters, using an OUT instruction 
to port address 0. Then, with each access to port 1 or 2, this 
address is decremented or incremented, respectively. 

The logic controlling this interface consists of a 
74ALS138 decoder (U12), which decodes the three port 
addresses and some miscellaneous gating that generates 
strobe and enable signals (U9-11,13,14). 


Circuit Operation 

The memory in the extended memory interface circuit 

is accessed using three types of memory cycles: 
1. An address load cycle 
2. A read cycle to RAM 
3. A write cycle to RAM. 

Addresses are loaded by writing the desired address 
to port 0. In order to simplify the design, logic to allow 
reading of the address counters was not included. Therefore, 
port 0 should not be read or improper loading of the counters 
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will occur. Note that although reading port 0 may corrupt 
the counters’ contents, succeeding loads will function 
properly. 

After an address is loaded, each access to either port 
1 or 2 decrements or increments, respectively, the memory 
address after the completion of the cycle. Since the effective 
address for the next memory cycle becomes valid shortly after 
the end of the current cycle, the RAMs used can be quite 
slow. Their speed is limited only by the output enable time, 
which is generally significantly faster than the address access 
time. The memories used in this circuit must have an output 
enable time of no more than 42 ns, but their address access 
time can be as slow as 150 ns or more. For this reason, less 
expensive memories can be used. 

Figure 5 shows an address load operation. The address 
presented by the TMS32010 is decoded by the 74ALS138 
and some random logic, consisting of AND and OR gates 
and an inverter, to detect an access to port 0. This decode 
results in the LOAD and CNTEN signals going active (low). 
The LOAD signal indicates that this is a load operation, and 
CNTEN enables the counters. CLK, the clock signal to the 
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counters (in this case, derived from WE), goes high at the 
end of the cycle when both the LOAD and a CNTEN decodes 
are stable. The rising edge of CLK synchronously clocks the 
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Figure 5. Address Load Timing 
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address from the TMS32010 data lines into the counters. 
Shortly after the rising edge of CLK, the loaded address is 
available to the memories at the output of the counters. 
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In a read operation, as shown in Figure 6, the 
TMS32010 address is decoded to detect a port address in 
the same manner as in an address load operation. In this case, 
however, accessing port addresses 1 and 2 results in the 
CNTEN signal being active and the LOAD signal inactive. 
In addition, the U/D signal asserts the correct state of the 
up/down control input to the counters depending on whether 
the cycle is an incrementing or decrementing access. In an 
I/O read cycle from the TMS32010, the DEN signal is active, 
and the interface uses this signal both to enable the memories’ 


output buffers and to clock the counters at the end of the 
buffer access. Since inverted DEN is gated with CLKOUT 
to enable the memories’ output buffers, the buffers will not 
be enabled until CLKOUT goes high. As in the direct 
memory interface, this feature is included to avoid any bus 
conflicts that might occur between the TMS32010 and the 
memories following a write operation. Note that if the system 
reads other port addresses, DEN must be further gated to 
ensure that only the accessed port’s output buffers are 
enabled. 
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Figure 6. RAM Read Timing 
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Figure 7. RAM Write Timing 


The 35-ns output enable time of the SY2128-1 RAMs 
(U2-5), added to the delay due to the 74AS00 (U15), 
acceptably meets the 50-ns data setup time required by the 
TMS32010. At the end of the cycle, DEN going high causes 
CLK to go high which either increments or decrements the 
memory address contained in the counters depending on the 
state of U/D. Thus, the following access is made from the 
next sequential location. 

A write cycle, as shown in Figure 7, occurs in much 
the same manner as a read cycle. The TMS32010 address 
is decoded to activate CNTEN and produce the correct state 
of U/D. At the end of the cycle, when WE goes high, the 
CLK signal generated from WE strobes the data into the 
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memories and increments or decrements the address in the 
counters. 


SUMMARY 


Two basic low-cost methods for expanding the 
TMS32010’s memory configuration have been described in 
this application report. The direct memory expansion scheme 
provides program and small data memory expansion, and 
the extended memory interface provides large data memory 
expansion. The design techniques used in these interfaces 
may be extended to encompass interface of other devices to 
the TMS32010. 
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APPENDIX 


Pacific Microcircuits Ltd. in British Columbia, Canada, has introduced a peripheral chip to support the TMS32010. 
The preliminary specification is included in the appendix of this application report to facilitate minimum chip-count design 
in TMS32010-based systems. In addition to the electrical specification, a schematic for an audio-processor board and an 
application note for PD32HCO1 interrupt handling are provided. 

The PD32HC01 is a digital signal processor interface circuit intended for use in voice-band signal processing applications. 
This CMOS single chip offers an efficient interface between the TMS32010 and external RAM, ROM, and a serial codec 
(see Figure A-1). 

For further information on price, availability, and support, please refer to the list of Pacific Microcircuits Ltd. 
representatives on the last page of this appendix. 


/0 


TMS32010 


Figure A-1. Voice-Band Signal Processing Interface 


11. Interfacing External Memory to the TMS32010 327 


328 11. Interfacing External Memory to the TMS32010 


PD32HCO1 


CIFIC SIGNAL PROCESSOR INTERFACE 


MICROCIRCUITS LTD. for the T.1. TMS32010 


Preliminary - September 1985 


Features Description 

@ Single-chip solution to The PD32HC01 is a DIGITAL SIGNAL 
™S32010 interfacing PROCESSOR INTERFACE circuit, 

@ Serial Codec port intended for use in voice band signal 

e Serial Data comm. port processing circuits. It provides an 

@ 1/0 and Interrupt control optimized interface between the 

® Decoding for external RAM Texas Instruments TMS32010 
and ROM memory digital signal precessor and 

e@ 1/0 expansion interface external RAM, ROM, and Codec. 


@ 2400 Hz bit rate generator 
@ Low-power CMOS technology 


Applications Package Availability 
@ Voice coders/decoders @ 40 Lead DIL Ceramic (PD32HCO1C) 
@ Speech synthesis @ 40 Lead DIL Plastic (PD32HCO1E) 
@ Speech recognition @ 44Lead Surface Mounted Plastic 
@ Digital telephony (PD32HCO1P) 
@ Data communications 
@ Digital radio 


TYPICAL APPLICATION (FIG. 1) 
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FUNCTIONAL BLOCK DIAGRAM (FIG. 2) 
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Pin Name 
1-3 AO-A2 
4 INT 

5 BIO 

6 CLK 
7,8 WPS, RPS 
9-13 A7-All 
14 —— DRIVE 
15 SCAN 
16 RXD 
17. RXC 
18 TXD 
19 TXC 
20 Vs 
21. ~+BRATE 
22 + #CCLK 
23. +RCTD 
24 RODD 
25 ToD 
26-33 00-07 
34. — oS 
35  ROMEN 
36 RAMEN 
37 OE 
38 ODEN 
39 #@FEN 
4 Vo 


1/0 


Inputs 
Output 


Output 


PD32HCO1 Pin Description 
Description 


Address bus from processor. 

Interrupt request to processor. Responds to RXC, TXC, or Codec 
A/D interrupts. 

Polled output port bit to processor. Data source to be polled is 
specified in the Peripheral Status Register. 

4.128 MHz (nominal) clock, derived from processor clock. Drives 
the bit rate generator and Codec interface timing. 

Decoded {/0 port write and read pulses for |/0 expansion. 
Address bus from processor. 

Output bit controlled from the Peripheral Status Register. 


input bit selected from the Peripheral Status Register to appear on BIO. 


Serial data input. Must be stable on the rising edge of RXC. 
Selected from the Peripheral Status Register to appear on BIO. 

Serial data receive clock. Rising edge retimes RXD, and raises an 
RX clock interrupt. 

Serial data output. Programmed from the Peripheral Status 
Register. Edges of TXD are synchronized to the rising edge of TXC. 
Serial data transmit clock. Rising edge clocks out data onto TXD 
from the Peripheral Status Register, and raises a TX clock interrupt. 
Negative supply (ground). 


2400 Hz square wave (CLK / 1720-- mask programmable). 
2.064 MHz (nominal) Codec clock. 

Codec framing pulse for Codec synchronization. Codec A/D 
interrupt occurs 16 CLK cycles after RCTD goes high. 

Serial dats output to Codec. PCM data is shifted out on the rising 
edges of the first 6 CCLK cycles after the rising edge of RCTD. 

Serial data input from Codec. PCM data is sampled on the first & 
CCLK falling edges after the rising edge of RCTD. 

Dats bus to chip. 

Master reset to chip. A low on this input will reset the INT signal, 
and initalize the bit rate timer. This is a Schmitt trigger input. 

ROM enable output. This signal goes low during a valid read from 
memory locations >000 - >F7F (MEN low). 

RAM enable output. This signal goes low during a valid read or write 
to memory locations >FB0 - >FFF (MEN low or WE low). 

Write enable to chip. Goes low for 1/0 or RAM write operations. 
Data enable to chip. Goes low for |/0 read operations. 

Memory enable to chip. Goes low for ROM or RAM reads. 

Positive supply (+5 Volts). 
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Detailed Description 


The PD32HCO1 consists of 4 functional blocks: a 
memory and !/0 decoder; {/0, interrupt, and 
serial data port control; a serial Codec port; and a 
bit rate generater (see figures. 2 & 3). 


Memory and 1/0 Decoder 


The memory and !/0 decoder segments the 4K 
word address space of the TMS32010 into 3 
areas: a 3968 word ROM area inclusive of 
addresses >000 to >F7F; a 128 word RAM area 
inclusive of addresses >F80 to >FFF; and from 
addresses >XX0 to >XX7, an !/O expansion port, an 
1/0, interrupt, and serial data port control; and a 
serial Codec port. 


Memory Decoding 


The ROMEN signal is used for selecting external 
program ROM. It goes low during memory read or 
table read cycles (MEN low), and the processor 
address is less than >F80. 


The RAMEN signal is used for selecting external 
data RAM. It goes low during memory read, table 
read, or table write cycles (MEN or WE low), and 
the processor address is above >F7F. 


140 Expansion Port 


The RPS and WPS signals are used for 1/0 port 
expansion. RPS goes high during an !/0 read cycle 
from port 5 (DEN low). WPS goes high during an 
1/0 write cycle to port 5, or a table write cycle to 
address >XX5 (WE low). 


1/0, Interrupt, and Serial Data Port 


The Program Status Register (PSR) at port 
location 6 controls the DRIVE and TXD output 
signals; the INT output operation via the Codec 
A/D, TX clock, and RX clock interrupt mask bits; 
and selects inputs to be tested on BIO (interrupt 
flags; the SCAN input; or the retimed RXD input). 
The bit encoding of the PSR is shown below: 


bit 0: RXMSK , RX clock interrupt mask. 
bit 1: TXMSK , TX clock interrupt mask. 
bit 2: AOMSK , Codec interrupt mask. 


Writing 1's to these bits will mask interrupts from 
the respective sources, and/or clear posted 
interrupts. Writing O's will enable interrupts. By 
testing for the interrupting source on the BIO 
line, interrupt vectoring can be managed (see bits 
3,4,5 description). 


bits 3.4.5: BIO Source Select . These three 
bits select one of five input sources (interrupt 
flags or pin inputs) onto the BIO output: 


bit S bit 4 bit 3 Selected Source 


Codec A/D Int. status 
TX Clock Int. status 
RX Clock Int. status 
SCAN bit input 
Retimed RXD input 


- oo0o°0 
x——- OO 


x—-O— 2 


Whenever a posted interrupt is selected, BIO will 
go low. BIO will stay high if the selected 
interrupt is not posted. When the SCAN input or 
the retimed RXD input is selected, BIO follows the 
polarity of the respective signal. 


bit6: TXD , the serial data port transmit data 
bit. This signal is retimed by the rising edge of the 
TXC clock, and appears on the TXD output pin. 


bit 7: DRIVE , a general purpose output pin. 
Serial Codec Port 


The Serial Codec Port consists of 8-bit Transmit 
and Receive data registers, designed to directly 
interface to Motorola 14400 series PCM 
Monochips. 


The Transmit Register forms incoming serial data 
on TDD into 8-bit parallel PCM samples, while the 
Receive Register forms 8-bit parallel PCM data 
samples into serial data on ROD. The operation of 
these registers is controlled by the Codec Timing 
Generator, which also generates the CCLK, the 
RCTD, and the internal A/D interrupt signals. 


Data written to the Receive Register at !/0 port 
location 3 is inverted, and sent MSB first on the 
Pin. Data is shifted out on the 6 rising edges 
of CCLK following the rising edge of RCTD. To 
prevent Receive Register underflow, data must be 
available in the Receive Register within 248 CCLK 
cycles after an A/D interrupt (nominally 120 
usec). 


Serial PCM data on the TDD pin is inverted, and 
read into the Transmit Register, MSB first, at 1/0 
port location 3. Data is shifted in on the 8 falling 
edges of CCLK following the rising edge of RCTD . 
To prevent Transmit Register overflow, data must 
be read within 248 CCLK cycles after an A/D. 
interrupt (nominally 120 usec). 


Bit Rate Generator 


The BRATE output signal is nominally a 2400 Hz 
square wave, derived from the CLK input divided 
by 1720 (mask programmable). This signal may 
be used for bit rate generation, TXC or RXC 
clocking, or real-time interrupts. BRATE is 
reset whenever the RS signal ts low. 
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PD32HC41 DETAILED BLOCK DIAGRAM (FIG. 3) 


SERIAL DATA_IN 
+ Mt gee 
3 


= DRIVE 


TXD 


F8Q-FFF 
Q00-F 7F 


1/0 
DECODER 
CODEC RECE!TVE REGISTER 


(QUTPUT} 


CODEC TRANSMIT REGISTER 
CINPUT) 


2-064 MHZ (NOMINAL ) CCLK 


RCTD 


DIVIDE BY 1720 24a8 BPS (NOMINAL) 
(MASK PROGRAMMABLE ) 


(C) 1985 PACIFIC MICROCIRCUITS LTD- 


11. Interfacing External Memory to the TMS32010 333 


Address and Input/Output Map ! 


Address/ 
Port®  R/w2 Function 


>XXS Read? Codec Transmit Register. Valid for 248 CCLK cycles (nominally 120 usec) 

Port 3 after an A/D interrupt. This 8-bit register contains the inverted version of the 
digitized serial PCM signal appearing on TDD. This register can only be read 
by using a IN from port 3 instruction. 


>»XX3/ Write Codec Receive Register. Must be valid within 248 CCLK cycles (nominally 

Port 3 120 usec) after an A/D interrupt. Data written to this 6-bit register will be 
inverted and shifted out on the RDD pin. This register can be written using a 
OUT to port 3 instruction, or by using a TBLW to address >XX3 (address must 


be less than >F80). 
>XX5/ Read” Input Port Expansion. The RPS signal will pulse high whenever an IN from 
Port 5 port 5 instruction is executed. 
»XX5/ ss Write = Output Port Expansion. The WPS signal will pulse high whenever an OUT 
Port 5 to port 5, or TBLW to >XX5 instruction is executed (address must be less than 
>F80). 


»XX6/ = Write Program Status Register. This register is used to: Select 1/0 and 

Port 6 interrupt status bits onto the BIO pin; mask and reset interrupts; control the 
DRIVE pin; and to send data on TXD. This register can be written using an OUT 
to port 6 instruction, or by using a TBLW to address >XX6 (address must be 


less than >F80). 
>000- Read Program ROM space. Notice that the 1/0 space and the ROM space are mapped 
>77F to overlapping addresses, but are distinguished by the MEN and DEN signals. 
MEN will go low for valid instruction reads, while DEN will go low for 1/0 
reads. 


>780- R/W_Data/Program RAM space. For read cycles, MEN goes low; for write cycles 
>7FF WE goes low. 


Notes: 1. When using TBLW to perform Output, many aliases of the 1/0 port locations exist due to 
incomplete address decoding. To maintain compatibility with future products, it is 
recommended that addresses >000 to >007 be used. When using OUT instructions, the 
™S32010 always addresses >000 to >007, and the aliases are irrelevant. 


2. Some !/0 addresses are not used. To prevent data corruption, port locations 0, 1, 2, and 4 
should not be written with TBLW or OUT. 


3. The 1/0 read locations can only be accessed with a TMS32010 IN instruction. The actual 


address locations are shown, however, for applications using other than the TMS32010 
processor . 
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Recommended operating conditions @ Vcc = 0 Volts 


Parameter Symbol Min Iyn fox Units Notes 
Supply voltage Vpp 47% SO 52 #£V 

Input high voltage Vin 2 Vppt0.5 V 

Input low voltage Vin 0.5 08 V 

Output high current lon 3 mA 

Output low current lo 3 mA 

Operating Temperature Ta 0 70 Cc 


Electrical cherecteristics over recommended operating conditions 1 


Parameter Symbol fin Iyn? tex Units Test Condition 
High level output voltage Von 3.5 45 Vv lon = 1ma 

Low level output voltage Vo. 0.3 05 V Io = 2 ma 

RS hysteresis voltage Virys 200 mV 

Off-state leakage current Ioz oS 5 uA 

Input current lin oS 5 uA Ve5‘Vin‘Vpp 
Supply current 5 lpp 2 mA 

Input capacitance C, 10 pF @ 1 Mi; 

Output capacitance Co 10 pF all other pins 0 V 
CLK input frequency Fax 0 4.126 6.0 MHz 


Note: 1. See Fig. 4 for DUT test loads. 
2. Typical specifications are valid at T, = 25 °C, Vpp * 5.0 Volts. 


3. Ipp is a function of Vpn, clock frequency, and output loading. 


1888 OHMS 


25 VOLTS: Egy BUT ~~ 2.5 VOLTS ° 


= 


ALL OUTPUTS EXCEPT D@-07 D8-07 
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Timing Specifications over recommended operating conditions ' 


Name Description Min Typ2 Max — Units 
R (1) tape 1 ROMEN select time from addr. 28 nsec 
0 = (2) teopa Hy ROMEN deselect time from addr. 30 nsec 
M (3) trom 2 ROMEN select time from MEN 25 nsec 
(4) toon tio -—»«- ROMMEN deselect time from MEN 18 nsec 
(5) toapan 1 RAMEN select time from addr. 27 nsec 
(6) tama RAMEN deselect time from addr. 1 nsec 
R (7) tpamayo RAMEN select time from MEN 24 nsec 
A (8) tpamyo RAMEN deselect time from MEN 14 nsec 
M (9) toapay3 - RAMEN select time from addr. 27 nsec 
(10) taanyH3 RAMEN deselect time from addr. 15 nsec 
(11) toapeg.q RAMEN select time from WE 21 nsec 
(12) tpamyig RAMEN deselect time from WE 15 nsec 
(13) ton, Data read access time from addr. 33 nsec 
Rs (14) top, «Data read hold time from addr. 60 nsec 
E (15) topo Data read access time from DEN 29 nsec 
6 = (16) topig po -—=«-dDatta read hold time from DEN 56 nsec 
P17) tap ny RPS select time from address 30 nsec 
s (18) toory 4 RPS deselect time from address 32 nsec 
T 19) tap no RPS select time from DEN 26 nsec 
E (20) tania 2 RPS deselect time from DEN 26 nsec 
R (21) tacuw Address set-up time to WE 4 nsec 
(22) tata pw Address hold time from WE -5 nsec 
& (23) towsy Data write set-up time to WE -25 nsec 
(24) town p Data write hold time from WE 0 nsec 
' (25) two) WPS select time from address 31 nsec 
/ (26) tou 1 WPS deselect time from address 32 nsec 
0) (27) typi? WPS select time from WE 27 nsec 
(28) typi «WPS deselect time from WE 27 nsec 
S (29) tpypsgy «XD set-up time to RXC 20 3 nsec 
1 (30) taynsyp -—-«RXD hold time from RXC 222 nsec 
0 = (51) tryp TXD delay time from TXC 28 nsec 
Cc (32) tactp RCTD Delay time from CCLK 15 34 100 nsec 
0 = (33) trppsy TDD set-up time to CCLK 50 16 nsec 
D = (34) trpprp «ss TDD hold time from CCLK 20-14 nsec 
E (35) tenp RDD delay time from CCLK 19 350 nsec 
C (36) tant INT delay time from RCTD Note 3 


Note: 1. See Fig. 4 for DUT test loads. 
2. Typical specifications are valid at Ty = 25°C, Vpp =.5.0 Volts 


3. tapintimax) is 17% toy - 45 nsec (nominally 4.1 usec). 
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CIFIC es 


PD32HCO! Interrupt Handling 


To handle interrupts on the PD32HCO1, the techniques illustrated below may be used. Not all 
applications may require the full implementation. Details on TMS32010 interrupts may be obtained 
from the publication “TMS32010 User's Guide” .@ 1963, Texas instruments Ltd., Revision B, March 
1985. For Mu-law to linear and linear to Mu-law conversion routines, see “Companding Routines for 
the TMS32010 -- Digital Signal Processing Application Report ©, © 1964, Texas instruments Ltd. 


Program Stetus Register 
Set up an 8-bit Program Status Register (PSR) image in data RAM with the following definitions: 


Bit Mame Description 

b7 DRIVE General purpose output bit 

b6 TXD Serial data output 

bs \ 3 bit field for selecting BIO input. 

b4 -+-- 0O=A/D interrupt status, 1 = TX interrupt status, 

bs / 2 = RX interrupt status, 3 = SCAN input, 4 = RXD input. 

b2 \ interrupt mask/acknowledge bits (active high). Used to clear posted interrupts. 
bi &—-  »b2== A/D interrupt mask, b1 = TX interrupt mask, bO = RX interrupt mask. 


The status of the PD32HC01 is defined by writing the contents of the PSR to port 6 with an OUT or a 
TBLW instruction. In addition to defining the state of the DRIVE and TXD pins, the PSR is used to respond 
to and acknowledge interrupts from the RX clock, TX clock, and a Codec A/D conversion. The following 
is an example of TMS52010 software which implements interrupt vectoring with the A/D interrupt at 
highest priority, and TX and RX interrupts at lower priority. 


interrupt Handling Program 


" CONTEXT SAVE FOR INTERRUPT (INTRP stack and PSR on data page !) 


INT SST INTRP Save_machine_state: 
SACH INTRP+1,0 Save_Status; 
SACL  INTRP+2 Save_ACH; 
MPYK 1 Save_ACL; 
PAC Save_T; 


SACL INTRP+3 End. 


* VECTORED INTERRUPT ARBITRATION (BIO Mux always initially points to A/D Interrupt flag) 


. Vector: 
BIOZ ADDA IF A/D_interrupt THEN 
* AD_interrupt_service 
. END; 
LAC PSR,0 
ADD ONE,3 IF TX_interrupt THEN 
SACL PSR TX_interrupt_service 
OUT PSR,PA6 END; 
BIOZ TXDAT 


@ 1985 Pacific Microcircuits Ltd. 
1645 140th Street, White Rock, B.C., Canada V4A 4H1. (604) 536-1886 
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ONE,S 
PSR 
PSR,PA6 
RXDAT 


IF RX_Interrupt THEN 
RX_interrupt_service 
END; 


® INTERRUPT ERROR RECOVERY PROCEDURE (Should never need execution) 


IF No_interrupt THEN 
Initialize_PSR; 
Restore_machine_state; 
Return_from_interrupt; 

END; 

ENO. 


* INTERRUPT SERVICE ROUTINES (A/D, TX DATA, RX DATA) 


* ADDA: Clear A/D interrupt, process, restore machine state, and return. 


" TXDAT: Disable TX interrupt, process, restore machine state (and enable TX interrupt), and return. 
* RXDAT: Disable RX Interrupt, process, restore machine state (and enable RX interrupt), and return. 


ADDA LAC 
OR 
SACL 
OUT 
SUB 
SACL 
OUT 


TXDAT LACK 


PSR,PA6 


INTRP+ | 
INTRP+2 
INTRP+3 
INTRP 


02 
PSR 
PSR 
PSR,PA6 


AD_interrupt_service: 
Clear_AD_interrupt; { highest priority interrupts 


) 


{ should have fast clear to allow) 


{ reposting. 


(Body of interrupt service routine) 


Restore_machine_State; 
Return_from_interrupt; 
END. 


TX_interrupt_service: 
Disable_TX_interrupt; 
{ lower priority interrupts can 


} 


} 


{ be left disabled until interrupt } 


{ servicing is complete. 


(Body of interrupt service routine) 


Initialize_PSR; 

Restore_machine_state: 

Return_from_interrupt; 
END. 


® 1985 Pacific Microcircuits Ltd. 
1645 140th Street, White Rock, B.C., Canada V4A 4H!1. (604) 536-1886 
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} 


bd RX_interrupt_service: 


RXDAT LACK »>21 Disable_RX_interrupt_select_RXxD; 
OR PSR 
SACL PSR 
OUT PSR,PA6 
BIOZ XXX 


(Body of interrupt service routine) 


LACK >CO 

AND PSR Initialize_PSR; 

SACL PSR Restore_machine_state; 
OUT PSR,PA6 Return_from_Interrupt; 
ZALH = INTRP+1 END. 

ADDS = INTRP+2 

LT INTRP+3 

LST INTRP 

EINT 

RET 


A/D Interrupt Handling 
When responding to an A/D interrupt, the following TMS32010 code is appropriate: 


OUT SMPL,PA3 (output a sample) 
IN SMPL,PA3 (input a sample) 


(Remainder of Interrupt Service Routine) 


Note that if the software is written with a one sample “look-ahead”, almost 124 usec (one sample 
period) can elapse between the interrupt request and execution of the A/D interrupt service routine 
without loss of data. The above code should be executed early in the interrupt service routine to take 
maximum advantage of the hardware architecture. This feature is important when several foreground 
software routines are implemented, or when the A/D interrupt service routine execution time can 
occasionally exceed 124 usec. 


Note: The information contained in this document is for illustrative purposes only. 
No guarantee as to its suitability for end-use applications is implied. 


® 1985 Pacific Microcircuits Ltd. 
1645 140th Street, white Rock, B.C., Canada V4A 4H!1. (604) 556-1886 
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CIFIC 


MICROCIRCUITS LTD. 


Interface Chip Simplifies TMS532010 Based Voice-band Processing 
Vernon R. Little, Pacific Microcircuits Ltd, White Rock, B.C, Caneda 


The TMS32010 is well suited for voice band signal processing. Its powerful instruction set and simple 
architecture makes cost-effective digital signal processing a reality for modems, vocoders, smart 
telephones, and a host of other voice and data communications functions. 


Pacific Microcircuits Ltd. of White Rock, B.C., Canada has introduced a high-speed, low power CMOS 
interface chip, the PD32HCO!1, specifically tailored to interface the TMS32010 to an external Codec, snd 
RAM and ROM memory. With the addition of a few external components, a complete general purpose audio 
processor can be constructed without any logic ‘glue’. 


The PD32HCO1, besides having high-speed memory decoding and a serial Codec interface, has SCAN and 
DRIVE input and output pins, a serial date communications port for USRT emulation, a mask-programmabie 
2400 Hz bit-rate generator or real tifne clock, multiple source interrupt control, and decoding for expanding 
1/0 off chip. It is available in both 40 pin plastic and ceramic DIP and 44 pin surface-mount plastic 


packages. 
General Purpose Audio Processor 


The utility of the PD32HCO1 is illustrated in the schematic diagram entitled "GENERAL PURPOSE AUDIO 
PROCESSOR". This is a complete DSP function capable of implementing a fully functional LPC vocoder, voice 
band modem, intelligent telephone, or a telecom test set. 


In the schematic, the PD32HCO1 (U8) ties all of the system resources to the TMS32010 (U7). ROM (US, 
U6) is segmented as a 3968 word address space from address >000 to >F7F. An optional RAM (U1-U4) is 
segmented as s 128 word address space from >F8O to >FFF. This decoding can be arbitrarily modified to 
increase the RAM space at the expense of ROM (see detail on schematic diagram). This allows ‘soft’ 
programming by downloading program instructions into RAM by using a bootstrap loader in ROM. 
Downloading can occur either on the serial RS-232 interface, or, with additional hardware, the WPS and 
RPS signals on the PD32HCO! can be used to coordinate parallel data transfers with a host processor. 


The 8-bit serial Codec interface connects directly to a p-law or A-law PCM Codec (U12). and is accessible 
through parallel registers on the PD32HCO!. Analog circuitry (UI3 etc.), including pre-emphasis and 
de-emphasis filters, can be connected to the Codec as required. Also shown is a VU-meter circuit for level 
setting of incoming audio. 


Provision for a synchronous RS-232 port is provided. Using the on-chip interrupt control and 1/0 logic, the 
TMS32010 can emulate s USRT at 2400 bps. This is very useful for modem and vocoder applications, and it 
may also be used for program downloading into RAM. With minor modifications in hardware, and appropriate 
software, asynchronous RS-232 communications can also be emulated at speeds up to 1200 bps. EIA 
drivers and receivers (U10, U11) are required to interface the RS-232 signals to the PD32HC01. 


The SCAN and DRIVE pins are used for miscellaneous control functions. In the example given, SCAN is used 
to sense the off hook status of the handset, and DRIVE is buffered with a 74HC04 (U9), and used to activate 
a buzzer or lamp to indicate device operation or error states. 


For more information on the PD32HCO1 signal processor interface, and other members of the product 
family, please contact the Product Development group at Pacific Microcircuits Ltd. 


® 1985 Pacific Microcircuits Ltd. ; 
1645 140th Street, White Rock, B.C., Canada V4A 4H1. (604) 536-1886 
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INTRODUCTION 


The TMS32020 Digital Signal Processor has the power and 
flexibility to cost-effectively implement configurations that 
satisfy a wide range of system requirements. The large 
address space of the TMS32020 can be filled in those circuits 
that require external data or program memory. Peripheral 
devices can be interfaced to the TMS32020 to perform serial 
communication and analog signal acquisition. 

This application report suggests hardware design 
techniques for interfacing memory devices and peripherals 
to the TMS32020. Examples of PROM, EPROM, static 
RAM, and dynamic RAM circuits built around the 
TMS32020 are demonstrated, with consideration given to 
the timing requirements of the processor and external 
devices. A memory-mapped UART (Universal 
Asynchronous Receiver-Transmitter) interface for 
communication with a host computer is presented, as well 
as an interface to a combo-codec (coder-decoder + filter) 
device for analog signal acquisition. 

All circuits shown in this application report have been 
built and their operation verified at room temperature. Since 
the logic devices in these circuits have not been optimized 
as the most cost-effective, the designer may desire to make 
tradeoffs with respect to speed, cost, performance, and 
temperature. 


TMS32020 CONSIDERATIONS 


The TMS32020 has program, data, and I/O address 
spaces for interfacing with external memory and peripherals. 
-Memory and I/O devices are usually selected by using the 
TMS32020 PS, DS, or IS signals (program, data, or I/O 
select, respectively), combined with the STRB (strobe) 
signal. The signal (PS, DS, or 1S) used depends on the 
memory or I/O space to be addressed. Some of the read- 
only devices may be selected (enabled) using any of the PS, 
DS, or IS signals exclusively, whereas the read-write devices 
commonly use a PS, DS, or IS combined with the STRB for 
an enable signal. The STRB, combined with a select line and 
the TMS32020 R/W (read/write) signal, forms a convenient 
timing reference to control the write operation of the 
read/write devices. The read-only devices typically do not 
require the STRB and R/W combination. 

Memory and I/O devices must respond with the data 
within a maximum of 90 ns when they are selected with either 
PS, DS, or IS during the read cycle. (Refer to the TMS32020 
(20-MHz operation) timing diagram in the TMS32020 Data 
Sheet.) Consequently, the memory or I/O devices used 
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should have correspondingly fast access time. For slower 
devices, one or more TMS32020 wait states must be inserted 
for proper operation. 

When the PS, DS, or IS is combined with the STRB 
signal during the read cycle, the memory or I/O devices must 
respond within 50 ns of the STRB signal going low, due to 
the data-read setup time required by the TMS32020. During 
the write cycle, the TMS32020 provides the minimum of a 
55-ns data-write setup time to the memory or I/O devices 
(see the timing diagrams in the TMS32020 Data Sheet). Wait 
states also apply to the configuration using the STRB signal. 

Tradeoffs with respect to using faster memory and 
slower decode logic devices, and vice versa, may be made 
when designing a memory system around the TMS32020. 


MEMORIES 


Examples of four memory types, PROMs, EPROMs, 
static RAMs, and dynamic RAMs, are shown in interface 
to the TMS32020. The selection of which memory device 
to use in a particular application is determined by speed, cost, 
and functional requirements. If speed and maximum 
throughput are desired, the TMS32020 can run with zero 
wait states and perform memory accesses in a single machine 
cycle. The TMS32020 can access slower memories by 
inserting One or more wait states into the memory access 
operation by using the READY input signal. A circuit using 
each of the memory devices is described and illustrated in 
the following subsections of this application report. 


PROM 

When only fixed program memory is required and 
speed is a consideration, a PROM device may be chosen for 
memory interface. A Texas Instruments TBP28S166 PROM 
(2048 X 8) with a three-state output has been selected as an 
example interface to the TMS32020. The TBP28S166 has 
the maximum access time from address of 75 ns, which meets 
the TMS32020 timing requirements. A basic configuration 
showing this PROM interfaced to the TMS32020 as program 
memory is shown in Figure 1. 

Another configuration that shows the TBP28S166 
interface to the TMS32020 is shown in Figure 2. Here, more 
memory exists in the system, and the TBP28S166 is decoded 
and mapped into the program memory address space, starting 
at >4000. 

The timing diagram for the interface of the TBP28S 166 
to the TMS32020 is shown in Figure 3. No wait states are 
necessary in both the basic and decoded configurations. 
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Figure 1. Basic Interface of TBP28S166 to TMS32020 
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Figure 2. Decoded Interface of TBP28S166 to TMS32020 
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Figure 3. Interface Timing of TBP28S166 to TMS32020 
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EPROM and Wait-State Generator 

During the prototyping and development design stage, 
an EPROM may be selected as the memory device for 
interfacing to the TMS32020. The Texas Instruments 
TMS2764-35 EPROM (8192 X 8) has the access time from 
address of 350 ns. This does not directly meet the TMS32020 
maximum allowable read-data access time of 90 ns, so a 
series of wait states is used to delay the TMS32020. A wait- 
state generator circuit for providing up to two 200-ns wait 
states is shown in Figure 4. 

Point A in Figure 4 corresponds to one wait-state input 
and point B to two wait-state inputs. In the case of the 
TMS2764-35 EPROM with a 350-ns access time, two 200-ns 
wait states are needed for proper interfacing. Figure 5 shows 


1 kQ 


+5 V 


% 74ALS20 
1 


uv 
bel @ 
m 


% 74ALS114A 


FROM 7MS32020 
CLKOUT2 


FROM TMS32020 RS 


% 74ALS20 
9 


the TMS2764 decoded into the program memory address 
space 0000-1FFF. Due to the long EPROM turn-off time, 
the 74ALS244 buffers at the EPROM output prevent data 
bus conflict. 

Figure 6 shows the timing considerations for the 
interface of the TMS2764 to the TMS32020. 

If faster EPROMs are desired, TMS2764-25 (250-ns 
access time from address) EPROMs can be interfaced to the 
TMS32020. If the circuit shown in Figure 5 is used, the only 
change required is to connect the MEMSEL (memory select) 
signal to input A of the wait-state generator (see Figure 4). 
The TMS2764-25 operates with one wait state in this 
configuration. 


TO TMS32020 
READY 


tConnections to other devices in the system that require two wait states. (Inputs not used by other devices should be pulled up.) 
tConnections to other devices in the system that require one wait state. (Inputs not used by other devices should be pulled up.) 
§Connections to other devices in the system that require zero wait states. (Inputs not used by other devices should be pulled up.) 
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Figure 4. Wait-State Generator and Timing 
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Figure 5. Decoded Interface of TMS2764 to TMS32020 
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Figure 6. Interface Timing of TMS2764 to TMS32020 


Static RAM 

Static RAM as program memory is useful for program 
storage when a remote download capability exists. Static 
RAM as data memory is used when minimum design 
overhead and minimum chip count are desired. 

Figure 7 shows an example of INMOS IMS1421-50 
static RAMs (4096 X 4), interfaced to the TMS32020. These 
static RAMs are mapped into the program memory address 
space, starting at >6000. 

The read/write timing considerations for the interface 
of the IMS1421 to the TMS32020 are shown in Figure 8. 

During the memory-read cycle, the IMS1421-50 
responds with data within 40 ns from being selected. This 
is in the specified range of the TMS32020, and no wait states 
are required. 

During the memory-write cycle, the falling and rising 
edges of the MEMSEL signal form the beginning and end 
of the IMS1421-50 write cycle while the IMS1421 W (write) 
line is asserted early (early write). 

During the early-write cycle, the IMS1421 data-bus 
drivers never turn on. This helps prevent bus conflicts with 
the TMS32020. The data supplied-to the bus by the 
TMS32020 meets the timing parameters of the IMS1421-50, 
and no wait states are required. 

The use of slower memories (e.g., IMS1420-45) is 
possible, provided the designer places external buffers on 
the data bus between the memory and the TMS32020. Since 
the internal IMS1420 output buffers do not turn off until the 
W line goes low, the external buffers (with the appropriate 
enable logic) prevent data bus conflicts during the write 
operation. 

If CMOS RAMs are required, the IMS1423-35 devices 
can be used in the circuit shown in Figure 7. These devices 
operate faster, and their pinouts are identical to that of the 
IMS 1421-50. Slower CMOS RAMs (e.g., IMS1423-45) can 
also be used; however, the data bus buffers must be added 
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as outlined previously to prevent data bus buffers must be 
added as outlined previously to prevent data bus conflicts. 
The design of the IMS1421 (NMOS) or IMS1423 
(CMOS) interface as data memory is very similar to the one 
presented in Figure 7, and the data and program memory 
timing are identical. The only change needed is the use of 
the DS select line (instead of PS) in the decode logic. 
Static memories that do not meet the TMS32020 no- 
wait-state timing requirements can be interfaced, provided 
the necessary number of wait states is generated and used 
to drive the TMS32020 READY line. 


Dynamic RAM 

In systems where large inexpensive memory space is 
required, the use of dynamic RAM devices with the 
associated control circuitry may be justified. This section 
describes an interface between the Texas Instruments 
TMS4416-15 (16K X 4 dynamic RAM) and the TMS32020. 
The circuit shown in Figure 9 uses the TMS4500A Dynamic 
RAM Controller to supply the control functions for the 
dynamic RAM devices. The TMS4500A provides address 
generation, timing, access/refresh arbitration, and other 
functions to control a bank of TMS4416 dynamic RAMs. 

The timing diagram for the memory access and refresh 
cycles in the interface of the TMS4416 to the TMS32020 
is shown in Figure 10. 

During the regular access cycle, the TMS4500A RAS 
(row address strobe) and CAS (column address strobe) 
signals both become active, and one automatic TMS32020 
wait state is generated using the MSC (microstate complete) 
signal (see Figure 9). The generation of at least one 
TMS32020 wait state is mecessary because of the 
TMS4416-15 access time of 150 ns. The regular (i.e., 
uninterrupted by refresh) memory access takes two 
TMS32020 cycles (400 ns at 20-MHz operation). 
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Figure 7. Interface of IMS1421 to TMS32020 
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Figure 8. Interface Timing of IMS1421 to TMS32020 
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Figure 10. Interface Timing of <TMS4416 to TMS32020 


During the refresh cycle, which may be observed in 
Figure 10 at the point where RAS becomes low and CAS 
stays high, the TMS4500A performs memory access/refresh 
arbitration. The arbitration must be performed when the 
TMS32020 tries to access the dynamic RAM memory at the 
time the TMS4500A begins the refresh operation, as shown 
in Figure 10. In this case, the TMS32020 must be held for 
one or two additional wait states, until the refresh operation 
is completed. The READY signal to the TMS32020 is 
generated through a combination of three other signals. The 
MSC signal starts driving the TMS32020 READY line, the 
TMS4S00A RDY (ready) signal continues to drive it, and 
the output of the 74AS74 flip-flop (Q) extends it to the end 
of the required period, as shown in Figures 9 and 10. Such 
a circuit is required since the TMS4500A RDY signal alone 
is not of the proper duration for the TMS32020. After the 
memory refresh is performed, a regular access cycle takes 
place as previously described. 
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In an access/refresh arbitration case, it takes the 
TMS32020 three or four clock cycles to access the dynamic 
RAM, depending on the time relationship of the access and 
refresh requests. The worst-case RAM memory access timing 
is 800 ns at 20-MHz operation. This occurs when the requests 
for access and refresh take place nearly simultaneously. The 
access/refresh arbitration timing, shown in Figure 10, takes 
three TMS32020 clock cycles, i.e., 600 ns at 20-MHz 
operation (not the worst-case condition), since the request 
for access happens right after the end of the refresh cycle. 

If the TMS32020 continuously accesses the dynamic 
memory, it will be interrupted by a refresh cycle once out 
of ten thousand memory access cycles, i.e., 99.99 percent 


of the memory accesses will happen without interruption. 


This low interruption rate is based on the relationship between 
the TMS32020 dynamic-memory cycle length (400 ns) and 
the required TMS4416 refresh period (4 ms). 
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The transition in the TMS32020 READY line, at the 
end of the READY signal, is caused by the corresponding 
behavior of the MSC signal during the TMS32020 operation. 
The transition does not affect the circuit operation since the 
READY line is sampled on the rising edge of CLKOUT1, 
i.e., before the transition takes place. 

The interface logic between the TMS32020, 
TMS4500A, and TMS4416 devices, i.e., the gates and a flip- 
flop (see Figure 9), can be replaced with one Programmable 
Array Logic (PAL)? integrated circuit for reduced IC count. 
One such PAL that is well suited to this application is 
SN74PALI6R4A (see Appendix). 


PERIPHERALS 


Peripheral devices, such as the UART and combo- 
codec, can be interfaced to the TMS32020 to perform serial 
communication and analog. signal acquisition. 
Communication to a host computer can be provided by a 
memory- or I/O-mapped UART interface. A codec provides 
analog signal conversion for telecommunications and speech 
processing. 


UART 

The UART and TMS32020 configuration may be 
useful when a general-purpose TMS32020 program 
development system or upload/download capability from/to 
the TMS32020-based signal processing system is required. 

As an interface example of an asynchronous 
communication controller to the TMS32020, a General 
Instruments AY-3-1015D UART is memory-mapped into the 
TMS32020 data memory space. Figure 11 shows the circuit 
for interfacing the AY-3-1015D UART to the TMS32020. 
The transmitter is data memory-mapped into location C000 
and the receiver into location E000. 

The receiver/transmitter timing diagrams for the 
interface of the AY-3-1015D to the TMS32020 are shown 
in Figure 12. 

The three-state buffer of the UART receiver has 
received-data-enable (RDE) timing constraints that preclude 
its use with the TMS32020 (see the AY-3-1015D 
specifications sheet). The 74LS244 buffer placed on the 
UART’s receiver port is fast enough to perform the switching 
function on the TMS32020 data bus. 

The one-processor wait state, present during the 
transmitter operation, allows for positive data strobing into 
the UART’s transmitter data register as required by the 
AY-3-1015D. 

The UART’s receiver and transmitter status lines, DAV 
(data available) and TBMT (transmitter buffer empty), are 


TtPAL is a trademark of Monolithic Memories, Inc. 
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synchronized with the rest of the system and connected to 
either the interrupt or BIO (branch on I/O) pin of the 
TMS32020. This allows either interrupt or polled I/O 
techniques to be implemented and assures the synchronization 
of the UART device with the TMS32020 regarding the 
receiving and transmitting rates. For example, the 
TMS32020 does not try to read the UART’s receiver buffer 
at a rate greater than the rate at which the characters are 
coming in or write to the UART’s transmitter buffer at a 
rate greater than the maximum character transmission rate. 

The UART can equivalently be mapped into the I/O 
space of the TMS32020, since the memory and I/O cycles 
of the TMS32020 are identical. The only change required 
is the use of the IS (instead of the DS) line and of address 
lines AO-A3 to select an I/O port. 


Combo-Codec 

In some areas of telecommunications, speech 
processing, and other applications that require low-cost 
analog I/O devices, a combo-codec device may be useful. 
A codec consists of nonlinear A/D and D/A converters with 
all the associated filters and data-holding registers. 

The TMS32020 contains a serial port for 
communicating to serial devices, such as codecs. The speed 
and versatility of the TMS32020 allow it to compand 
(COMpress and exPAND) a PCM (Pulse Code Modulation) 
data stream, acquired by the codec, through the TMS32020 
execution of software conversion routines (see the application 
report, ‘Companding Routines for the TMS32010’). Figure 
13 shows an interface example of a Texas Instruments 
TCM2913 codec to the TMS32020 serial port. 

Figure 14 shows the TMS32020 serial port receive and 
transmit timing considerations in the interface of the 
TCM2913 to the TMS32020. 

In this configuration, the TCM2913 codec functions 
in the fixed data-rate mode (2.048 MHz) with p-law operation 
selected. All timing and synchronization signals are 
externally generated using independent oscillator and 
frequency-dividing hardware (an 8-bit counter such as a 
74AS867 may be used in place of two 74LS161 4-bit counters 
to minimize the chip count). Alternatively, the designer may 
decide to generate the timing signals from the TMS32020 
clock by subdividing its frequency. 

In some circuits, it may be necessary to wire an op- 
amp to the analog output of the codec. In such cases or if 
variable output gain is required, a gain-setting resistor 
network must be provided as specified in the TCM2913 
documentation. 

Other linear A/D and D/A converters may be interfaced 
to the TMS32020 through its parallel ports. 
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Figure 11. Interface of AY-3-1015D to TMS32020 
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Figure 12. Interface Timing of AY-3-1015D to TMS32020 
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Figure 13. Interface of TCM2913 to TMS32020 
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Figure 14. Interface Timing of TCM2913 to TMS32020 


SUMMARY 


The speed, performance, and flexibility of the 
TMS32020 allow it to cost-effectively implement 
configurations that satisfy a wide range of system 
requirements. This application report has described and 
demonstrated hardware design techniques for interfacing 
memory devices and peripherals to the TMS32020. 

Examples of PROM, EPROM, static RAM, and 
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dynamic RAM circuits built around the TMS32020 have been 
provided, with consideration given to the timing requirements 
of the processor and external devices. Interface examples of 
UART and codec chips to the TMS32020 are also presented. 

Table 1 summarizes the interface requirements of 
various memory devices to the TMS32020, as described in 
this report. The table also includes maximum affordable 
access times for each circuit configuration. 


12. Hardware Interfacing to the TMS32020 


Table 1. Memory Type and TMS32020 Interface Requirements 


MEMORY TYPE PART NUMBER 


ACCESS TIME 


TBP28S166 75 ns max 


NUMBER OF WAIT 
STATES REQUIRED 
BY THE TMS32020 


ACCESS TIME 
FOR CIRCUIT 
CONFIGURATION 


85 ns max 


(from address) 


EPROM ™MS2764-35 350 ns max 
(from address) 

Static RAM IMS1421-40 30 ns max 
(from select) 

Static RAM IMS1420-55 55 ns max 


(from select) 


Static RAM 
(CMOS) 


IMS1423-35 35 ns max 


(from select) 


Static RAM 
(CMOS) 


IMS1423-45 


45 ns max 
(from select) 


Dynamic RAM 


T™TMS4416-15 80 ns max 


(from CAS) 
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(from address) 


2 371 ns max 
(from address) 


0) 39.2 ns max 
(from select) 

@) 62 ns max 
(buffers reqd) (from select) 
0 39.2 ns max 


(from select) 


6) 
(buffers reqd) 


62 ns max 


(from select) 


1 170 ns max 


(from CAS) 
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APPENDIX 
PAL! DESIGN 


PAL Assembler Code 


PALI4SR4 FAL LES LGN OUCUIMENT 
QOQ009O-O000 REV # 2/02/85 BY LIANA 
SHIVA DRAM CONTROLLER 


CLE MSc DS RDY CLEGUTI STROBE RW ONL NCZ Nr 
GND /CLEL /READY /NIS /NC4 /O /NCS 7G 7W YC 
IF (VCC) 
REALY 


/MSC#/OS + 
/ROYS#/OS + 


tJ 


TF (VEC) 


CLE = CLEOUTI 

IF (VU) 

W = /STROBE#/0S#/RW 
IF (Vt) 

G = /DS#RW 

Tr (VEC) 

ts = /REY 


FUNCTION TABLE 


CLE MSC OS ROY CLEQUT1 STROBE RW 
GNU /CLE1 /READY /W /G /In 
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Sea ee ee SS i oie en + 
, a ; by ; 
on ae A H RM RM 1 
> LE ORR Rt Ok ME i 
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5; Ou M FAAS i: YPPALE C ' 
;>UTREATODE : NAATILULY YW: 
> TZ2SWMOOYY Ti OLLTECCE FAITH VECTOR 


1m etmee OD eImTD CONE MD COTS emmmD GONE SORES ERREG CHMED CEES OOETY Sim SYOED MINED OONTD COMED SHOOD GENED GLUED SOCCD eAIED HERS GOOD ettne Coree Senne GATED GEDS EEL eripte GIES Menee GENET sone SeLeY GOERS GOUED SHEED CODES cimue Gotu sunee Hitee sntnt atte mine” cena cones 
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PAL Fuse Map 


366 


DESCRIPTION: 
SHIVA DRAM CONTFRULLER 
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NUMBEK OF FUSES BLOWN = 340 
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PAL Fuse Map Summary 
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13. TMS32020 and MC68000 Interface 
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INTRODUCTION 


Certain functions in a computer system may be too time 
consuming for a single processor to perform. A high-speed 
numeric processor, such as the TMS32020 Digital Signal 
Processor, may serve as a coprocessor with a slower yet 
capable host in a computer system. For example, many 
graphics algorithms must be implemented on a numeric 
coprocessor to the host so that the host can perform system 
functions while the coprocessor computes the numeric- 
intensive algorithms. The TMS32020 is capable of 
performing numeric functions, such as a multiply- 
accumulate, in a single cycle (200 ns). Other 16-bit 
processors, such as the Motorola MC68000, cannot approach 
the computational speed of the TMS32020, but have other 
qualities such as ‘supervisor mode’ and ‘user mode’ which 
make them useful as host processors. 

This application report shows how the MC68000-10 
can be used as a host processor with the TMS32020 serving 
aS a numeric coprocessor to implement the numeric-intensive 
algorithms often required in computer systems. Applications 
for such a system include graphic workstations, speech 
processing, spectrum analysis, and other computational- 
intensive applications. 

The schematic in the appendix has been fully built and 
tested and has proven functional. 


SYSTEM CONFIGURATION 


In Figure 1, the basic block diagram for the interface 
of the MC68000 with the TMS32020 is shown. The 
MC68000 is interfaced to its own separate program memory 
(EPROM) and data memory. Although the TMS32020 is 
interfaced to its own external program memory (PROM), 
it shares its external data memory with the MC68000. The 
TMS32020 typically has access to this shared data memory; 
however, the MC68000 can access this memory by asserting 
the HOLD line on the TMS32020. In this event, the 
TMS32020 places all its buses in a high-impedence state and 


turns on the buffers between the MC68000 and the shared 
data memory. This configuration allows the MC68000 to give 
the TMS32020 instructions and data, and then release the 
TMS32020 to perform various functions. 


HARDWARE CONSIDERATIONS 


Acknowledging Hold 

After the MC68000 has written to the latch that puts 
the TMS32020 into the hold mode, the TMS32020 must 
communicate to the MC68000 that it is ready for the 
MC68000 to communicate with the shared data memory. The 
three methods of acknowledging hold to the MC68000 are 
as follows: 
1. The MC68000 waits until it knows the TMS32020 is 

held. 
2. The HOLD Acknowledge (HOLDA) signal causes an 
interrupt to the MC68000. 

3. The HOLDA signal writes to a memory-mapped latch. 

The first method is implemented in the schematic in 
the appendix. This method assumes that the MC68000 will 
allow enough time for the buffers to be turned on before 
trying to access this memory. For example, the MC68000 
could execute several NOP (No OPeration) instructions 
before attempting to access the shared data memory. 
Sometimes this method may not be sufficient. For example, 
if the TMS32020 is in the repeat mode, it does not recognize 
the HOLD assertion until it has finished the repeat 
instruction. This could cause a long unpredictable delay 
before the TMS32020 acknowledges the HOLD interrupt. 

The second method of communicating to the MC68000 
that the TMS32020 is in the hold mode is to interrupt the 
MC68000. To implement this, the HOLDA signal can be 
tied to one of the MC68000 interrupts, thus allowing the 
MC68000 to access the shared memory as fast as possible. 
Some method of communicating to the MC68000 as to which 
device caused the interrupt needs to be considered, since the 
MC68000 searches all external devices for the originator of 
the interrupt. 


SHARED 


DATA 


Figure 1. System Block Diagram of the TMS32020 and MC68000 Interface 
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The third method of communicating to the MC68000 
when it can access the shared memory is to allow the 
HOLDA to be read through a memory-mapped latch. Then, 
the MC68000 can poll this memory location until it 
recognizes a change, thus signifying that the TMS32020 has 
indeed been placed in the hold mode. 


Communicating with Shared Memory 

Once the TMS32020 has acknowledged the HOLD 
assertion, the three-state buffers (74LS241) are turned on 
to allow the MC68000 address bus and R/W line to become 
valid to the shared memory (IMS1421-40). These buffers 
are physically enabled by HOLDA, thus assuring that the 
TMS32020 has three-stated its memory bus. Once the address 
becomes valid, the transceivers (74LS245) are enabled so 
that the MC68000 data bus can access the shared memory. 
These buffers are enabled by the output of the decoder 
(74ALS138). By doing this, the MC68000 data bus accesses 
the shared-memory data bus only when MC68000 is trying 
to access the shared memory. This prevents data bus conflicts 
when the MC68000 accesses other memory while the 
TMS32020 is being held. After the communication path is 
enabled, the MC68000 can read and write to the shared 
memory. Figure 2 shows the timing when the MC68000 
writes to the shared memory. The Enable/Select (E/S) on 
the shared memory is enabled when the address and the 
Address Strobe (AS) on the MC68000 become valid. The 
rising edge of the AS causes E/S to rise, thus writing to the 
shared memory. 


MmC68000 
CLK 
A1-A23 ADDRESS VALID 


RW \ / 
DO-D15 DATA VALID 


Figure 2. MC68000 Write Cycle to Shared Memory 
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When the TMS32020 is not in the HOLD mode, it 
can communicate directly with the shared data memory. The 
three-state buffers and the transceivers between the MC68000 
and the shared data memory are turned off, and the link 
between the TMS32020 and the shared memory is direct. 
Figure 3 shows the timing when the TMS32020 writes to 
the shared memory. The E/S on the IMS1421-40 is enabled 
by Data Strobe (DS) and Strobe (STRB) becoming valid on 
the TMS32020. The rising edge of STRB causes E/S to rise, 
thus writing to the shared memory. 


TMS32020 
CLKOUT1 
a 5, ADDRESS VALID 


DO-D15 {ora van) VALID 


Figure 3. TMS32020 Write Cycle to Shared Memory 


LDS and UDS Considerations 

The schematic in the appendix is the expansion of the 
block diagram shown in Figure 1. In this schematic, the 
Upper Data Strobe (UDS) and Lower Data Strobe (LDS) 
signals on the MC68000 are not included, i.e., not connected. 
This method is sufficient if “word’-specified instructions are 
the only ones used on the MC68000. Many systems work 
more efficiently if other length specifications for some of 
the MC68000 instructions are used. Therefore, a decode 
scheme, such as in Figure 4, may be implemented. In this 
scheme, the MC68000 can read or write bytes or words to 
the Synertek RAMs (SY2128). For example, the MC68000 
may write to data bits DO-D7 and not affect the upper data 
bits by asserting LDS low and leaving UDS at a logic one. 
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FROM DECODER 
LDS 
DATA BUS 


SY2128 


SY2128 


Figure 4. LDS and UDS Scheme for Memory Access 


This is an automatic function of the MC68000 if ‘byte’ 
lengths are specified on certain instructions. 


SUMMARY 


The TMS32020 Digital Signal Processor is capable 
of performing numeric-intensive algorithms faster than other 
numeric coprocessors used in the past. In addition, the 
TMS32020 offers a minimal-chip, cost-effective solution to 
applications requiring a high-performance coprocessor. 
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This report shows how the TMS32020 can work with 
the MC68000 to serve as a numeric coprocessor. The 
interface shown in this report is a generic one and can be 
used with different host processors. A block diagram of the 
system configuration is included, as well as hardware 
considerations. The appendix contains a fully tested 
schematic of the design presented in this report. 
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APPENDIX 
Schematic of TMS32020 and MC68000 Interface 
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Figure A-1. Schematic of TMS32020 and MC68000 Interface (Sheet 1 of 3) 
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Figure A-1. Schematic of TMS32020 and MC68000 Interface (Sheet 2 of 3) 
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Figure A-1. Schematic of TMS32020 and MC68000 Interface (Sheet 3 of 3) 
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In the last few years, many DSP applications have been created using the TMS320 
processors. Although it is beyond the scope of this book to cover all of these applications 
in Part III, some typical examples have been selected in two areas: telecommunications 
and computer/peripherals. 


In the telecommunications area, applications are made possible by taking advantage 
of the TMS320 processors’ high-speed computational power, large on-chip memories, 
advanced architecture, and comprehensive instruction sets. Because of these and many 
other features, a high-performance, effective, single-chip solution can be created to solve 
telecommunications problems. Some applications in the area are echo cancellers, 
transcoders, DIMF encoders/decoders, and adaptive repeaters. 


The first application report in the telecommunications area describes some of the 
TMS32010 interface circuits frequently used in telecommunications applications. Some 
of these circuits are interfaces to a codec, a host processor, and external memory devices. 
A circuit is also included to perform PCM companding functions for time-critical 
telecommunications applications. The report concludes by suggesting system 
configurations for some telecommunications applications using these interface circuits. 
Some of these applications are ADPCM transcoding, hands-free telephone 
(speakerphone), three-way conferencing, 2/4 wire transformer connection, and DTMF 
detection. The next report covers both the theory and implementation of a single-chip 
TMS32020 digital voice echo canceller. The single-chip system can perform a 128-tap 
or 16-ms echo cancellation for telephone network applications. The echo canceller is 
implemented in accordance with the CCITT recommendation (G.165). A simulation 
has been performed to test the echo canceller, and the result exceeds the CCITT 
requirements. The third report in Part II examines the implementation of a data encryption 
method using the TMS32010. The encryption algorithm chosen for the report is the 
Data Encryption Standard (DES). The report details the DES algorithm and its TMS32010 
coding. Processor resource requirements are also provided for applying the DES to 
speech coding at different bit rates. For a 2.4-kbit/s LPC vocoder application, the DES 
only requires six percent of the TMS32010 CPU loading. The encryption scheme operates 
on a stream of bits that represent text, computer files, speech, or any other entity in 
binary form. The result is directly applicable to the design of any secure data/voice 
communications. 


The fourth report in Part III discusses 32-kbit/s Adaptive Differential Pulse Code 
Modulation (ADPCM) transcoders. A half-duplex ADPCM transcoder, which complies 
with the CCITT recommendation (G.721), can be achieved with a single TMS32010. 
If the transcoder is used only for private lines, a full-duplex non-CCITT ADPCM 
transcoder is more cost-effective and can be designed with a single TMS32010 processor. 
Both the CCITT and non-CCITT algorithms and code implementations are covered in 
the report. To date, the CCITT standard for the 16-kbit/s transcoder has not been 
established. One of the potential candidates for the 16-kbit/s transcoder is the subband 
coder. A reprinted article on the subject is included, which describes the theory, 
TMS32010 implementation requirements, and performance figures for the 16-kbit/s 
subband coder. The complete code for the coder is available for licensing from Atlanta 
Signal Processors Inc. (ASPI). Interested readers should contact ASPI for further 
information. Because of the programmability of the digital signal processor, the 
TMS32010 can also be programmed to handle Dual-Tone MultiFrequency (DTMF) 
encoding and decoding over telephone lines. For a system already performing digital 
signal processing functions using the TMS320, this DI'MF capability may be obtained 
at no additional hardware cost. A reprinted article from Electronic Design News, which 
details the DTMF implementation algorithm and provides TMS32010 program description 
and code, concludes this portion of Part III. 
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The computer/peripherals area is a vastly diversified field for DSP applications. It is 
further divided into three subareas: speech coding/recognition, image/graphics, and digital 
control. Applications in these areas are usually both computation- and data I/O-intensive. 
Multiprocessing capability, ease of host interface, and programmability are also required. 
These requirements have made the TMS320 processors attractive for 
computer/peripherals applications. In these applications, a general-purpose, 
TMS320-based digital signal processing system or subsystem is usually built for 
multitasking functions. For example, such a general-purpose system may be designed 
to perform a speech compression algorithm for voice store-and-forward applications 
and to transmit and receive data through the phone line when the system is reprogrammed 
with a modem algorithm. The TMS320-based system, capable of multitasking, proves 
to be more cost-/performance-effective than a system consisting of a group of dedicated 
devices. In addition, the programmability of the TMS320 allows users to update or 
upgrade at a later date. 


The first subarea in computer/peripherals comprises a series of reprinted articles and 
a technical report from MIT Lincoln Laboratory on speech vocoding and recognition 
using the TMS32010. Two vocoders at different data rates are discussed: the 2.4-kbit/s 
pitch-excited Linear Predictive Coding (LPC) and 9.6-kbit/s Adaptive Predictive Coding 
(APC). Vocoders at these rates are often used in computer voice-mail, voice store-and- 
forward, and low bit-rate speech communications applications. These two vocoders, 
combined with the 32-kbit/s ADPCM and the 16-kbit/s subband coder, provide the full 
spectrum of support by the TMS320 in the area of speech compression. The first article 
describes the implementation of a full-duplex 2.4-kbit/s LPC vocoder on a single 
TMS32010 device. Next, the reprinted report from MIT Lincoln Laboratory discusses 
the design considerations for constructing a compact, low-power, 9.6-kbit/s APC. The 
report also includes the TMS32010 source codes for the major functional steps in the 
APC algorithm. The third article discusses speech recognition, usually both computation- 
and data I/O-intensive. A reprinted article from Electronic Design examines the design 
of a general-purpose and high-performance speech recognition system using the 
TMS32010. Algorithms for speech recognition are still being evolved. New techniques 
for better feature extraction and recognition decision making are being studied. The 
TMS320 programmable digital signal processors are not only powerful enough to execute 
recognition algorithms available today, but also flexible enough to be updated with new 
recognition algorithms at a later date. 


The second subarea consists of an image/graphics application where the TMS32020 
serves as the graphics engine performing tasks, such as image scaling, translation, and 
rotation. Circuits are also shown for the interface between the TMS32020 and the Texas 
Instruments TMS34061 Video System Controller. The TMS32020-TMS34061 graphics 
system produces a high-speed, high-image-resolution, graphics capability. When 
integrating such a system into PC or workstations, the TMS320 reduces the host processor 
burden of performing intensive graphics data computations, thus allowing the host 
processor to concentrate on other important system and I/O tasks. 


The third subarea in computer/peripherals presents an application report on the design 
of a digital control system and its implementation with the TMS32010. Because of the 
increased availability and lower cost of suitable digital hardware, microprocessors such 
as the TMS32010 are increasingly being used to implement algorithms for the control 
of feedback systems. The report provides an example that uses the design of a digital 
compensator. Other control applications are possible using the TMS32010, such as 
computer disk control, laser print-head control, robotic control, automobile-engine system 
control, flight control, and autopilot systems. 


Code for the major DSP routines and applications in Part III is included in the DSP 
Software Library. 
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INTRODUCTION 


Signal processing has long been a tool used to improve 
performance in telecommunications systems. The advent of 
digital signal processing has given the telecommunications 
industry a powerful instrument to further enhance 
performance and reliability. Speech recognition, speech 
coding, speech synthesis, and high-speed modems are 
examples of applications now possible because of digital 
signal processing. Until recent years, real-time digital signal 
processing was solely the domain of minicomputers. The 
Texas Instruments TMS320 family offers low-cost VLSI 
implementations of digital signal processing functions. The 
TMS32010, the first-generation processor of the TMS320 
family, is a high-speed, 16/32-bit digital signal processing 
microprocessor/microcomputer, ideally suited for 
telecommunications applications. 

The purpose of this application report is to facilitate 
TMS32010 design by showing various telecommunications 
applications. The interfaces described in this report can be 
used in the following applications: 


1. Digital modem 

2. Echo cancellation 

3. Analog repeaters 

4. Handsfree telephone (speakerphone) 

5. Noise reduction 

6. Digital speech interpolation (DSI) 

7. Analog-switched network simulator 

8. Voice and data encryption 

9. Simultaneous voice and data transmission 
10. Speech coding and decoding, such as 


Linear Predictive Coding (LPC) 

11. ADPCM transcoding 

12. Voice mail 

13. DTMF encoding and decoding 

14. System identification. 

This application report consists of two major sections. 

The first section details various hardware building blocks 
required for system interfacing. This provides a simple means 
of constructing a TMS32010-based system for many 
telecommunications applications. Included are circuit 
diagrams and their functional descriptions. These standard 
telecommunications interfaces provide a realistic medium for 
implementing digital signal processing algorithms based on 
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‘the TMS32010. Standard serial and parallel interfaces permit 


connection to most digital Private Branch Exchange (PBX) 

backplanes and Time Division Multiplex (TDM) systems. 

Standard combo-codec filters provide analog interfaces that 

conform to CCITT recommendations. Description of 

hardware PCM companding for use with combo-codec filters 

is useful with time-critical algorithms. Microphone, 

loudspeaker, and host system interfaces are also included. 
The second section of the report examines the following 

TMS32010 applications based on the building blocks 

described in the first section: 

Standalone analog interface 

Telephony test-set interface 

PBX backplane interface 

2/4-wire transformer interface 

Three-way conference interface 

ADPCM interface 

DTMF detection interface. 


Se ae. 


HARDWARE INTERFACE BLOCKS AND 
CIRCUITRY 


The hardware circuit described in this section is divided 
into distinct modules or blocks. This modular approach 
allows flexibility in accommodating a wide range of 
telecommunications applications. These blocks can be placed 
together to fit a specific application. The following hardware 
blocks are detailed in the next sections of this application 
report: 

TMS32010 and support circuit 
Timing and control circuit 
PCM linearization circuit 
PBX/TDM interface circuit 
Analog interface circuit 

. Host interface circuit. 

The system block diagram, shown in Figure 1, 
illustrates how these blocks fit together around the 
TMS32010, the focal point of all activities. All blocks are 
directly connected to the TMS32010 data bus. The 
TMS32010 accesses all interfaces, except the program 
memory, using its I/O ports. Each interface is assigned to 
an I/O port. Port decode logic is used to decode the port 
address from the TMS32010 I/O space. 
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TMS32010 ADDRESS BUS 


2K PROGRAM PCM 


TMS32010 MEMORY LINEARIZATION 


PORT 
DECODE 


ANALOG 
INTERFACE 


HOST 
INTERFACE 


PBX/TDM 
INTERFACE 


TMS32010 DATA BUS 


Figure 1. System Block Diagram 


TMS32010 and Support Circuit 

The TMS32010 serves as the hardware engine that 
performs the DSP algorithms. It interfaces to its peripherals 
via its I/O ports. Support circuitry for the TMS32010 
includes memory and port decode logic and synchronization 
flip-flops. Table 1 indicates the TMS32010 port assignments 
for the application hardware constructed for this application 
report. 


Table 1. TMS32010 Port Assignments 


PORT ASSIGNMENT 


Analog interface 1 

Analog interface 2, 

Microphone and loudspeaker interface 
PBX/TDM interface 1 

PBX/TDM interface 2 

p/A-law to two’s-complement linear conversion 


Two’s-complement linear to »/A-law conversion * 


Host interface 
Not used 


*To perform conversions, a write to port 4 is required. 
The converted value is read at port 5. 


The TMS32010 can be reset by momentarily activating 
the reset switch, which provides a 6-ms pulse to the RS input 
to the TMS32010. The device has one hardware interrupt 
(INT) and one software interrupt (BIO). The BIO (branch 
on I/O) input to the TMS32010 is considered the low-priority 
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interrupt and is used for synchronizing to the 8-kHz framing 
pulse that drives the telecommunications interfaces (1.e., 
combo-codec filters and PBX/TDM interfaces). The INT 
input is considered the high-priority interrupt and is used for 
communicating with the host processor via the host interface 
(see Table 2 for interrupt assignments). The INT input 
becomes active when a command from the host is waiting 
to be processed. The reset and interrupt signals going to the 
TMS32010 are all synchronized. 


Table 2. TMS32010 Interrupt Assignments 


INTERRUPT ASSIGNMENT 


RS 


Reset. Vector to location >OOOH in program 
memory. Initialization of all pointers and internal 
registers recommended. 


INT Hardware interrupt. Vector to location >OO2H 
in program memory. Interrupt service routine for 


performing host command processing. 


BIO Software interrupt. Program in wait state when 
using the BIOZ instruction, waiting for 8-kHz 


synchronization signal. 


Figure 2 shows the TMS32010 and support circuitry. 
The TMS32010 (U1) has a 16-bit data bus that interfaces 
to all ports and program memory. All port decoding is 
accomplished with two SN74ALS138 decoders (U6 and U7), 
which provide the write and read port strobes, respectively. 
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Figure 2. TMS32010 and Support Circuit 


The reset and interrupts are synchronized to the TMS32010 
with an SN74ALS174 D-type flip-flop (U4). The TMS32010 
12-bit address bus is used for addressing program memory 
and the individual ports. This address bus drives the 
TBP28S166 program memory PROMs (U2 and U3) and the 
port decode logic (U6 and U7). The PROMs are mapped 
into the lower 2K address space (>O00H to >7FFH). 
Address bit All is used for decoding the lower and upper 
2K program memory spaces. 


Timing and Control Circuit 

The timing and control circuit supplies all the required 
clocking, timing, and control signals to operate the 
TMS32010 and interfaces. A 20.48-MHz clock is provided 
for the TMS32010. Derived from this clock are all the timing 
signals to operate the combo-codec filters, such as the 
128-kHz clock for shifting data and the 8-kHz framing pulse 
for synchronization purposes (see Figure 3). External frame 
synchronization is permitted with an 8-kHz framing pulse 


input. Note that the internal clock always operates 
asynchronously with the external system clock. 

All timing and control signals are derived from a 
20.48-MHz clock source (see Figure 4). The inverters (U12) 
are configured as an oscillator with a 20.48-MHz crystal and 
10-kohm resistor. A decode counter (U8) divides the 
20.48-MHz clock by 10 to produce a 2.048-MHz clock signal 
required by the combo-codec filters. A dual 4-bit binary 
counter (U9) divides the 2.048-MHz clock by 16 to produce 
a 128-kHz clock signal used by the combo-codec filters and 
related support circuitry to shift in/out serial data. U9 also 
divides the 128-kHz clock by 16 to produce an 8-kHz framing 
strobe (FS) used in the serial/parallel data conversion process 
and as a synchronization signal to the TMS32010. U10 and 
U11 are used to synchronize counters U8 and U9 to an 
external asynchronous 8-kHz framing pulse (FP). This 
permits synchronization to an external system, such as a 
PBX, on a frame-by-frame basis. 


~—_—_—_——_——_—___125 | 


FS (8 kHz) | | 
DCLK (128 kHz) PULL 
FP ] ] 


PCM IN/OUT 
(FOR COMBO-CODEC) 


MS 


67 [26/25 | 4 a3 | 82] 61/80) 
B LSB 


Figure 3. Clocking and Timing Diagram 
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Figure 4. Timing and Control Circuit 


PCM Linearization Circuit 

In digital telecommunications systems, speech is 
transmitted in an 8-bit nonlinear PCM code. p-law and 
A-law are the two most popular PCM coding schemes. For 
linear digital signal processing, the PCM code must be 
converted to two’s-complement linear codes for processing 
by the TMS32010. After processing, the two’s-complement 
linear code must be converted to the nonlinear PCM code. 
This conversion process can be performed in software on 
the TMS32010. A TMS32010 source code for the conversion 
process is provided in the application report, “‘Companding 
Routines for the TMS32010.’’! In certain cases, there may 
not be enough time in an 8-kHz framing period to accomplish 
this conversion process. 

For time-critical applications, hardware linearization 
is used to provide a faster conversion process. The conversion 
process is accomplished by using table lookup ROMs. To 
convert from one code to another, a value is written to port 
4, followed by a read from port 4 or 5. Port 4 is read to 
convert from a nonlinear code value to a two’s-complement 
linear value. Port 5 is read to convert a two’s-complement 
linear value to a nonlinear code value. The TMS32010 
performs a port write followed by a port read in 800 ns when 
driven by a 20-MHz clock source. 

The linearization. circuit uses a lookup table to perform 
the conversion process (see Figure 5). U18 converts 14-bit 
two’s-complement linear PCM to 8-bit p/A-law PCM. U19 
converts »/A-law 8-bit PCM to 14-bit two’s-complement 
linear PCM. Data to be converted is clocked into two 
SN74ALS574 octal D-type edge-triggered flip-flops 
(U13 and U14) when the TMS32010 writes to port 4. U13 
and U14 contain the least- and most-significant bytes of data, 
respectively. »/A-law data is buffered to the data bus via U16. 
Data is enabled onto the data bus when the TMS32010 reads 
port 5. Note that only the least-significant byte is active on 
the data bus. Linear data is buffered to the data bus via U15 
and U17, which contain the most- and least-significant bytes 
of data, repectively. Data is enabled onto the data bus when 
the TMS32010 reads port 4. The most-significant byte of 
data is clocked into U15 on the rising edge of the MEN output 
from the TMS32010. 

FORTRAN programs that compute p-law PCM to 
14-bit two’s-complement linear PCM and vice versa are 
provided in the appendix. This computation generates the 
object file (in the INTEL data record format) for purposes 
of burning-in PROMs and EPROMS. A lookup ROM can 
also be programmed to generate patterns for test signal 
generator and DTMF encoder applications. 


PBX/TDM Interface Circuit 


The PBX/TDM interface permits connection to serial 
and parallel digital PBX backplanes and TDM systems. An 
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example of a TDM system is a D1 channel bank that provides 
T1 carrier digital transmission on two-wire trunks between 
central offices. Each trunk provides 24 8-bit channels 
multiplexed in time. Each channel is intended to carry 8-bit 
companded PCM voice. The application hardware does not 
support any signalling functions in a T1 carrier connection. 

Two PBX/TDM interfaces are provided in the 
application circuit to accommodate separate connections to 
receive (RX) and transmit (TX) paths. The hardware, as 
shown in Figure 6, is configured to interface onto an 8-bit 
parallel bus or serial PCM highway. The parallel bus 
interface provides three-state I/O, and the serial interface 
provides open-collector or three-state I/O. Selection between 
parallel or serial interface on the application circuit is 
hardware-selectable via a jumper. Each PBX/TDM interface 
occupies an I/O port on the TMS32010 I/O port map, as 
shown in Table 1. These interfaces are assigned to ports 2 
and 3. The TMS32010 is synchronized to these ports via an 
8-kHz framing pulse on its BIO input. 

The two interfaces in this application circuit operate 
identically; therefore, only one interface, as shown in Figure 
6, is described. The serial data input (SERIAL PCM IN) is 
converted to 8-bit parallel data via shift register U23. All 
timing and control signals are provided by the external system 
interface. SYSCLK is the external synchronous clock that 
drives U23. ITSEN (input time-slot enable) is synchronous 
with SYSCLK and clocks the 8-bit parallel data from U23 
into U20 at an 8-kHz rate. Parallel data P7-PO is clocked 
into U21 with the ITSEN signal at an 8-kHz rate. U20 and 
U21 are the storage registers for serial and parallel PCM 
data, repectively. Access to either of these registers by the 
TMS32010 is controlled on the application circuit via a 
jumper. The jumper routes the read port 2 signal from the 
TMS32010 to either U20 or U21, thereby selecting the serial 
or parallel data inputs. The PCM data output from the 
TMS32010 is clocked into U22 when a write to port 2 occurs. 
The parallel data in U22 is buffered to the system data bus 
via U26. U26 is an SN74LS244 line driver that provides 
high-current drivers to the external system PCM data bus. 
Data is enabled onto the external system PCM data bus when 
OTSEN (output time-slot enable) is active low. 

For serial interfacing, the OTSEN signal loads the 
parallel data at U22 into shift register U24. Data is shifted 
to the PCM highway using the external system clock 
SYSCLK. The serial input pin (pin 10) is connected to the 
serial output pin (pin 9) on U24 to permit broadcasting the 
same PCM data on two or more consecutive time-slots on 
the PCM highway. OTSEN enables the data onto the PCM 
highway by controlling buffer U25. U25 drives either a three- 
state or open collector/drain PCM highway. U18 provides 
the logic which configures a three-state driver on U25 as an 
open-collector/drain driver. 
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Figure 5. PCM Linearization Circuit 
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Figure 6. PBX/TDM Interface Circuit 
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Analog Interface Circuit 

Two analog interfaces are provided to access receive 
(RX) and transmit (TX) analog paths. These interfaces 
conform to CCITT recommendations through the use of 
industry-standard combo-codec filters. »/A-law operation can 
be hardware-selected via a jumper on the application circuit, 
as shown in Figure 7. One of the interfaces also contains 
microphone and loudspeaker amplifiers, which can be used 
in a handsfree speakerphone application. The gain of both 
amplifiers is adjustable using potentiometers. The overall 
gain through the TMS32010 and combo-codec filters is fixed 
at 3 dB. Each analog interface occupies an I/O port on the 
TMS32010 I/O port map (see Table 1). These interfaces are 
assigned to ports 0 and 1. The analog interface containing 
the microphone and loudspeaker amplifiers is at port 1. The 
application hardware sets the electret condenser microphone 
current at 1 mA. The loudspeaker amplifier can drive a 
standard 8-ohm loudspeaker. 

The transmission signal reference points are governed 
by the combo-codec filters.2 These reference points for 
unbalanced line interfacing are summarized as follows: 


TCM2913 


MCLK 
“YA 
DCLK 7 MIC AMPLIFIER 
14) 1 MQ POT 
9 5 ko j 
FS Ra (NOTE 1) 
~V 
oo 
+ VA. ~_15 
JUMPER 
-VA 1 
+VA 20 a ae 
16 
a 10 
ANALOG 
GROUND ¢ 
DGND 
100 ka 
4 , } A , 0.01 |-VA 
VIN + Vin- Vout+ VouT- l 
ANALOG ANALOG 
BALANCED BALANCED 
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NOTES: 1. Electret MIC current set at 1 mA. 
2. Analog supply: 
+Va = +5V 
-Va =-5V 


INPUT: 0 dBmO 1.064-V RMS 1020-Hz sine 
wave between Vin + and 
analog ground; Vyjj+ not 


connected. 


OUTPUT: 0 dBmO = _1.503-V RMS 1020-Hz sine 
wave between VoyuT+ or 


VouT— and ground. 


Referring to Figure 7, the TCM2913 (U27) combo- 
codec filters? contain A/D and D/A converters and filters 
required for analog interfacing. A jumper is used to select 
either p/law or A-law operation on the combo-codec filters 
by controlling the voltage level of ASEL (pin 15) on U27. 
A +5 voltage selects p-law, and a —5 voltage selects A- 
law. Analog and digital grounds should be kept completely 
separate to avoid introducing digital switching noise in U27 
and ground loops. The analog input to U27 is configured 
as a unity gain difference amplifier. U32 is the microphone 
amplifier that is configured as an inverting amplifier. The 
amplifier gain is controlled by a 1-Mohm potentiometer. The 
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Figure 7. Analog Interface Circuit 
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amplifier input is designed to interface to an electret 
condenser microphone. The microphone current is set at 1 
mA via a 5-kohm resistor connected to the analog +5 V 
supply. The DC voltage component is removed by 
capacitively coupling the amplifier. The 0.05-nF capacitor 
sets the highpass filter corner frequency at roughly 120 Hz. 
U32 is connected to U27, and the input gain at U27 is set 
at unity. The output transmission level at U27 is set at a O-dB 
loss by strapping the PWR and GSR pins together.2 

The loudspeaker amplifier is capacitively coupled to 
the U42 output, and fixed with a gain of 26 dB. However, 
a 100-kohm potentiometer attenuates the input, thereby acting 
as a volume control. The loudspeaker amplifier can drive 
an 8-ohm load. PCM data is clocked out of U27 at the PCM 
output pin (PCM OUT) at a rate of 128 kbit/s. U28 is a shift 
register that converts the serial data from U27 to 8-bit parallel 
data clocked into register U30 at an 8-kHz rate. The 
TMS32010 accesses register U30 when a read to port 1 


SN74ALS574 


= 


occurs, which enables 8-bit data onto the least-significant byte 
of the 16-bit data bus. When data is being sent to the 
interface, the TMS32010 writes to port 1, which clocks 8-bit 
data into register U31. This data resides on the least- 
significant byte of the 16-bit data bus. Data at U31 is loaded 
into shift register U29 at an 8-kHz rate. This data is shifted 
out in serial format at a 128-kbit/s rate to the PCM input 
pin (PCM IN) of U27. The 128-kHz clock signal used for 
shifting serial data and the 8-kHz loading signal are provided 
by the system timing and control circuit as DCLK and FS,2 
respectively. 


Host Interface Circuit 

The TMS32010 can be externally controlled (if desired) 
by a host processor via the host interface. The host interface 
contains three registers: an acknowledgement register, 
ACKR (U34); a command register, COMR (U35); and a 
status register, SR (U36). Figure 8 shows the host interface 


HO7-HDO (HOST DATA BUS) 


RDACK (READ ACKNOWLEDGE) 


WRCOM (WRITE COMMAND) 


RDOSTAT (READ STATUS) 


Figure 8. Host Interface Circuit 
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circuit. The host writes commands to the COMR, which can 
be read by the TMS32010. The commands are completely 
defined by the user. The TMS32010 writes to the ACKR, 
which can be read by the host. The COMR and ACKR are 
assigned to port 6 in the TMS32010 I/O map. The read and 
write strobes from the TMS32010 and the host control the 
bits in the SR that are used for handshaking purposes with 
the host. Only the host can read the contents of the status 
register. For PBX applications, the host is located within the 
PBX. 

Handshaking signals are provided to control data 
transfers across the host interface. The host can access these 
signals by reading a two-bit SR, which resides on the two 
least-signficant bits of the host 8-bit data bus. The SR should 
reside in the host I/O port map or memory map. When 
reading the SR, the host must mask out the six most- 
significant bits since they are unknown. When a system reset 
occurs, the SR is set to 3H (xxxxxx11). When the host writes 


a command to the COMR, the SR is set to 1H (xxxxxx01), 
and the TMS32010 is interrupted. The TMS32010 then reads 
the command value in the COMR. This sets the SR to 3H. 

When the TMS32010 completes processing the 
command, it responds to the host by writing an 
acknowledgement value to the ACKR, which sets the SR to 
2H. The host reads the SR to determine if the TMS32010 
has processed the command. If the command has been 
processed, the host reads the acknowledgement value in the 
ACKR. This sets the SR to 3H, which indicates that the 
command transaction has been satisfactorily completed by 
the TMS32010 and the host. The acknowledgement value 
can be dummy values or data being retrieved from the 
TMS32010. If the TMS32010 has not completed processing 
the command, the SR will read 1H or 3H. If the SR stays 
at 1H or 3H for an abnormally long period of time, a possible 
fault or problem is indicated. Table 3 shows the possible SR 
values and their interpretation. 


Table 3. Host Interface Status Register Interpretation 


* “HOST” refers to the command transaction state. ‘‘ 
is completed, or that a host command has not been sent. ‘‘B’’ indicates that a host command 
has been sent and that a command transaction is in progress. 
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HOST* SR COMSEND | COMACK DESCRIPTION 
VALUE (bit 1) (bit 0) 


These conditions should not occur. Potential 
problems are indicated. 


Idle command processing state. 
Not allowed. Potential problem is indicated. 


The TMS32010 has not yet started 
processing the host command. Potential 
problem is indicated. 


The TMS32010 has completed processing 
the host command. Data is waiting for the 
host. 


The TMS32010 has started, but not 
completed, processing the host command. 


A”’ indicates that the command transaction 
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.The TMS32010 can process commands from the host 
in the following suggested manner: The command value 
written by the host, with an offset added to it, becomes the 
command vector or subroutine for the TMS32010. This is 
accomplished in software with a sequence of actions using 
the TMS32010. The interrupt service routine reads the host 
command value at port 6 (COMR) and stores it in internal 
data RAM (using the IN instruction). This value is then stored 
in the accumulator (using the LAC instruction). Using the 
AND instruction, the command value is masked to zero the 
24 most-significant bits, and the new value is stored in 
memory. Masking is required because the eight most- 
significant bits on the TMS32010 data bus are unknown when 
reading the host interface port. A minimum offset of 4 is 
added to the masked command value in the accumulator 
(using the ADDS or ADD instruction). The offset is added 
to the command value to ensure that the resulting command 
vector does not overlap the reset and interrupt vector spaces. 
The next step is to vector to the subroutine via the CALA 
instruction. The CALA instruction uses the value in the 
accumulator as the subroutine vector. This forces the 
TMS32010 to begin executing a subroutine located at the 
command vector value in program memory space. Shown 
below is the source code sequence for the TMS32010. 


ISR IN x,6 ; read host command in COMR 
LAC x,0 ; load host command in ACC 


AND _ y _ ; mask host command 
ADD z ; add an offset to host command 
CALA ; call subroutine 


Note that ISR (interrupt service routine) is a label. 
During the TMS32010 initialization routine, the values 
>FFH and >4 are stored in locations y and x in data 
memory, respectively. 

The TMS32010 communicates with the host on a byte 
data-transfer basis. The least-significant byte of the host and 
TMS32010 data bus is used. When the host issues a 
command, it writes one byte of data to COMR (U35), as 
shown in Figure 8. WRCOM (write command) is the write 
strobe from the host and is active low. Both flip-flops in U36 
represent the SR. WRCOM sets a flip-flop in U36, thereby 
setting COMSEND (command send) low and interrupting 
the TMS32010. COMSEND is bit 1 in the SR. During the 
TMS32010’s interrupt service routine, the TMS32010 reads 
host commands at port 6, which enables data from U35 onto 
the data bus. The port 6 read strobe sets COMSEND high, 
indicating that the command is being processed. When the 
TMS32010 responds to the command, it writes one byte into 
the ACKR U34 at port 6. This sets the second flip-flop in 
U36, thus forcing COMACK (command acknowledge) low. 
This indicates command processing is completed. COMACK 
is bit O in the SR. The host reads the ACKR (U34) by 
enabling data onto its bus using the RDACK (read 
acknowledge) strobe, which is active low and sets COMACK 
high. The host can read the SR (U36) with strobe RDSTAT 
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(read status register), which is active low. U36 sets 
COMACK and COMSEND high when the TMS32010 is 
reset. A timing diagram illustrating the host transaction is 
shown in Figure 9. 


INTERFACING APPLICATIONS 


In this second section of the report, several 
telecommunications applications are discussed. First, an 
analog interface hardware application is examined, which 
is useful in self-contained standalone designs, such as analog 
repeaters or analog PBX designs. Then, a telephony test-set 
application is shown. This is useful as a self-contained 
standalone test tool or as a diagnostic function within a PBX. 
Finally, digital PBX applications in the following areas are 
described: PBX backplane interface, 2/4 wire transformer 
interface, three-way conference interface, ADPCM interface, 
and DTMF detection interface. It is important to note that 
the telecommunications designer can combine several 
applications using one TMS32010 to perform multiple 
functions. 

These applications utilize the interface blocks 
previously described and the TMS32010 for performing 
digital signal processing algorithms. However, the scope of 
this report is not to discuss the DSP algorithms, but rather 
to demonstrate how these applications can be made possible 
using the telecommunications interface circuits. Application 
reports are available, which implement digital signal 
processing algorithms for telecommunications applications 
using the TMS320 family.3,4.5 


Standalone Analog Interface 

The standalone analog interface is useful where remote 
or independent operation is required. Some examples of 
standalone applications are adaptive repeaters and handsfree 
telephones (speakerphones), as shown in Figure 10. 

The standalone analog interface contains elements from 
the following hardware blocks previously described: 

1. TMS32010 and support circuit, 
2. Timing and control circuit, and 
3. Analog interface circuit. 

Figure 10 illustrates the hardware configuration for a 
handsfree telephone application. The TMS32010 performs 
adaptive filtering and voice-switching functions. The analog 
interface circuit provides two combo-codec filters to interface 
to the analog telephone line, loudspeaker, and microphone. 
The combo-codec filters perform filtering of the analog I/O, 
and A/D and D/A conversion. The serial data streams to and 
from the combo-codec filters are converted to parallel format 
that permits interfacing to the TMS32010’s parallel I/O. The 
TMS32010 easily interfaces to combo-codec filters using 
8-bit shift registers and octal D-type flip-flop registers. Both 
combo-codec filters share one set of shift registers at the serial 
interface. The timing and control circuit provides all the 
signals required to synchronize and control the combo-codec 
filters, registers, and TMS32010. 
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Figure 9. Host Transaction Timing Diagram 
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Figure 10. Handsfree Telephone System Diagram 
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Telephony Test-Set Interface 

A telephony test-set is a useful tool for testing analog 
interfaces in telecommunications. It can also be used in a 
PBX design where analog line testing diagnostics are 
required. The application shown in Figure 11 is a standalone 
system. 

A test-set application uses the following hardware 
blocks: 
TMS32010 and support circuit, 
Timing and control circuit, 
Analog interface circuit, 
Host interface circuit, and 
PCM linearization circuit (optional). 

Balanced inputs and outputs are required in a test-set 
application. The TMS32010 can be programmed to measure 
noise, distortion, signal levels, etc., and serve as a network 
simulator for analog telephony circuits. The electrical 
performance of the combo-codec filters is a limiting factor 
on measurements and simulations that can be performed. 
Figure 11 shows a possible telephony test-set system 
configuration. All measurements are under control of a host 
processor. The host interfaces to a control input, such as a 
keyboard or human interface I/O, and a display, such as a 
CRT. Measurement parameters can be modified via the 
control interface, and the actual measured values displayed. 
All transfers between the host and application circuit are 
performed via the host interface. 

A possible test-set function may be a sine-wave signal 
generator. The TMS32010 produces. a digital sine wave, 
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which is converted to the analog domain via the combo-codec 
filters in the analog interface circuit. TMS32010 source code 
for a precision sine-wave generator is given in the application 
report, ‘‘Precision Digital Sine-Wave Generation with the 
TMS32010.’’6 Note that this algorithm produces linear 
samples that must be converted to p-law or A-law PCM 
samples. TMS32010 source code for performing this 
conversion can be found in the application report, 
‘‘Companding Routines for the TMS32010.’’! Hardware 
conversion is also possible using the PCM linearization 
circuit, described in the previous major section of this report. 


PBX Backplane Interface 

A backplane is an interconnect system with its 
mechanical, electrical, pin assignments, and interaction 
protocol rigidly defined. Mechanically, a backplane is a 
printed circuit board with connectors that reside at the rear 
of a card-cage, providing electrical interconnection between 
cards plugged into these connectors. Architecturally, a 
backplane has a set of protocols that control data transfers 
between interconnected cards. This application report 
concerns itself only with some architectural aspects of 
possible PBX backplanes. 

Two types of backplanes are used in most PBX designs: 
serial and parallel. The serial backplane is the most popular. 
Interfacing with standard codecs is simple, because they 
contain serial input and output ports. In some modern PBXs, 
a parallel backplane is used, which eases hardware interfacing 
since standard byte-wide logic can be used. In either system, 
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Figure 11. Telephony Test-Set System Diagram 
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this application circuit can serve as a signal processing 
module to perform specialized functions, such as DTMF 
detection, echo cancellation, noise reduction, and 
conferencing. The host processor determines which PCM 
channels are assigned to the signal processing module. 

A typical PBX backplane contains one or more serial 
PCM highways or an 8/16-bit PCM bus, system address and 
data buses, and system control lines (see Figure 12). The 
system address bus is used by the host processor to access 
all the peripherals or modules connected to the backplane. 
The data transfers occur across the data bus between the host 
processor and modules. The system control lines are typically 
read and write strobes used by the host processor to control 
these data transfers. The system clock and 8-kHz framing 
pulse are used for synchronizing access to the PCM highway 
or bus. In most systems, time-slot enable control lines are 
‘provided to access individual PCM channels. The time-slot 
enable control lines are provided by either the backplane with 
a control card using time-slot decode circuitry or card- 
resident time-slot decode circuitry. The application circuit 
does not provide any time-slot decoding; however, it provides 
the flexibility to interface to most systems through the 
external signals ITSEN and OTSEN. 


PARALLEL PCM BUS OR 
SERIAL PCM HIGHWAY 


Figure 12. Typical Backplane Configuration 


In Figure 13, the framing pulse is used to indicate the 
start of a frame. The framing pulse, usually one system clock 
cycle wide, is synchronous with the system clock and 
synchronizes all peripherals connected to the PCM highway 
or bus. Many PCM systems are based on a 2.048-MHz 
system clock, which provides 32 channels on a serial PCM 
highway or 256 channels on an 8-bit parallel PCM bus. 

The backplane application hardware requires the 
following blocks: 

1. TMS32010 and support circuit, 
2. Timing and control circuit, 
3. Two PBX/TDM interface circuits for use 
with receive and transmit channels, 
4. Host interface circuit, and 
PCM linearization circuit for critical real- 
time applications (optional). 
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Each PBX/TDM interface contains a serial input port, 
serial open-drain/collector, three-state output ports, and an 
8-bit wide three-state parallel output port. These ports directly 
connect to the PCM bus or highway on the backplane. An 
external device or circuit is required to provide the time-slot 
enables ITSEN and OTSEN (see Figure 14). The host 
determines which channels reside in time-slots on the PCM 
backplane, and writes information to a time-slot controller 
residing in the peripheral or module connected to the 
backplane. The controller then furnishes the input and output 
time-slot enables ITSEN and OTSEN, respectively. ITSEN 
assigns to the application circuit the PCM data in the 
corresponding time-slot on the PCM backplane. OTSEN 
enables PCM data from the application circuit onto the 
appropriate time-slot on the PCM backplane. The application 
hardware permits broadcasting PCM data onto several time- 
slots on the PCM backplane. 


2/4-Wire Transformer Interface 

The 2/4-wire transformer is useful for interfacing 
directly to 2-wire trunks or telephone lines. The transformer 
application requires the following hardware blocks: 

1. TMS32010 and support circuit, and 
2. Analog interface circuit. 

Figure 15 shows how a 2/4-wire transformer is 
connected to the analog interface circuit. VouT + drives a 
load resistor (RL) in series with the transformer. This series 
arrangement produces a 6-dB. loss between Vout + and the 
telephone line. The load resistor balances the transmission 
line. The input is arranged so that the output signal reflected 
back at the transformer to input Vyj + is minimized. This 
is accomplished by subtracting VoyT+ using the Vij — 
input in a difference amplifier configuration. The return loss 
will be a function of matching RL to the telephone-line 
impedance. A mismatch between the load RL and the 
telephone line will worsen the return loss. The transmission- 
signal reference points were described in the analog interface 
circuit section. Information on 2/4-wire analog interfacing 
that conforms to FCC Rules Part 68 can be found in 


- reference [7]. 


Three-Way Conference Interface 
A popular PBX feature is three-way conferencing. In 
analog PBXs, conferencing is easily accomplished with op- 
amps in a summing configuration. The problem is not trivial 
in digital PBXs. »/A-law encoded PCM samples cannot be 
directly summed together with standard adders (i.e., 
SN74LS283s) because of the nonlinear encoding scheme 
used. These PCM values must first be converted to linear 
values, summed, then converted back to p/A-law PCM. This 
function is possible with the TMS32010. As mentioned 
previously, the PCM samples can be linearized in software! 
or in hardware. 
For a three-way conference, the application hardware 
(see Figure 16) contains the following modules: 
1. TMS32010 and support circuit, 
2. Timing and control circuit, 
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Figure 14. Typical Backplane Interface 


NOTES: 1. R, balances telephone line. 
2. 6-dB loss from VoyrT and to telephone line. 
3. 6-dB gain from telephone line to GSX. 


Figure 15. 2/4-Wire 


3. Three PBX/TDM interface circuits, and 
4. PCM linearization circuit (optional). 
Three-way conferencing is the simplest method of 
conferencing; however, some system considerations are 
required. Return loss and noise performance are most 
important factors in a conferencing design. Return loss refers 
to the amount of attenuation of a reflected signal: the larger 
the return loss, the smaller the reflected signal. Reflections 
occur where there are 2/4-wire conversions in the network. 
In long network paths, such as satellite links, these reflections 
manifest themselves as echoes. (The application report, 
‘‘Digital Voice Echo Canceller with a TMS32020,’’5 
describes the implementation of an echo canceller using the 
TMS32020, the second-generation digital signal processor 
of the TMS320 family.) Return loss affects the ‘‘singing’’ 
margin and echo performance of the conference. ‘‘Singing’’ 
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LINE 
INTERFACE 
CIRCUIT 


Transformer Connection 


refers to an oscillatory state in a closed-loop system that 
occurs when excessive gain is inserted in the loop. Summing 
of two or more parties into a conference decreases the return 
loss, thereby increasing the chances of ‘‘singing’’ and 
degrading echo performance. In a three-way conference, the 
simplest method to reduce the effects of return loss is to insert 
transmission loss into the receive (RX) or transmit (TX) 
paths. 

In the three-way conference node shown in Figure 17, 
each party is connected only to the other two parties, not 
to itself. Normally, the return loss is dominated by the party 
with the worst reflections. In a worst-case condition, if these 
two parties have similar reflection characteristics, the return 
loss decreases by 6 dB. To offset this, the transmission levels 
at summing junctions are decreased by 6 dB, i.e., a 6-dB 
loss insertion at TX IN or RX OUT. The loss insertion 
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Figure 17. Three-Way Conference Configuration 
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slightly degrades noise performance. Noise performance 
refers to the subjective transmission quality of band-limited 
voice vs. band-limited noise. The noise performance of a 
conference, where all parties are summed together, always 
degrades to the party with the worst signal-to-noise ratio. 
In conferences where a large number of parties are connected 
together, voice-switching is used to limit the number of 
talkers summed together at one time, thereby limiting echo 
and noise performance degradation. Stringing together 
several three-way conferences to make a larger conference 
is not recommended, because noise performance will be 
substantially reduced due to compounded 6-dB insertion 
losses. The removal of this loss would likely degrade the 
echo performance of the conference and increase the chances 
of ‘‘singing’’. 

The TMS32010 source code in Figure 18 can be used 
for a three-way conference. PCM linearization can be 
performed in hardware or software. The user can insert the 
appropriate linearization code in the space provided. Memory 
locations TA, TB, and TC correspond to the TX IN inputs 
at ports X, Y, and Z, respectively. Memory locations RA, 
RB, and RC correspond to the RX OUT outputs at port X, 
Y, and Z, respectively. The PBX supervisor maps the 
TMS32010 ports to their appropriate time-slots on the 
backplane. The 6-dB insertion loss is accomplished by a 
15-position right-shift on the TX IN samples when loading 
the accumulator. This results in an effective single-position 
left-shift when using the upper 16-bits of the accumulator. 
Note that in the following code, WAIT is a label. 
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WAIT BIOZ WAIT ; wait for 8-kHz interrupt 
IN TA,X 5; input ports to memory 
IN TB,Y 
IN TC,Z 
* 
* Insert mu-/A-law to two’s-complement linear 
* conversion code here for ports X, Y, and Z. 
LAC TA,15 ; sum ports X and Y for 


- ; port Z 

ADD _ TB,15 

SACH RC,O 

LAC 1B,15  ; sum ports Y and Z for 
* ; port X 

ADD TC,15 

SACH_  RA,O 

LAC TA,15 ; sum ports X and Z for 
* ; port Y 

ADD TC,15 

SACH_ RB,O 


* 
* Insert two’s-complement linear to mu-/A-law 


* conversion code here for ports X, Y, and Z. 
OUT RA,X _ ; output memory locations 


= ; to ports 
OUT RB,Y 
OUT RCZ 
Bo WAIT 
END 


Figure 18. Three-Way Conferencing Source Code 
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ADPCM Interface 

Adaptive Differential Pulse Code Modulation 
(ADPCM) is a method of increasing voice-band transmission 
capacity. ADPCM is used in telecommunications systems 
to transcode 64-kbit/s PCM to 32-kbit/s ADPCM, thereby 
doubling the capacity of a transmission system. An 
implementation of a full-duplex ADPCM on a single 
TMS32010 is described in the application report, ‘‘32-kbit/s 
ADPCM with the TMS32010.’’3 Although it does not 
provide bit-by-bit compatibility with CCITT 
recommendations, it follows the recommended model. The 
same report also describes the TMS32010 implementation 
of a half-duplex 32-kbit/s ADPCM algorithm recommended 
by CCITT. 

The following paragraphs briefly examine two of many 
possible ADPCM applications: voice mail and transcoding 
in a PBX design. 

Voice-mail in a PBX application is possible by using 
the ADPCM transcoder and the host processor. The 
application hardware for a voice-mail function, shown in 
Figure 19, requires the following modules: 

1. TMS32010 and support circuit, 
2. PBX/TDM interface circuit, and 
3. Host interface circuit. 


ee VOICEMAIL ttst~t~s«s«d 


APPLICATION CIRCUIT 


HOST 
INTERFACE 


T™MS32010 
AND SUPPORT 
CIRCUIT 


PBX/TOM 
INTERFACE 


The ADPCM algorithm is stored on the TMS32010’s 
program memory ROMs. The PBX supervisor instructs the 
host to initiate the voice-mail function and maps the 
PBX/TDM interface circuit to the appropriate time-slot on 
the backplane. The host initiates the transaction by writing 
a command to the application circuit and waits for ADPCM 
samples. The TMS32010 reads.the »/A-law PCM sample 
from the PBX/TDM interface. It then performs the PCM to 
ADPCM transcoding function. The ADPCM sample is then 
written to the host interface. The host reads the ADPCM 
samples at the host interface and stores them in a high- 
Capacity storage device, such as a hard disk or bubble 
memory. At some point, the PBX supervisor instructs the 
host to terminate the transaction with the application circuit. 
When the mail is to be delivered, the PBX supervisor again 
instructs the host to initiate the transaction with the application 
circuit, and maps the PBX/TDM interface to the appropriate 
time-slot on the backplane. The host initiates the transaction 
by writing a command to the application circuit. The 
application circuit then waits for ADPCM samples from the 
host. The host retrieves the ADPCM sample from the storage - 
device and writes it to the host interface. The TMS32010 
reads the ADPCM sample, then performs the ADPCM to 
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Figure 19. Voice-Mail System Diagram 
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PCM transcoding function, and writes the PCM sample to 
the PBX/TDM interface. 

Transcoding within a PBX is possible between an 8-bit 
PCM highway and a 4-bit ADPCM highway. The 
transcoding application requires the following hardware 
blocks, as shown in Figure 20: 

1. TMS32010 and support circuit, 
2. Timing and control circuit, and 
3. Two PBX/TDM interface circuits. 

The PBX supervisor assigns the PBX/TDM interfaces 
to their respective time-slots on the PCM and ADPCM 
highways. The TMS32010 reads the sample from one of the 
interfaces, performs the transcoding function, then writes the 
sample to the second interface. The reverse operation is also 
applied, thereby providing full-duplex operation. 


DTMF Detection Interface — 

In modern PBXs, DTMF detection is used to implement 
call features. DTMF signalling from the station-set is 
interpreted by the PBX for various features, such as outside 
line access, call-forwarding, conferencing, and voice-mail. 
DTMF detection is an integral part of any PBX design. 
DTMF detection is a popular signal processing function 
easily implemented on the TMS32010. TMS32010 source 
code for the DTMF detector is given in reference [4]. 
Multiple-channel DTMF detection is possible with a single 
TMS32010. 

The DTMF detection application contains the following 
modules: 

TMS32010 and support circuit, 
Timing and control circuit, 

Host interface circuit, 

PBX/TDM interface circuit, and 
PCM linearization circuit (optional). 

The DTMF detection algorithm4 requires PCM 
linearization. When the DTMF detection algorithm used is 
time-critical and software linearization is not possible, the 
PCM linearization circuit is needed. A system diagram for 
DTMEF detection is shown in Figure 21. 

An example of system operation is as follows: The PBX 
supervisor is informed that a station-set is in an off-hook 
condition. The PBX supervisor prepares for DTMF detection 
by mapping the PBX/TDM interface and station-set to the 
same time-slot, and instructing the host to detect digits and 
implement features. Note that the station-set is interfaced to 
the PBX backplane with a line-card. The host writes a 
command to the host interface instructing the application 
circuit to begin DTMF detection. The host waits for digits 
to be returned at the host interface. The TMS32010 reads 
samples from the PBX/TDM interface and implements the 
DTMF detection algorithm. When a DTMF digit is detected, 
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the value is written to the host interface. The host reads the 
digit and implements the requested feature. The host then 
writes a command to the host interface, instructing the 
application circuit to terminate DTMF detection. 


SUMMARY 


The TMS32010 is well suited for a variety of 
telecommunications applications, ranging from trunk circuits 
to PBX systems. The hardware described in this application 
report allows the designer to apply some new ideas to end- 
product designs. Various digital PBX applications in the 
following areas were described: PBX backplane, 2/4-wire 
transformer connection, telephony test-set, three-way 
conferencing, ADPCM transcoding, and DTMF detection. 
Standalone systems applications, such as analog interfaces 
and testing, were also shown. All the applications can be 
transposed to the TMS32020, the second-generation digital 
signal processor. 

The TMS32010 offers cost-effective solutions for many 
telecommunications applications requiring digital signal 
processing. Further cost reduction is possible using the 
TMS320M 10. This mask version of the TMS32010 provides 
a 1523-word on-chip program ROM. Texas Instruments 
provides easy-to-use low-cost tools for circuit and algorithm 
development. The entire FMS320 family of digital signal 
processors is extensively supported by a staff of application 
engineers at the factory and in the field. 
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25 C 
2& C DEFINE CONSTANTS AND INITIALIZE 
27 Cc 
23 INTEGER $,@,LIN,LINBB 
29 DIMENSION LINBB(S12) 
30 C 
31 °C 2°S COMPLEMENT LINEARIZATION CALCULATION 
32 C 
33 DO 100 J=1,128,1 
1 34 JIeJ-1 
1 35 Q=MOD (45,14) 
1 36 S=JJ/16 
1 37 LIN=(2##S# (2*Q4+33) )-33 
1 38 LINBB(J)=LIN/2546 
1 39 LINBB(126+J)=(65536-L1IN)/254 
1 40 LINBB( 254+.) =MOD(LIN, 254) 
1 41 LINBB(384+.J5)=MOD((65534-LIN) , 254) 
1 42 100 CONTINUE 
43 LINBB(129)=0 
44 LINBB(385)=0 
45 CALL HEXFILE (LINBB, “U~LIN. HEX” ,512) 
46 STOP“LINEARIZATION COMPLETED~ 
47 END 
Name Type Offset P Class 
J INTEGER#4 2050 
Jt INTEGER#4 2062 
LIN INTEGER#®4 2074 
LINBB INTEGER#4 2 
MOD INTRINSIC 
Q INTEGER#4 2064 
Ss INTEGER#4 2070 
, 43 Cc 
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SUBROUTINE HEXFILE (DEC,LABL,3IZE) 
CHARACTER LABL#9, REC ,HX 

INTEGER DEC,SIZE,SUM,ARDR,CH 

DIMENSION DEC(SIZE),REC( 43) ,HX(16),CH( 43) 
OPEN(1,FILE=LABL, STATUS=" NEW ) 

HX (1) ="0" 


HX(5)=°4°" 
HX(&)=°5° 
HX(7)="6" 
HX (8)=°7° 
HX(9)=°8° 
HX(10)=" 7" 
HX CLL =ETAY 
HX (12)=°B" 
HX C13) =°C0" 
HX (14)=°D" 
HX(15)=°E° 
HX(16)=°F” 
CH(2)=1 
CH(3)=0 
CHI S)=0 
CH(9)=0 
REI (1>=8 * 
=SIZE/16 


DG 1000 J=1,I,1 
ADDR=(.J-1) #16 
CH(4)=ADDRK/ 40946 

(TH(S) =MOD(C ADDR, 4094) /254 
CH(6)=MO0D(ADDR, 254)/14 
CHC7)=MODCADDR, 14) 

pO 2000 L=1,14,1 
LL=(L~-1)#2+10 
CH(LL)=DEC (ADDR+L)/14 
CH(LL+1)=MO0( DEC (CADDR+L) , 146) 
CONT INUE 

SUM=0 

nC 3000 M=2,40,2 
SUM=SLUIM+ (CH(M) #14) 
CONT INUE 

DO 3100 M=3,41,2 
SUM=SIUM+CH(M) 

CONTINUE 
SILIM=254-MOD (3UM, 254) 
CH(42)=SUM/14 

CH( 43) =M0D (SUM, 146) 

BO 4000 N=2,43,1 

REC (ON) =HX (HCN) +1) 
CONTINUE 

WRITE(1,1300) (REC(K2),K2=1,43,1) 
CONTINUE 

WRITE(1,1400) 

CLOSE (1) 

FORMAT (43(A1) > 
FORMAT (° s OOOOOOO1FF” ) 
RETURN 
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109 Seurce Lines 31 DO 100 J=#1,22,1 
1 32 S=0 
1 33 L=J-1 
1 34 ULAW (.1I)=CPAND(L,S) 
1 35 100 CONTINUE 
34 DO 110 J=33,76,1 
1 37 S=1 
1 33 L=J-1 
1 39 LILAW(.J)=CPAND(L,S) 
1 40 110 CONTINUE 
41 DO 120 J=97,224,1 
1 42 S=2 
1 ie L=J-1 
1 44 LILAW (1) =CPANE(L, 5) 
1 45 120 CONTINUE 
44 DO 130 J=225,430,1 
1 47 S=2 
1 43 L=J-1 
1 49 ULAW (J) =CPAND(L,S) 
1 50 130 CONTINUE 
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1 52 5=4 
1 33 L=J-1 
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D Line# 1 7 
1 60 150 CONTINUE 

61 pO 160 J=2017,4064, 1 
1 4&2 S=4 
1 43 =J-1 
1 &4 VLAW C.J) =CFANDCL, 5) 
1 45 140 CONTINUE 

&4& DO 170 J=40465 ,3192, 1 
1 4&7 S= 
1 43 L=J-1 
1 Lone WLAW(.J)=CPAND(L,S) 
1 79 170 CONTINUE 

71 DO 200 J=1,8192,1 
1 72 WLAW (146384-J+1)=128+UILAW (1) 
1 73 200 CONTINUE 

74 CALL HEXFILE (ULAW, “LIN-U.HEX*, 16324) 

73 STOP“LI-LAW COMPANDING COMPLETED“ 

74& END 
Name Type Offset FP Class 
CPAND INTEGER#2 FUNCTION 
at INTEGER#2 32770 
L INTEGER#2 32773 
3 INTEGER#2 32774 
LILAW INTEGER#2 2 

77 

72 C U-LAW COMPAND FLINCTION 

77 

BO INTEGER FUNCTION CPAND(LIN, SEG) 

$1 INTEGER LIN, SEG 

82 CPAND=SEG414+( ((LINt33)/ (2##S5EG))-33)/2 

83 RETURN 

4 END 
Name Type Offset P Class 
LIN INTEGER#Z oO * 
SEG INTEGER#2 4 + 

$3 0 

&4 C INTEL RECORD FORMAT GENERATOR 

87 Cc 

85 SUBROUTINE HEXFILE (DEC,LABL ,SIZE) 

sy CHARACTER LABL*?, REC, HX 

vO INTEGER DEC,SIZE, SUM, ADDR, CH 

91 DIMENSION DEC (SIZE) ,REC( 43) ,HX(146) ,CH(43) 

2 QPEN(1,FILE=LABL , STATUS=~ NEW” ) 

93 HX(1)=70" 

94 HX(2)="1° 

95 HX(32)=°2° 

94 HX (4) =°3" 

97 HX(S)="74° 

93 HX (6) =°5* 

99 HX(7)=°4" 

100 HX(8)=°7~ 

101 HX(9)=°3" 
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HX(10)="9° 
HX(C11L)="A’ 
HX(12)="“B” 
HX (13) ="° 
HxX(14)="D" 

HX(15)=°E° 

HX(14)=°F° 

CH(2)=1 

CHS) =0 

CHC2)=0 

THC?) =O 

REm(t)="2° 

I=SIZE/146 

BM 1000 J=1,1,1 

ADDR= (I-11) #14 
TH(4)=ADDR/ 40746 
(TH(S)=MOD( ADDR, 4074) /254 
TH(4)=MOD(ADDR, 254)/146 
ITH(7)=MND( ADDR, 14) 

pa 2000 L=1,14,1 
LL=(L-1)#2410 

CH(LL)=DEC (ADDR+L)/14 
THC(LL+1)=MOD (DEC (CADDR+L) , 14) 
ONT INDE 

SUM=0 

DQ 3000 M=2,40,2 

SUM=SUM+ (CH(M) #14) 

CONTINUE 

0 3100 M=3,41,2 
SUM=SLM+ITH (M) 

ONT INUE 
SLUM=2546—-MiI0 (SUM, 254) 
TH(42)=SLIM/14 
7H(43)=M00(5UM, 14) 

DO 4000 N=2,43,1 

RET (CN) =HX (IHC N) +1) 

CONTINUE 
WRITE(1,1300) 
ONT INUVE 

WRITE (41,1400) 
CLOSE (1) 
FORMAT (43(A1)) 
FORMAT (* : QO0000001FF ~ ) 
RETLRN 

END 
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INTRODUCTION 


Echo-cancellers using adaptive filtering techniques are now 
finding widespread practical applications to solve a variety 
of communications systems problems.! These applications 
are made possible by the recent advances in microelectronics, 
particularly in the area of Digital Signal Processors (DSPs). 
Cancelling echoes for long-distance telephone voice 
communications, full-duplex voiceband data modems, and 
high-performance ‘‘handsfree’’ audio-conferencing systems 
(including speakerphones) are a few examples of these 
applications. 

The continuing deployment of all-digital toll switches, 
satellite-based voice and data networks, and new 
intercontinental long-haul circuits have been accompanied 
by more widespread use of all-digital voice echo cancellers 
in carrier systems.2 In addition, new low-cost integrated 
single-channel echo cancellers are expected to see increasing 
application in smaller systems for audio teleconferencing and 
low-cost voice/data communications using private satellite 
earth stations. 

Advancements in single-chip programmable digital 
signal processor technology now make it attractive to 
implement modular per-channel echo canceller architectures 
with all the functions required for a single echo canceller 


integrated within a single device. A programmable DSP 
implementation offers the advantages of a short development 
and test schedule and the flexibility to meet custom product 
requirements by extending software-based functional building 
blocks rather than designing new hardware. 

This application report describes the implementation 
of an integrated 128-tap (16-ms span) digital voice echo 
canceller on the Texas Instruments TMS32020 
programmable signal processor. The implementation features 
a direct interface for standard PCM codecs (e.g., Texas 
Instruments TCM2913) and meets the requirements of the 
CCITT (International Telegraph and Telephone Consultive 
Committee) Recommendation G.165 for echo cancellers.3 
This report presents the requirements for echo cancellation 
in voice transmission and discusses the generic echo 
cancellation algorithms. The implementation considerations 
for a 128-tap echo canceller on the TMS32020 are then 
described in detail, as well as the software logic and flow 
for each program module. 

A hardware demonstration model of a 128-tap voice 
echo canceller using the TMS32020 has been constructed and 
tested. Figure 1 shows a photograph of the echo canceller 
demonstration system. The main features of this model are 
described within the report. The appendixes contain complete 
source code and a schematic for the demonstration system 
echo canceller module. 


Figure 1. Echo Canceller Demonstration System 
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ECHO CANCELLATION IN 
VOICE TRANSMISSION 


Echoes in the Telephone Network 

The source of echoes can be understood by considering 
a simplified connection between two subscribers, S1 and S2, 
as shown in Figure 2. This connection is typical in that it 
contains two-wire segments on the ends, a four-wire 
connection in the center, and a hybrid at each end to convert 
from two-wire transmission to four-wire transmission. Each 
two-wire segment consists of the subscriber loop and possibly 
some portion of the local network. Over this segment, both 
directions of transmission are carried by the same wire pair, 
i.e., signals from speakers S1 and S2 are superimposed on 
this segment. On the four-wire section, the two directions 
of transmission are segregated. The speech from speaker S1 
follows the upper transmission path, as indicated by the 
arrow, while speech originating from S2 follows the lower 
path. The segregation of the two signals is necessary where 
it is desired to insert carrier terminals, amplifiers, or digital 
switches. 

The hybrid is a device that converts two-wire to four- 
wire transmission. The role of the hybrid on the right-hand 
side is to direct the signal energy arriving from S1 to the 
two-wire segment of S2 without allowing it to return to S1 
via the lower four-wire transmission path. Because of 
impedance mismatches (unfortunately occurring in practice), 
some of this energy will be returned to speaker S1, who then 
hears a delayed version of his speech. This is the source of 
‘talker echo.”’ 

The subjective effect of the talker echo depends on the 
delay around the loop. For short delays, the talker echo 
represents an insignificant impairment if the attenuation is 
reasonable (6 dB or more). This is because the talker echo 
is indistinguishable from the normal sidetone in the telephone. 
For satellite connections, the delay in each four-wire path 
is about 270 ms as a consequence of the high altitude of 
synchronous satellites. This means that the round-trip echo 
delay is approximately 540 ms, which makes it very 


essential to find ways of controlling or removing that echo. 
Since the subjective annoyance of echo increases with delay 
as well as echo level due to hybrid return energy, the 
measures for control depend on the circuit length. 

For terrestrial circuits under 2,000 miles, the via net 
loss (VNL) plan,4 which regulates loss as a function of 
transmission distance, is used to limit the maximum echo- 
to-signal ratio. On circuits over this length (e.g., 
intercontinental circuits), echo suppressors or cancellers are 
used. An echo suppressor is a voice-operated switch that 
attempts to open the path from listener to talker whenever 
the listener is silent. However, echo suppressors perform 
poorly since echo is not blocked during periods of doubletalk. 
They impart a choppiness to speech and background noise 
as the transmission path is opened and closed. Due to recent 
decreasing trends in DSP costs, digital echo cancellers are 
now viable as replacements for most of the circuits using 
echo suppressors. 

For satellite circuits with full hop delays of 540 ms, 
echo suppressors are subjectively inadequate, and cancellers 
must be employed. 


Digital Echo Cancellers in Voice Carrier Systems 

The principle of the echo canceller for one direction 
of transmission is shown in Figure 3. The portion of the four- 
wire connection near the two-wire interface is shown in this 
figure, with one direction of voice transmission between ports 
A and C, and the other direction between ports D and B. 
All signals shown are sampled data signals that would occur 
naturally at a digital transmission terminal or digital switch. 
The far-end talker signal is denoted y(i), the undesired echo 
r(i), and the near-end talker x(i). The near-end talker is 
superimposed with the undesired echo on port D. The 
received signal from far-end talker y(i) is available as a 
reference signal for the echo canceller and is used by the 
canceller to generate a replica of the echo called f(i). This 
replica is subtracted from the near-end talker plus echo to 
yield the transmitted near-end signal u(i) where 


disturbing to the talker, and can in fact make it quite difficult u(i) = x(i) + r(i) — fd). Ideally, the residual echo error 
to carry on a conversation. When such is the case, it is e(i) = r(i)—f(i) is very small after echo cancellation. 
FOUR-WIRE 
TRUNK 
TWO-WIRE TWO-WIRE 
SUBSCRIBER SUBSCRIBER 
LOOP LOOP 


HYBRID 


$1 


HYBRID 


Cae.) 
$2 


Figure 2. A Simplified Telephone Connection 
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FROM 
FAR-END 
TALKER 


TO 
FAR-END 
LISTENER 


u(i) 


CANCELLER 


x(i) + eli) — Fi) 
iene aey 


NEAR-END 
TALKER 


ECHO x(i) 


HYBRID 


x(i) + ri) 


e(i) 


Figure 3. Echo Canceller Configuration 


The echo canceller generates the echo replica by 
applying the reference signal to a transversal filter (tapped- 
delay line), as shown in Figure 4. If the transfer function 
of the transversal filter is identical to that of the echo path, 
the echo replica will be identical to the echo, thus achieving 
total cancellation. Since the transfer function of the echo path 
from port C to port D is not normally known in advance, 
the canceller adapts the coefficients of the transversal filter. 
To reduce error, the adaptation algorithm infers from the 
cancellation error e(i) (when no near-end signal is present) 
the appropriate correction to the transversal filter coefficients. 

The number of taps in the transversal filter of Figure 
4 is determined by the duration of the impulse response of 
the echo path from port C to port D. The time span over 
which this impulse response is significant (i.e., nonzero) is 
typically 2 to 4 ms. This corresponds to 16 to 32 tap positions 
with 8-kHz sampling. However, because of the portion of 
the four-wire circuit between the location of the echo 
canceller and the hybrid, this response does not begin 


at zero, but is delayed. The number of taps N, must be large 
enough to accommodate that delay. With N = 128, delays 
of up to 16 ms (or about 1,200 miles of ‘‘tail’’ circuit) can 
be accommodated. 

In practice, it is necessary to cancel the echoes in both 
directions of a trunk. For this purpose, two adaptive 
cancellers are used, as shown in Figure 5, where one cancels 
the echo from each end of the connection. The near-end talker 
for one of the cancellers is the far-end talker for the other. 
In each case, the near-end talker is the ‘‘closest’’ talker, and 
the far-end talker is the talker generating the echo being 
cancelled. It is desirable to position these two ‘‘halves’’ of 
the canceller in a split configuration, as shown in Figure 5, 
where the bulk of the delay in the four-wire portion of the 
connection is in the middle. The reason is that the number 
of coefficients required in the echo-cancellation filter is 
directly related to the delay of the tail circuit between the 
location of the echo canceller and the hybrid that generates 
the echo. In the split configuration, the largest delay is not 
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Figure 4. Echo Estimation Using a Transversal Filter 
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Figure 5. Split-Type Echo Canceller for Two Directions of Transmission 


in the echo path of either half of the canceller. Therefore, 
the number of coefficients is minimized. . 

The digital voice echo canceller can be applied in a 
variety of transmission equipment configurations. Some of 
these are illustrated in Figures 6 through 8. 

Figure 6 shows a single-channel echo canceller with 
a four-wire analog interface. The TMS32020 implementation 
described in this application report provides for the serial 
PCM codec interface required for this common 
configuration. 


SINGLE- 
CHANNEL 


DIGITAL 


4/W ANA OG 
LONG-PATH 
DELAY 


64 PCM 
ECHO kpbs CODEC 
CANCELLER 


In digital carrier transmission systems, digital voice 
channels are usually carried in groups of 24 using the T1 
group format.> As indicated in Figures 7 and 8, a 
Tl-compatible digital voice echo canceller can be 
implemented with 24 single-channel echo cancellers 
connected directly to the serial 1.544-Mbps Tl PCM data 
streams for the transmit and receive groups. 


Figures 9 through 11 show the appropriate architectures 
for applying digital voice echo cancellers to analog switching 
and analog transmission channel groups within the telephone 
network. 


4/w 
ANALOG | TERM 
C.O., PBX, | SET 


jor STATION 2/W 
| SUBSCRIBER 


| | LOOP 
eae 


Figure 6. Single-Channel Four-Wire VF Echo Canceller 
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Figure 7. Standalone Digital T1 Echo Canceller 
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Figure 8. Per-Channel Architecture for a T1 Digital Echo Canceller 
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Figure 9. Digital Switch to Analog Facility 
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Figure 10. Analog Facility to Digital Facility 
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Figure 11. Analog Facility to Analog Facility 


ECHO CANCELLATION ALGORITHMS 


Generic algorithm requirements for each major signal 
processing function are discussed in this section. The signal 
processing flow for a single-channel digital voice echo 
canceller is shown in the block diagram of Figure 12. 


Adaptive Transversal Filter 

The reflected echo signal r(i) at time i (see Figure 3) 
can be written as the convolution of the far-end reference 
signal y(i) and the discrete representation hy of the impulse 
response of the echo path between port C and D. 


N-1 
a= > 
k=0 
Linearity and a finite duration N of the echo-path 
response have been assumed. An echo canceller with N taps 
adapts the N coefficients a, of its transversal filter to produce 
a replica of the echo r(i) defined as follows: 


hk yGi—k) (1) 


W= T 
k=0 . 
Clearly, if ax = hy fork=0,...,N—1, then f(i) = r(i) 
for all time i and the echo is cancelled exactly. 


ax y(i-k) (2) 
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Since, in general, the echo-path impulse response hx 
is unknown and may vary slowly with time, a closed-loop 
coefficient adaptation algorithm is required to minimize the 
average or mean-squared error (MSE) between the echo and 
its replica. From Figure 3, it can be seen that the near-end 
error signal u(i) is comprised of the echo-path error 
r(i) — f(i) and the near-end speech signal x(i), which is 
uncorrelated with the far-end signal y(i). This gives the 
equation 


E(u2(i)) = E(x2(i)) + E(e*(i)) (3) 


where E denotes the expectation operator. The echo term 
E(e2(i)) will be minimized when the left-hand side of (3) is 
minimized. If there is no near-end speech (x(i) = 0), the 
minimum is achieved by adjusting the coefficients a, along 
the direction of the negative gradient of E(e2(i)) at each step 
with the update equation 


B dE(e2(i)) (4) 


ax(i+1) = axG) — Baxi 


where G is the stepsize. Substituting (1) and (2) into (3) gives 
from (4) the update equation 


ax(i+1) = ax(i) + 26E [e(i) yG-k)] (5) 
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Figure 12. Signal Processing for a Digital Voice Echo Canceller 


In practice, the expectation operator in the gradient 
term 2GE [e(i) y(i — k)] cannot be computed without a priori 
knowledge of the reference signal probability distribution. 
Common practice is to use an unbiased estimate of the 
gradient, which is based on time-averaged correlation error. 
Thus, replacing the expectation operator of (5) with a short- 
time average, gives 


M-1 
ay(it1) = ax(i) + 28 — > e(i—m) y(i-m—k) (6) 
m=0 


The special case of (6) for M = 1 is frequently called 
the least-mean-squared (LMS) algorithm or the stochastic 
gradient algorithm. Alternatively, the coefficients may be 
updated less frequently with a thinning ratio of up to M, as 
given in 


M-1 
ay(it+M+1)=ax(i) +28 J) eGit+M-—m) y(it+M-m-k) 
m=0 (7) 


Computer simulations of this ‘‘block update’’ method 
show that it performs better than the standard LMS algorithm 
(i.e., M=1 case) with noise or speech signals.6 Many 
cancellers today avoid multiplication for the correlation 
function in (7), and instead use the signs of e(i) and y(i—k) 
to compute the coefficient updates. However, this “‘sign 
algorithm’’ approximation results in approximately a 
50-percent decrease in convergence rate and an increase in 
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degradation of residual echo due to interfering near-end 
speech. 

The convergence properties of the algorithm are largely 
determined by the stepsize parameter 6 and the power of the 
far-end signal y(i). In general, making 8 larger speeds the 
convergence, while a smaller 8 reduces the asymptotic 
cancellation error. 

It has been shown that the convergence time constant 
is inversely proportional to the power of y(i), and that the 
algorithm will converge very slowly for low-power signals.7 
To remedy that situation, the loop gain is usually normalized 
by an estimate of that power, i.e., 


B 
26 = 26(i) 75 (8) 


where 6 is a compromise value of the stepsize constant and 
Py(i) is an estimate of the average power of y(i) at time 1. 


Py(i) = (Ly(i))2 (9) 
where Ly(i) is given by 
Ly(i+1) = (1-p) Ly(i) + ply@| (10) 


The estimate py(i) is used since the calculation of the 
exact average power is computation-expensive. 


Near-End Speech Detector 


When both near-end and far-end speakers are talking, 
the condition is termed ‘‘doubletalk.’? Since the error signal 
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u(i) of Figure 2 contains a component of the near-end talker 
X(i) in addition to the residual echo-cancellation error, it is 
necessary to freeze the canceller adaptation during doubletalk 
in order to avoid divergence. Doubletalk status can be 
detected by a near- end speech detector operating on the near- 
end and far-end signals y(i) and s(i), respectively. 

A commonly used algorithm by A. A. Geigel8 consists 
of declaring near-end speech whenever 


Is(i)| =|x(i) +r@)| == = max{] y@|,|yG-DI,....l¥G-N)}} 


(11) 


where N is the number of samples in the echo canceller 
transversal filter memory. It is necessary to compare s(i) with 
the recent past of the far-end signal rather than just y(i) 
because of the unknown delay in the echo path. The factor 
of one-half is based on the hypothesis that the echo-path loss 
through a hybrid is at least 6 dB. The algorithm in effect 
performs an instantaneous power comparison over a time 
window spanning the echo-path delay range. 

A more robust version of this algorithm uses short-term 
power estimates, y(i) and S(i), for the power estimates of 
the recent past of the far-end receive signal y(i) and the near- 
end hybrid signal s(i), respectively.These estimates are 
computed recursively by the equations 


1 
2 


(i+ 1) = (1—a) &(i) + als(i)| (12) 
yitl) = d~ae) 9) + aly(i)| (13) 


where the filter gain a = 2~—5. For this version of the 
algorithm, near-end speech is declared whenever 


ai) = ; max (¥(i),9(i—1),...,74-N)) (14) 


Since the near-end speech detector algorithm detects 
short-term power peaks, it is desirable to continue declaring 
near-end speech for some hangover time after initial 
detection. 


Residual Echo Suppressor 

Nonlinearities in the echo path of the telephone circuit 
and uncorrelated near-end speech limit the amount of 
achievable suppression in the circuit from 30 to 35 dB. Thus, 
there is no merit in achieving more than a certain degree of 
cancellation. 

The use of a residual echo suppressor algorithm has 
been found to be subjectively desirable.? During doubletalk, 
the residual suppressor must be disabled. A common 
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suppression control algorithm is to detect when the return 
signal power falls below a threshold based on the receive 
reference signal power. If the return signal consists only of 
residual echo and the canceller has properly converged, then 
the residual echo level will be below the threshold and the 
transmitted return signal will be set to zero. 

The return signal power is estimated by the equation 


Lyit 1) = (1—p) Lu@ + plu@| (15) 


The reference power estimate Ly(i) is given by (10). 
Suppression is enabled on the transmitted signal 
u(i) (i.e.,u(i) = 0) whenever Ly(i)/Ly(i) < 274. This 
corresponds to a suppression threshold of 24 dB. 


IMPLEMENTATION OF A _ 128-TAP ECHO 
CANCELLER WITH THE TMS32020 


The TMS32020 is ideally suited for the implementation 
of a single 128-tap digital voice echo canceller channel since 
it has the capability and features to implement all of the 
required functions with full precision. This section discusses 
an implementation approach that meets or exceeds the 
performance of currently available products and the 
requirements of the CCITT G.165 recoffimendations.3 


Echo Canceller Performance Requirements 
Echo cancellers have the following fundamental 


requirements: 
1. Rapid convergence when speech is incident in a new 
connection 


2. Low-returned echo level during singletalking (i.e., 

echo-return loss enhancement) 

Slow divergence when there is no signal 

4. Rapid return of the echo level to residual if the echo 
path is interrupted 

5. Little divergence during doubletalking 


Ww 


The CCITT recommendation G.165 specifies echo 
canceller performance requirements with band-limited white- 
noise (300 — 3400 Hz) test signals at the near-end and far- 
end input signal ports. The test specifications of G.165 are 
summarized in Table 1. 

Digital voice echo canceller products are typically 
designed to accommodate circuits with tail delays of 16 ms 
or more and circuits with echo-return loss levels greater than 
3 dB to 6 dB. Typical digital voice echo canceller product 
specifications are summarized in Table 2. 
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Table 1. CCITT G.165 Performance Test Specifications 


CCITT TEST 


. Final echo return loss (ERL) after 
convergence; singletalk mode 


Circuit ERL: 


PERFORMANCE 


DESCRIPTION REQUIREMENT 


Input noise level: — 10 dbm0O 


to —30 dbmO 
10 dB 


Steady-state residual echo level after 
convergence with no near-end signal 


. Convergence rate; singletalk mode 


Input noise level: —10 dbn O 


Combined echo loss after 500 ms from 
initialization with cleared register and with 
near-end signal set to zero at initialization time 


. Leak rate 


Degradation of residual echo after 2 minutes 


from time all signals are removed from fully 
converged canceller 


. Infinite return loss convergence 


Circuit ERL: 


Input noise level: — 10 dbmO 


~ 40 dbmO 
to — 30 dbm0O 
10 dB 


Returned echo level 500 ms after echo path is 


interrupted 


Table 2. Typical Echo Canceller Product Specifications 


PARAMETER SPECIFICATION 


. Maximum tail circuit length 
. Absolute delay 

. Minimum echo return loss 
. Convergence 


. Residual echo level (— 30 to — 10 dbmO receive level) 


6. Speech detector threshold 
. Speech detector hangover time 


Implementation Approach 

In the implementation of the generic echo-cancelling 
algorithms discussed above, the coefficient update process 
dominates the computational requirement and efficiency of 
DSP realizations. The DSP efficiency and speed, in turn, 
determines the maximum number of echo canceller taps that 
can be achieved with the processor. 

The block update approach of (7) with M = 16 was 
chosen for the TMS32020 implementation because it takes 
advantage of the efficient multiply and accumulate 
capabilities of the processor. Using the block update 
approach, a full-performance 128-tap canceller can be 
realized with a small margin. During each sample period (125 
ps), 8 out of 128 coefficients are updated using correlation 
of the 16 past error and signal values. 
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16, 32, or 48 ms 
0.375 ms maximum 

6 dB 

24 dB enhancement in 
250 ms 


— 40 dbm0O (suppressor disabled) 


-—65 dbmO (suppressor enabled) 


6 dB below receive level 
75 ms 


Computer simulation studies were undertaken to verify 
the performance of the block update algorithm 
(M = 16) in comparison with the stochastic gradient 
algorithm (M = 1), taking into account the finite-precision 
and word-length limitations of the TMS32020. Figures 13 
and 14 show the simulation results for three values of the 
compromize stepsize constant 31, defined in (8). The curves 
represent the average of 600 samples for single convergence 
runs from a zero initial condition with white-noise input. The 
block update algorithm performs better than the stochastic 
gradient algorithm for all three values. For values of (1 
larger than 2-8, the algorithm can become unstable. 
Therefore, for both practical and performance reasons, the 
value 8; = 2—10 was chosen for implementation. 
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Figure 13 
Convergence Performance of the Block Update Algorithm 
and Stochastic Gradient Algorithm 
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Figure 14 
Convergence Performance of the Block Update Algorithm 
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In the TMS32020 implementation, it is convenient and 
desirable to normalize both the stepsize and the error 
variables u(i) by the square root of the power estimate Py(i), 
1.e., Ly(i) of (9). 

Normalizing u(i) and the stepsize separately enables the 
product term of (7) to be computed with single precision on 
the TMS32020 without significant loss of precision or 
overflow due to varying signal level. 

Table 3 gives a description of the program variables 
together with their names and ranges, and summarizes the 
number formats chosen for the echo canceller 
implementation. One of the most important aspects of the 
implementation approach is the handling of the binary 
representation of the signal samples, algorithm variables, 
coefficients, and constant parameters for various stages of 
the processing. The notation (Q.F) is used to define the 
representation of either 16-bit numbers or 32-bit accumulator 
numbers, where F specifies the number of bits which are 
to the right of the implicit binary point. The assignments of 
Table 3 ensure that the algorithm can be executed on the 
TMS32020 with single-precision arithmetic and with no 
significant loss of precision. 


Memory Requirements 

The echo canceller algorithm requires the storage of 
both reference samples and variable coefficients in on-chip 
data RAM so that the required FIR and block update 
convolution can be performed efficiently using the RPTK 
and MACD instructions. Therefore, the coefficients a, are 
stored in block BO, which is configured as program memory. 
The 16 normalized error samples for coefficient updating are 
also stored in BO. The 128 reference signal samples y(i) are 
stored in data RAM along with an additional 16 reference 
samples y(1 — 129), ..., y@i— 143), which are used in the 
update of coefficients a; 12, ..., @)27. The echo canceller data 
memory locations are summarized in Table 4. 


Software Logic and Flow | 
A flowchart of the TMS32020 program for a 128 — tap 
digital voice echo canceller is shown in Figure 15. 


In Table 5, the instruction cycle and memory 
requirements are listed for the various blocks of the program 
implementation. The blocks are listed in the order of 
execution. 
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Table 3. Algorithm Number Representation on the TMS32020 


VARIABLE DESCRIPTION 


s(t) 
r(i) 
Lyli) 
Ly(i)- 4 


u(i) 


un(i),...,un(i— 15) 


Filter coefficients 
Reference samples 
Near-end signal 
Echo estimate 


Average absolute value of y(k) 


Near-end signal minus echo 
estimate s({k) —r(k) 


Normalized outputs 
un(i) =uli) x Ly(i)~1 


Short-time average 
of 2 x |s(i)| 


Short-time average 
of |y(i)| 


Table 4. Echo Canceller Data Memory Locations 


BINARY 
REPRESENTATION 


[-1. 1-2 -15] 
~ 215-1] 
[ - 25, 215-1] 


[ 215, 215-1] 


[o.215-1] 
[-1. 1-2-18] 


[ ~2'5, 215-1] 


[-1. 1-2-15] 
[o.215-1] 
[o,215-1] 


VARIABLE SYMBOL LOCATION | ReMaRK 


40,.+.8127 


y(k),...,y(k) — 143) YO,...,¥143 


un{k,...,un(k — 15) UNO,...,UN15 
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Block BO 
767,766,...,640 


AO is in higher address 


Block B1 Y128,...,Y143 
768,769,...,911 required for block update 


Block BO 
512,...,527 
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INITIALIZE PROCESSOR 


4 - LAW TO LINEAR CONVERSION OF 
y(n). STORE yin). NEXT CYCLE 
y- LAW TO LINEAR CONVERSION OF 


Sin). 


READ », - LAW 
PCM FAR-END 
SAMPLE FROM 
DRR INTO i1(n +1) 


HIGHPASS FILTER s(n) TO REMOVE| |!N SERIAL INPUT 


DC-OFFSET AND GET s(n). STORE sin).|_ | 'NTERRUPT 
ROUTINE. 


COMPUTE ECHO ESTIMATE rin) IN FIR 
FILTER ROUTINE. 


COMPUTE OUTPUT uin) = S(n) — rfn). 


IN ABSENCE OF NEAR-END SIGNAL 
{u(n) = e(n)), COMPARE e(n) WITH 
ERROR SUPPRESSION THRESHOLD. 


NO 


1S ERROR 
BELOW THRESHOLD? 


SUPPRESS ERROR: u(n) = e(n) 


LINEAR TO »-LAW CONVERSION OF 
u(n); WRITE IT TO DXR. 


UPDATE POWER ESTIMATES OF y(n) 
AND u(n). NORMALIZE u(n). 


DETECT NEAR-END SPEECH. 


NEAR-END 
SPEECH 
CONDITION? 


YES 


UPDATE FILTER COEFFICIENTS. 


SERIAL 
OUTPUT INTERRUPT 
CONDITION? 


YES 


AFTER SHIFTING OUT DXR, START 
NEXT CYCLE IN SERIAL OUTPUT 
INTERRUPT ROUTINE. 


NEXT CYCLE 


READ ».-LAW PCM NEAR-END SAMPLE 
FROM DORR into i2(n+1), IN SERIAL 
INPUT INTERRUPT ROUTINE. 


A) Figure 15. Echo Canceller Program Flowchart 
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MODULE FUNCTION 


Cycle Start Routine 


Echo Estimation 
Routine 


Compute Output 


Residual Output 


Suppression Routine 


Linear to p-law 
Compression Routine 


Power Estimation 
Routine 


Output Normalization 
Near-end Speech 
Detection 


Coefficient Increment 
Update Routine 


Coefficient Update 
Routine 


Cycle End Routine 


Receive Interrupt 
Service Routine 


Transmit Interrupt 
Service Routine 


Interrupt Branches 


Processor 
Initialization * * 


p-law to Linear 
Conversion Table* 


Table 5. Program Module Requirements 


CODE 
LISTING 
PAGE 


CPU PROGRAM 
CYCLES MEMORY 
LOCATIONS 


DESCRIPTION 


p-law to linear conversions; take 
absolute value of inputs and high-pan 
filter s{i). 


FIR convolution of reference samples 
and filter coefficients to get echo 


replica r(i). 


u(i) = s(i) — r(i) 
Store u)i). 


If output power below threshold, 
set uli) = O. 


Convert u(i) to p-law. 


Estimate short-term power of u(i) 
and y(i). 


Comput up(i) = = and clip it. 
yi 


Perform maximum test for near-end 
speech. 


If no near-end speech, compute 
increments for coefficient group. 


Add increments to coefficient group. 


Wait for interrupt. 


Save status and read input sample. 


Branch to start. 


Clear memory, initialize status and 
set parameters. 


DATA* 
MEMORY 
LOCATIONS 


*Locations are entered only for the routine that uses them first. 


**Not in main cycle; CPU cycles not counted in total. 
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The program loop is executed once per I/O data sample 
period of 125 ys. The program loop is interrupt-driven from 
. the output data sample mark of a T1 frame. Depending on 
the near-end speech detector/hangover status, the coefficient 
update computation module may be skipped. An input data 
sample interrupt mark occurs during the program loop at a 
time dependent on the channel location within the T1 frame. 
In response to the interrupt, the main program execution is 
interrupted and saved until the new input samples have been 
read into memory. At the end of each program loop, the 
processor waits for the next output sample interrupt. 

In the following subsections, the implementation of 
each major block is described in detail. Each variable used 
in an equation is referred to by its name in the program 
enclosed in parentheses. 


Cycle Start Routine 

The voice echo canceller program has been 
implemented with either p-law or A-law conversion routines 
as a program option. 

The y-law (or A-law) to linear input conversion routine 
is implemented by table lookup in order to minimize the 
number of instructions. The 256 14-bit two’s-complement 
number corresponding to the 256 possible 8-bit p-law 
numbers are stored in program memory. The 8 bits of the 
p-law number specify the relative address of the 
corresponding linear number in the table, which is added to 
the first address in the table to form the absolute program 
memory address for the linear number. The TBLR instruction 
is then used to move the number from program to data 
memory. 

In the cycle start routine, the p-law input reference 
sample is read from memory location DRR2 and converted 
to its linear representation y(i) (YO). Its absolute value is also 
stored in location ABSYO. The near-end input sample is then 
read and converted to a linear representation sdc(i) (SODC). 
The sample s(i) is next put through a highpass filter to remove 
any residual dc offset. The highpass filter is a first-order filter 
with a 3 — dB frequency at 160 Hz. Its output s(i) (SO) is given 
by 


sii+1) = (1—-Y) s(i) + ; (1 — y) (sde(i) — sde(i-— 1)) 


where y = 27-3. (16) 


Note that the filter implementation requires double- 
precision arithmetic, with SO denoting the MSBs of s(i) and 
SOLSBS its LSBs. 


Echo Estimation 
The echo estimate f(i) (EEST) is formed by convolving 
the tap weight coefficients ap, ..., a}27 (AO, ..., A127) 


with the 128 most recent reference samples 
y(i), ..., yi— 127) (YO, ..., Y127). 
127 
ri) = Le ax yi-k) (17) 
k=0 
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This operation is most efficiently implemented on the 
TMS32020 using the RPTK and MACD instruction. The 
samples y(i),..., y(i— 127) are stored in block B1 of data 
memory while ao,..., 42127 are stored in block BO configured 
as program memory. Since the MACD instruction also 
performs a data move, 


y(i-k +1) — y(i-k) fork = 1,...,128 (18) 


no data shifting is required for the computation of the next 
echo estimate. 

The block update routine used for the coefficient 
adaptation requires the storage of y(i— 128), ..., y@— 143) 
(Y128, ..., Y143) in addition to the most recent 128 samples 
used in the convolution. Since these samples are not used 
in the convolution, they are updated using the RPTK and 
DMOYV instructions. 


yai-k+1) — yai-k) fork = 129,...,143 (19) 


The tap weight coefficients ag, ..., 4127 are initially 
set to zero, and are adjusted by the algorithm to converge 
to the impulse response of the echo path ho, ..., hy27. 


aq(i) — hy fork = O,...,127 (20) 


The |hk| < 1,*¥ k,betause the power gain of the echo 
path is smaller than unity. The binary representation for the 
a,’s was chosen to be of the form (Q.15) with 15 bits after 
the binary points. This format represents a number between 
-1 and (1 — 2~—15). The reference samples and the echo 
estimate are represented as 16-bit two’s-complement integers 
(no binary point). The 32-bit result of the convolution is 
therefore of the form (Q.15), and the 16 bits of the echo 
estimate are the MSB of accumulator low (ACCL) and the 
15 LSBs of accumulator high (ACCH). One left shift of the 
accumulator is required before ACCH is stored in EEST. 


Residual Error Suppression 

The residual cancellation error is set to zero (or 
suppressed) whenever the ratio of a long-time average of the 
absolute value of the output (ABSOUT) to a long-time 
average of the absolute value of the reference signal (ABSY) 
is smaller than a fixed threshold. The two long-time averages 
are updated subsequently in the program as described below. 
The suppression is, of course, disabled when a near-end 
speech signal is present (HCNTR > 0). The suppression 
threshold is set at 1/16 or —24 dB. 


Linear to »-Law (A-Law) Conversion 

The linear to p-law (A-law) conversion routine is an 
efficient adaptation to the TMS32020 of the conversion 
routine written for the TMS32010 and described in the 
application report, ~“‘Companding Routines for the 
TMS32010.’’9 
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Signal and Output Power Estimation 

An estimate of the long-time average of |u(i)| is 
required by the residual error suppression routine. This 
estimate Ly(i) (ABSOUT) is obtained by lowpass filtering 
\u(i)| (ABSU0O) using the following infinite impulse response 
(IIR) filter: 


Lyi+l) = (-a) Lui) + alud)| (21) 


where a = 2~7. In terms of the program variables, the IIR 
filter is given by 


ABSOUT = 2-16 (216 x ABSOUT - 29 
x ABSOUT + 29 x ABSUO) (22) 


Similarly, the estimate Ly(i) (ABSY) of the long-term 
average of y(i) (ABSY0O) is the output of an IIR filter with 
the same a, but differs from the above filter by the addition 
of a cutoff term that prevents the estimate from taking values 
smaller than a desired level. 


ABSY = 2-16 (216 x ABSY — 29 x ABSY + 29 
x ABSYO + 29 x CUTOFF) (23) 


This insures that ABSY = CUTOFF even if ABSYO 
is zero for a long time. 

Since Ly(i) is used to normalize the algorithm stepsize, 
this feature is important in order to prevent excessively large 
stepsizes when the far-end talker is silent. 

The stepsize is normalized according to 


260). = SS (24) 


In order to avoid double-precision arithmetic, this 
normalization is carried out in two stages (as described in 
the subsection on coefficient adaptation). Each of the stages 
requires a division by Ly(i). It is more efficient to compute 
Ly(i)~! (IABSY) and replace the divisions by two 
multiplications. 

Since ABSY is a positive integer, taking its inverse 
consists simply of repeating the SUBC instruction. IABSY 
is a positive fractional number of the form (Q.15), taking 
values between 0 and 1—2715. 


Output Normalization 

The normalized output up(i) (UNO) is defined as 
#G)/Ly(@) and replaces the actual error in the coefficient 
update routine for finite-precision considerations, described 
in the subsection on coefficient adaptation. In the absence 
of near-end speech, up(i) is equal to a normalized cancellation 
error and is used in the coefficient update. In the presence 
of near-end speech, no coefficient update is carried out, and 
the normalized outputs are not used. 

The block update approach requires the 
storage of the 16 most recent normalized outputs 
Up(i), ..., Un(i — 15) CUNO,..., UN15). Ina given program 
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cycle, only up(i) is computed and stored, while 
Up(i — 1),..., Uni — 15) computed in previous program 
cycles are only updated using the DMOV instruction. 


un(fi-k +1) - u,(i-k) fork = 1,...,14 (25) 


In the absence of near-end speech, the normalized 
output should be a number smaller than one, which is 
represented as a (Q.15) fraction. To insure that the 
representation is adequate even in the presence of a near- 
end signal, the normalized output is clipped at +1 or -1, ie., 


if uni) > 1.0, then u,(i) = 1.0 


(26) 
if un(i) < —1.0, then u,(i) = —1.0 
Near-End Speech Detection 
Near-end speech is declared if 
(i) = max (¥(i),fG-1),...,(i— 127 —h(i))) (27) 


where §(i) (ABSSOF) is the output of a lowpass filter with 
input 2 x |s(i)| (ABSSO). The variable y(i) is a lowpass 
filtered version of |y(i)|, and h(i) (H) a modulo-16 counter. 
The lowpass filters are IIR filters with short-time constants, 


s(i+1) = (1-a@) s(i) + @ x 2 x |s(i)| (28) 


yit+l) = (1-a@) y(i) + @ x ly(d| (29) 


where a = 27-5. 


The counter h(:) is incremented by one for every input 
sample. The routines maintain nine partial maxima 
m0, ml,..., m8 (Mo, Mj, ..., Mg), defined at time 
i =16m + h(i) by 


mo(i) = max (¥(i),...,74i—hG) +) 
my (i) = max (¥(i—h),...,7(i — h(i) — 15)) ae 
mg(i) = max (¥(i—h—112),...,7G—h(i) — 127)) 


Figure 16 illustrates how the partial maxima are 
maintained. 

The condition for near-end speech declaration is then 
equivalent to 


S(i) = max (mo,...,mg) (31) 


The partial maxima are updated according to the 
following recursions: 


if h = O, then moi) = yit+1) 
and mj(i) = mj —1(i) (32) 
where j = 1,...,8 
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Figure 16. Partial Maxima for Near-End Speech Detection 


and 

if0 <h s 15, then mo(i+1) = max (mo(i),yG+1)) 
and mj(i + 1) = mij(i) 

where j = 1,...,8 


If near-end speech is declared, a hangover counter 
(HCNTR) is set equal to a hangover time (HANGT), which 
was chosen to be 600 samples or 75 ms. If no near-end speech 
is declared, then the hangover counter is decremented by one, 
unless it is zero. If the hangover counter is larger than zero, 
then the coefficient update routine is skipped. Moreover, if 
the reference signal power estimate Ly(i) is smaller or equal 
to the cutoff value of — 48 dB, then adaptation is also 
- disabled to avoid divergence during long silences of the far- 
end talker. 


Coefficient Adaptation 
The 128 coefficients of the transversal filter are divided 
into 16 groups of 8 coefficients each, as shown in Table 6. 
Table 6. The Coefficient Groups 


49-846-832,----8112 


41-84 7-833.----8113 


415 ,831-447-----8127 
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The coefficients in only one of the groups are updated 
in a given program cycle, while the other coefficients are 
not modified. A modulo-16 counter h(i) (H) points to the 
index of the group to be updated, and is incremented by one 
during every program cycle. 

The update equation is repeated here for ease of 
reference. 

Bj ae 
yy eGi-m) ya-k—m) 
(Ly(i))? m=0 aa, 


au(i+1) = ax(i) + 


fork = h,h + 16,...,h + 112, where h is the value of 
the counter and goes from 0 to 15. The error terms 


e(i — m) (m = QO, ..., 15) are the most recent cancellation 
errors. In this case, the errors are equal to the 15 most recent 
canceller outputs u(i), ..., u(i — 15) since the adaptation is 


carried out only in the absence of a near-end signal. 

For finite-precision considerations, the actual 
implementation of the update equation by the routine is 
carried out in the following two main steps: 


1. Compute eight partial updates: 
k+15 


yD Bem yi-k—m (35) 


Yk) =. ; 
= m=k = Ly(i) 


where k = h,h + 16, ...,h + 112. 
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The normalized outputs up(i), ..., Un(i — 15) have 
already been computed and stored. 


2. Update the coefficients: 


axli+ 1) = ali) + (24 x (Ly()—1 x 26) x 444i) 2-16 
(36) 


where G (GAIN) is a program parameter that determines 
the stepsize of the algorithm and has the value 
O,t1y. 35. se. 1 9, 


The partial updates 7,(i) are computed using the MAC 
instruction in repeat mode. The result is rounded and stored 
in temporary locations INCO, ..., INCO + 7 in block Bl. 

For the second step of the update, Ly(i)~ | (IABSY) 
is first loaded in the T register with a left shift of G (GAIN). 
It is then multiplied by each of the yx(i)’s. SPM is set to 
2 to implement the 24 multiplication by shifting the P register 
four positions to the right before adding it to the accumulator 
(APAC). 


Interrupt Service Routines 

At the end of the cycle, the program becomes idle until 
a receive interrupt occurs followed by a transmit interrupt 
that sends it back to the beginning of the cycle. The transmit 
interrupt routine simply enables interrupts and branches back 
to the start. The receive interrupt must store the status register 
STO and the accumulator, then read the received sample from 
DRR, zero its eight most significant bits, and store it in 
DRR1. It restores the accumulator and status register STO 
before returning to the main program. 


External Processor Hardware Requirements 

Very little external hardware is required to implement 
a complete single-channel 128-tap echo canceller with the 
TMS32020. In addition to the processor, only two external 
1K x 8 PROMs and some system-dependent interface logic 
are required. A typical interface circuit for the demonstration 
system is shown in Appendix A. 

The TMS32020 serial I/O ports allow direct interfacing 
of the echo canceller to a digital Tl carrier data stream. 
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Three I/O functions must be performed during each 
T1 frame (125 ys). The far-end and the near-end signals must 
be read in, and the processed near-end signal must be written 
out. To perform these functions, a timing circuit must extract 
the T1 clock and the T1 frame marks for each direction of 
transmission. The timing circuit uses the frame mark to 
generate a channel mark that selects the desired channel out 
of the 24 present in the T1 frame. The channel mark goes 
to a high level during the clock cycle, immediately preceding 
the eight serial bits of the desired sample. 

The T1 clock, channel mark, and serial data signals 
are directly input into the TMS32020 serial clock (CLKR), 
serial input control (FSR), and serial input port (DR), 
respectively. Because data is read in from two directions of 
transmission, a triple two-to-one multiplexer 
(e.g., SN74LS157) is required to select one of the two sets 
of T1 signals to be input into the TMS32020. During each 
Tl frame, the multiplexer alternates once between each 
direction of transmission, under the control of the timing 
circuit. 

Since data is written out in only one direction, the 
TMS32020 serial output port (DX) is directly tied to the 
outgoing T1 data line. The serial output clock (CLKX) and 
the serial output control (FSX) signals are the same as the 
near-end direction-of-transmission CLKR and FSR signals. 
If the far-end T1 channel-frame location overlaps the near- 
end T1 channel location in time, it is necessary to delay each 
far-end sample external to the TMS32020 to permit it to be 
read following the sample from the near-end direction. This 
requires an eight-bit serial shift register and some additional 
timing circuits. 


Description of a Single-Channel Demonstration 
System 

The demonstration system has been constructed in order 
to verify the TMS32020 implementation. Two photographs 
and a block diagram of the demonstration system are 
provided. 

Figure 17 is a photograph of the front panel of the 
demonstration system, and Figure 18 is a closeup photograph 
of the single-channel echo canceller module. 
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Figure 17. Front Panel of the Echo Canceller 
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Figure 18. Single-Channel Echo Canceller Module 
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As shown in the block diagram of Figure 19, the 
demonstration system models two end offices, a delay due 
to a satellite link, a delay due to a terrestrial link, a typical 
end-loop line response, and the echo canceller. A phone is 
connected via a two-wire interface to each end of the path. 
The two-wire interfaces are converted to four-wire in 
electronic hybrids. The hybrids also provide the required 
battery voltage to power the phones. The near-end two-wire 
line has a series-passive line simulator. The associated hybrid 
has an adjustable termination to allow a variable amount of 
hybrid mismatch, and therefore a variable amount of near- 
end echo response. 

At each'end, the four-wire analog signal is converted 
to and from PCM uz -law digital representation by a codec. 
The PCM signaling is done in a T1 format, with appropriate 
timing provided by a central timing generator. Variable delay 
is provided in the near-end and far-end path by digital 


oc 
POWER TIMING 


y-LAW T1 0TO 


1 ’ 


™{S32020 
ECHO 
CANCELLER 


VARIABLE 
TERMINATION 


p- LAW 
PCM 6-ms 
DECODER DELAY 


LINE RJ-11E pari 
SIMULATOR PHONE 


RJ-11E 


memories. The TMS32020 echo canceller is situated in the 
middle of the path, with signal processing done on the near- 
end to far-end direction of transmission. The other direction 
is used as the reference signal. All the TMS32020 signal I/O 
is performed using the T1 format. A display of the processed 
signal is used as an indicator of echo suppression in the 
absence of near-end signal. To aid the testing of the echo 
canceller, the far-end phone can be switched out and a noise 
generator switched in as a source of far-end signal. 

The performance of the TMS32020 echo canceller was 
measured for white-noise input, as suggested in the 
CCITT G.165 recommendation. The measurement results 
are summarized in Table 7 and show that the TMS32020 echo 
canceller performance exceeds the CCITT requirements in 
all the tests described. The subjective performance on speech 
was also found to be very good in both singletalk and 
doubletalk modes, with no audible distortion of the signal. 
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Figure 19. Block Diagram of Echo Canceller Demonstration System 


Table 7. TMS32020 Echo Canceller Performance 


. Final echo return loss after 
convergence; singletalk mode 


. Convergence rate; singletalk 
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. Leak rate 
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CONCLUSION 


The development of novel variations of the generic 
least-mean-squared (LMS) echo cancelling algorithm and the 
near-end speech and residual suppression control algorithms 
has resulted in the implementation of a complete 128-tap 
single-channel echo canceller on a single TMS32020 
programmable Digital Signal Processor. The echo canceller 

_performance exceeds all requirements of the CCITT G.165 
recommendations and the performance of similar currently 
available products. The only external hardware required are 
two program PROMs and a serial data multiplexer. A direct 
T1-rate serial interface is available to minimize component 
count in four-wire VF and T1 carrier configurations. 

The single-channel TMS32020 echo canceller program 
provides a high-performance building block for low-cost 
systems, which can be tailored to a wide variety of system 
applications. Programmability offers the flexibility to 
implement custom requirements, such as cascaded sections 
for longer tail delay range, short-range multichannel 
versions, or other special-purpose functions. 

The echo canceller application illustrates the power and 
versatility of the TMS32020 single-chip programmable signal 
processor. Applications of this technology can be expected 
to benefit many other complex signal processing tasks in 
communications products, including voiceband data modems, 
voice codecs, digital subscriber transceivers, and TDM/FDM 
transmultiplexers. 
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APPENDIX A 


HARDWARE SCHEMATIC OF THE SINGLE-CHANNEL DEMONSTRATION PROCESSOR 
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APPENDIX B 


SOURCE CODE LISTING 
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OUTPUT ESTIMATE 
RESIDUAL OUTPUT SPRS THRESHOLD 


HOLOS 1 

Y142 DATA MEM ADRS 
NEAR-END SAMPLE MSBS 
NEAR-END SAMPLE LSBS 


INPUT NEAR-END SAMPLE (K=0) 
INPUT NEAR-END SAMPLE (K=1) 


PAGE 4 DATA MEMORY ALLOCATION 


EQU 
EQU 


EQU 
EQU 


512 
65280 


0 
15 


PAGE 4 DATA MEM ADRS 
PAGE 4 PROG MEM ADRS 


NORMALIZED OUTPUT (K=0) 
NORMALIZED OUTPUT (K=15) 


PAGE 5 DATA MEMORY ALLOCATION 


EQu 
EQU 


EQu 
EQU 


PAGE 6 DATA 


EQuU 


EQU 
EQU 


PAGE 7 DATA 


EQu 


EQU 
EQU 


EQU 


EQU 


640 
65408 


8) 
127 


768 


0 
127 


896 


* 


* 


* 


* 


PAGE 5 DATA MEM ADRS 
PAGE 5 PROG MEM ADRS 


FIR FILTER COEFFICIENT (K=127) 
FIR FILTER COEFFICIENT (K=0) 


MEMORY ALLOCATION 


PAGE 6 DATA MEM ADRS 


REFERENCE SAMPLE (K=0) 
REFERENCE SAMPLE (K=127) 


MEMORY ALLOCATION 


PAGE 7 DATA MEM ADRS 


REFERENCE SAMPLE (K=128) 


“ REFERENCE SAMPLE (K=143) 


TEMPORARY STORAGE LOCATION 3 


COPY OF UNO FROM PAGE 4 


OZOZESWLL 2 PIM JopJoouRD OYd| sdI0A [eTIq “ST 


I pp 


EC128 


0115 
0116 
0117 
0118 
0119 
0120 
0121 
0122 
0123 
0124 
0125 
0126 
0127 
0128 
0129 
0130 
0131 
0132 
0133 
0134 
0135 
0136 
0137 
0138 
0139 
0140 
O141 
0142 
0143 
0144 
0145 
0146 
0147 
0148 
0149 
0150 
0151 
0152 
0153 
0154 
0155 
0156 
0157 
0158 
o1s9 
0160 
0161 
0162 


0163 
0164 
0165 
0166 


0167 
0168 


0000 


0000 
ao000 


0000 


0000 


0000 


0000 


0000 


0000 
0000 


0000 
0000 


0000 
0000 
0000 
0000 
0001 
0002 
OO1A 
OOI1A 
OO1A 
0018 
oo01ic 
ooIc 
0010 
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0012 
0013 


0014 
0015 
0016 
0017 
0018 


0060 


0061 
0062 


0063 
0064 


0065 
0066 
0067 


0068 
0069 
006A 
0068 
oo6c 
0060 


OO6E 
0076 


0078 
OO7F 


FF8O 
FFOO 


FF80 
0028 


FF8O 
oct 


FF80 
O1CF 


AONE 
SONE 


ADAO 
ADY 1! 
AOINCO 
AOM7 
ADUN1I4 


H 


HANGT 
HCNTR 


ABSSO 
ABSSOF 


ABSEO 
ABSOUT 
AELSBS 


ABSYO 
ABSY 

AYLSBS 
1ABSY 
CUTOFF 
ABSY OF 


MO 
MB 


INCO 
INC7 


Al27PM 
UNOPM 


EQu 
EQu 


EQU 
EQu 
EQu 
EQU 
EQU 


EQU 


EQU 
EQu 


EQU 
EQU 


EQU 
EQU 
EQqu 


EQqu 
EQU 
EQU 
EQu 
EQU 
EQU 


EQu 
EQu 


EQU 
EQqu 


EQU 
EQu 


110 
118 


120 
127 


P5PM+A127 
P4PM+UNO 


aes 3s 8 


* 2 @ @ & 


14:10:03 11-19-85 
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HOLOS 1 
HOLDS SATURATION 1} 


AO DATA MEM ADRS 
Yt! DATA MEM AORS 
INCO DATA MEM ADRS 
M7 DATA MEM ADRS 
UN14 DATA MEM ADRS 


MODULO 16 COUNTER 


HANG OVER COUNTER RESET VALUE 
HANG OVER COUNTER 


iSO} 
SHORT TAU LPF 2*/S0} 


{OUTPUT} 
LONG TAU LPF 
LONG TAU LPF 


;OUTPUT; MSBS 
1OUTPUT; LSBS 


1YO} 
LONG TAU LPF {Y0;} MSBS 
LONG TAU LPF {YO}; LSBS 


1/ABSY 
ABSY CUTOFF LVL FOR NO UPDATE 
SHORT TAU LPF ;YO} 


LOCAL MAXIMA (K=0) 
LOCAL MAXIMA (K=8) 


UPDATE INCREMENT (K=0) 
UPDATE INCREMENT (Ke7) 


Al27 PROG MEM ADRS 
UNO PROG MEM ADRS 


MKAKRHEHGHHKKKHHOAEHSHHHKHSHKHHHKRHKEHHHKRHKHKHHHEHHHHHHHROHKHHHKRRHRHEHREHHHHH 


INTERRUPT BRANCHES 


2 HM © OF OF MD OF © OF OF © OF DD OF ME ME OF OF wD ED FD OE OF RO oe ee Ot te he te ee ee 


AORG 


AORG 


ON HARDWARE RESET GO TO INIT 


ON RINT GO TO RXRT 


ON TINT GO TO TXRT 


Ec128 


0170 
0171 
0172 
0173 
0174 
0175 
0176 
0177 
0178 
0179 
0180 


0181 
0182 
0183 
0184 
0185 
0186 
0187 
0188 
0189 


0190 
0191 
0192 
0193 
0194 
0195 
0196 
0197 
0198 
0199 
0200 
0205 
0202 
0203 
0204 
0205 
0206 
0207 
0208 
0209 
0210 
0211 
O2l2 
0213 


0214 
0215 
0216 


0217 
0218 
0219 
0220 
0221 
0222 


OOlE 
0028 
0028 
0028 
0029 
0029 
002A 
002B 
002c 
002c 


oa20 
0020 
Q02E 
002F 
0030 
0030 


0031 


0031 
0031 
0032 
0032 
0033 
0033 
0034 
0034 
0035 
0035 
0036 
0037 
0038 
0038 
0039 
003A 
0038 
0038 
003C 
003C 
0030 
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csoo0 
0001 
2E00 
6068 


5068 


0001 
27F8 
6068 


5168 


C160 
CA00 
CBIF 
60A0 
p00! 
0030 
6004 
poo! 
FFFF 
6001 
6060 


6061 


14:10:03 


11-19-85 
PAGE 0004 


SHH HHHHDHSHHHEHEHHKHHHHHHEHKHHHKHHHRHKSHHKRH HHH KHHKOKRHSHHHHRKRONREH OD 


* 
« 
° 


PROCESSOR INITIALIZATION ROUTINE 


TROP RRR RRR ERE RRR RRR ERE RE RRR E RARER STEER EERE REREAD 


INIT 


aes: 


eos e@ 8 8 8 


AORG 40 
LOPK 0 
LALK >2E00 
SACL TEMP 1 
LST TEMP! 
LALK >27F8 
SACL TEMP 1 
LST TEMP 1 


INITIALIZE PAGE 0 


LARK ARI ,96 
ZAC 

RPTK 31 
SACL + 
LALK >0030 
SACL IMR 
LALK >FFFF 


SACL OXR 
SACL DRR1 


SACL ORR2 


ese ee ee et 


INTIALIZE STO AND STI 


0010 
DATA 
0 -> 
' -> 
lt -> 
ag -> 
Lt -> 
0010 
DATA 
0 -> 
Go -> 
1 -> 
1 -> 
i -> 
Q -> 
0 -> 
1 -> 


1110 0000 0000 IN BINARY 
FOR STO 

oP PG POINTER SET TO 0 
INTM INTERRUPTS DISABLED 
OVM OVERFLOW SATURATION 
Ov OVERLOW REG CLEARED 
ARP AR POINTER SET TO 1 
O11! 1111 1000 IN BINARY 
FOR STI 

PM NO P REG SHIFTING 
TXM FSX IS AN INPUT 

FO ORR, OXR TO 8 BITS 
XF XF PIN SET TO HIGH 
SXM SIGN EXTENSION ON 
Tc TC FLAG BIT RESET 
CNF BO IS DATA MEM 

ARB AND 1! -> ARP 


LOWEST PAGE 0 LOCATION -> ARI 


0 -> 


ZERO PAGE 0 


ACC 


ENABLE XINT,RINT 


DISBALE TINT, INTO,INTI,INT2 


MU-LAW FFFF 


= LINEAR 0 


cep 


OZOZESWL & YIM JoTJOOURD OYdY SdIOA [ENZIC “CT 


EC128 


0223 


0224 
a225 
0226 


0227 
0228 
0229 


0230 
0231 
0232 


0233 
0234 
0235 


0236 
0237 
0238 


0239 
0240 
0241 
0242 
0243 
0244 
0245 
0246 


0247 
0248 
0249 
0250 
0251 
0252 
0253 
0254 
0255 
0256 
U257 
0258 
0259 
0260 
0261 
0262 
0263 
0264 


0265 
0266 
0267 


0268 
0269 
0270 


0030 
003E 
OO3F 
0040 
0040 
0041 

0042 
0043 
0043 
0044 
0045 
0046 
0046 
0047 
0048 
0049 
0049 
QO4A 
0048 
004C 
094C 
0040 
004E 
OO4F 
OO4F 


OO4F 
OO4F 
00s0 
0051 
0051 
0052 
0052 
0053 
0053 
0054 


0054 
0054 
0055 
0055 
0056 
0056 
0057 
0057 
0058 
0059 
OO5A 
OO5A 
ooSB 
oosc 
005D 
0050 
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0001 
0300 
6065 


0001 
0084 
6066 
0001 
FFF9 
6067 
0001 
0800 
6060 
0001 
000! 
606E 
0001 


036E 
606F 


0100 
0200 


CAO0O 
COFF 


60A0 


CBFF 
60A0 
C807 
D001 
000! 
6012 
0001 
AF FF 
6013 


0001 


es 


LALK 
SACL 
LALK 
SACL 
LALK 
SACL 
LALK 
SACL 
LALK 
SACL 
LALK 


SACL 


XTBL 
BADDR 
132 
BIAS2 
-7 
NEG7 
THRESO 
THRES 
1 

ONE 
PIDM+Y 143-1 


ADY 142 


1.0 85.157 14:10:03 11-19-85 


PAGE 0005 


INITIALIZE PAGE 4 AND 5 


LRLK 


ZAC 
RPTK 


SACL 


ARI ,512 


255 


“+ 


“ LOWEST PAGE 4 ADDRESS -> ARI 


* 0 -> ACC 


“ ZERO PAGE 4 AND 5 


INITIALIZE PAGE 6 ANO 7 


RPTK 
SACL 
LOPK 
LALK 
SACL 
LALK 
SACL 


LALK 


255 


+ 


SONE 


PSDM+A0 


* ZERO PAGE 6 AND 7 


Eci26 


0271 
0272 
0273 


0274 
0275 
0276 


0277 
0278 
0279 


0280 
0281 
0282 


0283 
0284 
0285 


0286 
0287 
0288 


0289 
0290 
0291 


0292 
0293 
0294 


0295 
0296 
0297 


0298 
0299 
0300 
0301 
0302 
0303 
0304 
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O2FF 
6014 


000! 
0301 
6015 


0001 
03F8 
6016 


0001 
O3F5 
6017 


0001 
O20E 
6018 


000} 
0258 
6061 


0001 
0400 
6069 


0001 
0020 
606B 


0001 
002! 
606C 
0001 
0400 
6060 
606E 
CcCE0O 


FF80 
OBE 


SACL 
LALK 
SACL 
LALK 
SACL 
LALK 
SACL 
LALK 
SACL 
LALK 
SACL 
LALK 
SACL 
LALK 
SACL 
LALK 
SACL 
LALK 
SACL 
SACL. 
E INT 


B LOOP 


ADAO 
P6DM+Y0+1 
ADY 1 
P70M+ INCO 
ADINCO 
P70M+M8- | 
ADM7 
P4DM+UNI5~1 
ADUNI4 
HANGTO 
HANGT 
>400 

ABSY 

>20 

IABSY 
CUTOF 0 
CUTOFF 
>400 
ABSYOF 


MO 


1.0 85.157 


* »400 = 


14:10:03 11-19-85 
PAGE 0006 


1/8 OF MAX ABSY 


0311 OO7F 
0312 OO7F 
0313 0080 


0317 0080 
0318 0080 
0319 0081 
0320 008! 
0321 00862 
0322 0082 
0323 0083 
0324 0083 
0325 0084 


OZOZESWLL & YIM saTJoOURD OYIY so10A [BUSI “CI 
° 


0329 0084 
0330 0084 
033i 00865 
0332 0085 
0333 0086 
0334 0086 
0335 0087 
0336 0087 
0337 0088 
0338 0088 
0339 0089 


0343 0089 
0344 0089 
0345 QO08A 
0346 008A 
0347 0088 
0348 008B 
0349 O08C 


0353 o008C 
0354 008C 
0355 0060 
0356 0080 
0357 OOBE 
0358 OO8E 
0359 QO6F 
0360 OO8F 
0361 0090 
0362 0090 


toy 
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c8oo0 


4161 
0065 
C806 


5800 


2000 
CE18 
c807 
6068 


c800 


4160 
0065 


5872 


4171 
4870 
1070 
4872 


1c72 


PAGE 0007 


MHHRSSHHSHHHHSHHHHHHHHHHHHHHHHHNHNRHHEHHS HHH HHHHAHEHHRHHR NHK E OOH 


bg CYCLE START ROUTINE 


° 
We 1 FE DO at 1 He 1 OF TE MH OE Ob MH 0 OF HF OE OF OE Ot SE OF OF MO TD OD OF OF OF OF OF OF Ee OF ee ee Oe ee 


START LOPK 0 


CONVERT MU-LAW INPUT REFERENCE SAMPLE TO LINEAR (YO) 


ZALS DRR2 * MU-LAW Y(0) -> ACC 
A0O BADOR * ADD MU-LAW TABLE BASE ADORESS 
LOPK 6 
TBLR Yo * LINEAR Y(O) -> YO 
* : 
if COMPUTE ABSOLUTE VALUE OF YO 
LAC Yo "“ YO -> ACC 
ABS 
LOPK 7 
SACL ABSYO * {YO} -> ABSYO ON PAGE 7 
LDOPK 0 
* 
CONVERT MU-LAW NEAR END SAMPLE TO LINEAR (SODC) 
ZALS ORR1 * MU-~LAW S(0)DC -> ACC 
ADD BADOR * ADD MU-LAW TABLE BASE ADDRESS 


TBLR Sooc * LINEAR S(0)0C -> SODC 


COMPUTE HIGH PASS FILTEREO NEAR END SAMPLE (SO) 


ZALS SOLSBS * SOLSBS -> LOW ACC 

AOOH so * sO (MSBS) -> HIGH ACC 

suB SO,HTAU * ACC - SO * 2*°*HTAU -> ACC 
AOOH sooc * ACC + SODC * 2*°*16 -> ACC 
SsuB SODC ,HTAU-1 * ACC - SODC * 2**HTAU-I -> ACC 


EC128 


0363 
0364 
0365 
0366 
0367 
0368 
0369 
0370 
0371 

0372 
0373 
0374 
0375 
0376 
0377 
0378 
0379 
0380 
0381 

0382 


009} 
009! 
0092 
0092 
0093 
0093 
0094 
0094 
0095 
0095 
0096 


0096 
0096 
0097 
0097 
0098 
0098 
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4473 
\ 
0C73 
6071 
6870 


5672 


CEIB 
C807 


6863 


SUBH 
ADO 

SACL 
SACH 


DMOV 


COMPUTE 


ABS 
LDPK 


SACH 


1.0 85.157 
S1p0c > 
S10C,HTAU~1 . 
SOLSBS * 
so af 
s00C ig 


ABSOLUTE VALUE OF 


ABSSO - 


14:10:03 IJ1-t 


9-85 


PAGE 0008 


ACC - SIDC * 2**16 -> ACC 


ACC + SINC * 2**HTAU-1 


LOW ACC -> SOLSBS 


HIGH ACC ~> SO (MSBS) 


sooc -> $10C 


so 


150 


-> ABSSO ON PAGE 7 


-> ACC 


OZOZESWLL & YIM Jo[JI0URD OYOY O10, [eU3Iq “SI 


EC128 


0384 
0385 
0386 
0387 
0388 
0389 
0390 
0391 
0392 
0393 
0394 
0395 
0396 
0397 
0398 
0399 
0400 
0401 
0402 
0403 
0404 
0405 
0406 
0407 
0408 
a409 
0410 
0411 
0412 
0413 
0414 
0415 
0416 


0417 
0418 
0419 
0420 
0421 
0422 
0423 
0424 
0425 
0426 
0427 
0428 
0429 
0430 
0431 
0432 
0433 
0434 
0435 
0436 
0437 
0438 


0099 
0099 
OO9A 


009A 
009A 
0098 
0098 
ao9c 
009C 
00390 
0090 
OO9E 


009E 
O09E 
OO9F 
OO9F 
OOAO 
OQOAO 
OOAI 
OOAl 
O0A2 
Q0A2 
Q0A3 
QOA4 
Q0A4 
Q0A5 
OOA5 
OOA6 
OOA6 
OOA7 


OOA7 
OOA7 
O0A8B 
00A8 
Q0A9 
00A9 
QGAA 
OOAA 
OOAB 
OO0AB 
QOAC 
OOAC 
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cs800 


5589 
316F 
CBOE 


5690 


2E6E 
A000 
CEOS 
CB7F 


5C90 
FF8O 


CE04 
CEIS 
696B 


2070 
1068 
606C 
C807 
CE1B 
6065 


14:10:03 11-19-85 
PAGE 0009 


TAPP OPPO eee RES ERR RR ERE REAR ERR REAR RRR EERE RR ERE REE REREA RS 
* 

be ECHO ESTIMATION ROUTINE 

* 


$8 9 1 Ot we Ot He te OF OE HF OE MF ME OE OE St 0 SE HF TF MOF A TE SE OE BF OF te HF HOF SP TE OF DD tm MF OF He Ot me OF tt ee em ee ee 


EESTR LOPK 0 


MOVE Y128,Y129,...,Y142 TO NEXT HIGHER MEMORY LOCATION 


LARP ARI * 1 -> AR POINTER 
LAR ARI ,ADY 142 * ADY142 -> ARI 
RPTK 14 “K = 142,141,...,128 
DMOV “- * Y(K) => Y(K+1) 

hel : 

: CONVOLVE REFERENCE SAMPLES WITH FIR COEFFICIENTS 

FIR LAC ONE ,14 * ROUND-OFF OFFSET -> ACC 
MPYK 0 "O->P 
CNFP * ARL STILL POINTS AT Y127 
RPTK 127 "K = 127,126,....0 
MACD A127PM, *- *" Y(K) * A(I-K) + ACC -> ACC 
CNFD 
APAC * P + ACC -> ACC 
SACH EEST, 1 * 2. * HIGH ACC -> EEST 


s 


COMPUTE THE OUTPUT 


LAC so * $O -> ACC 

SuB EEST * ACC - EEST -> ACC 

SACL OUTPUT * ACC -> OUTPUT 

LOPK 7 

ABS 

SACL ABSEO * ACC -> ABSEO ON PAGE 7 


EC128 


0440 
0441 
0442 
0443 
0444 
0445 
0446 
0447 
0448 
0449 
0450 
0451 
0452 
0453 
0454 
0455 


0456 
0457 
0458 
0459 
0460 
0461 


0462 
0463 


0464 
0465 
0466 
0467 
0468 
0469 


Q0AD 
OOAD 
QOAD 
OOAE 
OOAE 
QOAF 
OOAF 
QOoBO 
OOBO 
OoBl 
0oB1 
0062 
0083 
0083 
00B4 
0084 
00B5 
0085 
00B6 
00B7 
00B7 
0068 
00B9 
0089 
GOBA 
OOBA 
008B 
0088 
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PAGE 0010 
NNER EMER ENON HEH OREN HEHE EM RHEE RHEE EERE THRONE HOE 
* 
i RESIDUAL OUTPUT SUPPRESSION ROUTINE 
3C6B SPRS LT IABSY = YTABSY -> T REG 
3866 MPY ABSOUT * ABSOUT * IABSY ~> P REG 
2062 LAC HCNTR *" NEAR END SPEECH FLAG -> ACC 
c800 LOPK 0 
F180 BGZ WOUT * IF N.E. SPEECH NO SPRS 
Q0BB 
CE14 PAC * P REG -> ACC 
1060 SUB THRES * ACC - THRES -> ACC 
F180 BGZ WOUT * IF THRES EXCEEDED SKIP SPRS 
ooBB 
FA80 Broz WOU T * IF BIO PIN LOW SKIP SPRS 
0088 
CAO0 ZAC * 0 -> ACC 
606C SACL OUTPUT * ACC -> OUTPUT 
3C6C WwOUT LT OUTPUT * OUTPUT -> f REG (FOR UNO) 


OCOZESWLL ® YUM JalfaouRd oYydy Id10A [eusiq “S| 


44 


aosBc 
oOoBC 
0080 
0080 
OOBE 
OOBE 
OOBF 
oOoBF 
o0co 
ooc! 
00c! 
o0c2 
00c2 
00c3 
00c3 
ooc4 
o0c4 
oocs 
oocs 
oocé 
o0c7 
00c7 
ooce 
00cs 
00c9 
ooc9 
QOCA 
OOCA 
oocs 
oocs 
aocc 
oocc 
ooco 
OOoce 
OOCcE 
OOCF 
00D0 
0000 
ooo0 
ocol 
0001 
ocod2 
0002 
0003 
0003 
0004 
0004 
ao0o5 
000s 
0006 
0007 
0007 
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406C 
ce18 
CEI8 


F380 
0000 


4866 
3167 
CB06 
CEA2 


DEO4 
Fooo 


6868 
7169 
4069 
CEIB 
0268 


0406 
FFOO 


FF80 
OOoDvE 
CE 1B 
4866 
3167 
CB06 
CEA2 


DE04 
F000 


6868 


PAGE O01! 


SPHRKHHHHHHHHKRHHHHHRHHHMHHHHSHHEHKHHHEHRHHRESHHSHHEHEHOHHHHHHEHHRH HOO e 
* 

. LINEAR TO MU-LAW COMPRESSION ROUTINE 

° 
SHOSHHHEREHSRHRARSEKRHAHHHEHDKEHKRHHHNAHEHHHHHRHHHHRHSHHRHHEHRHHHRAHKRHHRSHR Ee 


CMPRS ZALH OUTPUT * OUTPUT -> ACC 
SFL 
SFL * LEFT JUSTIFY ACC 
BLZ NEGCMP ° IF ACC < 0 THEN GO TO NEGCMP 


POSCMP ADOH BIAS2 


LAR ARI ,NEG7 
RPTK 6 * FINO MSB 
NORM 
ANOK >F000,14 * ZERO 2 MSBS AND ALL LSBS 
SACH Q 
SAR ARI,SI 
ZALH Ss! 
ABS 
ADO Q,2 
XORK >FFOO,4 * INVERT ALL BITS 
8 TXOUT 
NEGCMP ABS * LEFT JUSTIFIED OUTPUT IN ACC 


ADOH BIAS2 


LAR ARI ,NEG7 

RPTK 6 * FIND MSB 

NORM 

ANOK >FO00,14 * ZERO 2 MSBS AND ALL LSBS 
SACH Q 


EC128 


0523 
0524 
0525 
0526 
0527 
0528 
0529 
0530 
053! 
0532 


0533 
0534 


0008 
0008 
0009 
0009 
OOOA 
OODA 
0008 
0008 
oooc 
oo0p0Cc 
ooDD 
oovEe 
OODE 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 14:10:03 11-19-85 
PAGE 0012 

7169 SAR AR1,.SI 

4069 ZALH SI 

CE18 ABS 

0268 AOD Q.2 

0406 XORK >7F 00,4 * INVERT ALL BITS IN Q 

7FO0 

6CO!t TXOUT SACH OXR,4 "= 2°"4 * HIGH ACC -> DXR 


obY 


OZOZESWLL & WIM sa]J20URD OYDy 210A [UTI “CI 


EC128 


0536 
0537 
0538 
0539 
0540 
0541 

0542 
0543 
0544 
0545 
0546 
0547 
0548 
0549 
0550 
0551 

0552 
0553 
0554 
0555 
0556 
0557 
0558 
0559 
0560 
0561 

0562 
0563 
0564 
0565 
0566 
0567 
0568 
0569 
0570 
0571 

0572 
0573 
0574 
0575 
0576 
0577 
0578 
0579 
0580 
0581 

0582 
0583 
0584 
0585 
0586 
0587 
0588 


OODF 
OODF 
O0E0 


OO0EO 
QOEO 
OOE! 
OOE | 
OOE2 
OOE2 
QO0E3 
00E3 
OOE4 
OOoE4 
QOES 
OOE5 
OOE6 


OOE6 
OOES6 
00E7 
QOE7 
goes 
OOEs 
OOE9 
O0E9 
OOEA 
OOEA 
OOEB 
OOEB 
OOEC 
OOoEC 
OOED 


OOED 
OOED 
OOEE 
OOEE 
OOEF 
OOEF 
OOFG 
OOFO 
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C807 


4066 
4967 
1966 
0965 
6866 


6067 


4069 
496A 
1969 
0968 
096C 
6869 


606A 


4012 
CBOE 
4769 


606B 
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AH OHANH HEHEHE HHN HH HH HANH HHHHHH HHH EROHENEH HHH HEHHNRH RHR EHS 
* POWER ESTIMATION ROUTINE 
« 


tt 9b 9b aE ME EH OE OF 0 OE OE HE OF OE HF OE OF OF He OE OE OF OE OF OE OE OF HE HE TF OD OF OF OF OFM FE OF OF OF OF OF OR OE OF OY OE TE OO te HH tte te 


NORM —_LOPK 7 * T REG STILL CONTAINS OUTPUT 

? UPDATE LONG TAU OUTPUT POWER ESTIMATE (ABSOUT) 
ZALH = ABSOUT * ABSOUT -> HIGH ACC 
ADDS  AELSBS * AELSBS -> LOW ACC 
SUB ABSOUT, LTAU * ACC ~ ABSOUT * 2**LTAU -> ACC 
ADD ABSEO,LTAU " ACC + ABSEO * 2**LTAU -> ACC 
SACH — ABSOUT * HIGH ACC -> ABSOUT 
SACL — AELSBS * LOW ACC -> AELSBS 

* 

. UPDATE LONG TAU REFERENCE POWER ESTIMATE (ABSY) 
ZALH — ABSY * ABSY -> HIGH ACC 
ADDS  AYLSBS * AYLSBS -> LOW ACC 
SUB ABSY,LTAU " ACC ~ ABSY * 2**LTAU -> ACC 
ADD ABSYO,LTAU * ACC + ABSYO * 2**LTAU -> ACC 
ADD CUTOFF .LTAU * ACC + CUTOFF * 2**LTAU -> ACC 
SACH — ABSY * HIGH ACC -> ABSY 
SACL  AYLSBS * LOW ACC -> AYLSBS 


COMPUTE 1/ABSY (DIVIDE | BY ABSY) 


ZALH AONE 
RPTK 14 

SUBC ABSY 
SACL 1ABSY 


EC128 


0590 
0591 
0592 
0593 
0594 
0595 
0596 
0597 
0598 
0599 
0600 
0601 
0602 
0603 
0604 
0605 
0606 
0607 
0608 
0609 
0610 
0611 
0612 
0613 
0614 
0615 
0616 
0617 
0618 
0619 
0620 


0621 
0622 
0623 
0624 


0625 
0626 
0627 
0628 
0629 
0630 


0631 
0632 
0633 
0634 


0635 
0636 
0637 
0638 


0639 
0640 
0641 


OOF I 


OOF 1 
OOF | 
oaF2 
OOF 2 
OOF 3 
OOF 3 
OOF 4 
OOF 4 
OOFS 


OOFS 
OOFS5 
OOFE6 
OOF6 
OOF 7 


OOF 7 
OOF7 
ooF8 
OOF9 
OOFS 
OOFA 
OOFA 
OoFB 
OOFC 
OOFC 
OOFD 
OOFD 
OOFE 
OOFE 
OOFF 
0100 
0100 
0101 
0101 
0102 
0103 
0103 
0104 
0104 
0105 
0106 
0106 
0107 
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3118 
CBOD 
5690 


5680 


3868 


CE14 


F480 
0100 


0013 


F480 
0106 


CA00 
1013 


FF80 
0107 


1013 


F280 
0106 


2013 


FF8O 
0107 


CE14 
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RHEHRHHHHRHSHHRAAHHHHNHHHHHHRHHEHHHHHRHHHRHERHKHHHHKEHEHEHHH HARE EHH ED 
: OUTPUT NORMALIZATION ROUTINE 

* 

TARR ESE RRR ERR ERE RS ERE SERRE ERE ESE R RRR RRR ERE ARREARS) 


. 
. MOVE UNO,UNI,....UN14 TO NEXT HIGHER MEMORY LOCATION 
* 
LAR AR1,ADUN14 * ADUNI4 -> ARI 
RPTK 13 ® K=14,13,.-001 
DMOV - * UN(K) -> UN(K+1) 
DMOV . * UN(O) -> UN(E) 
. 
* COMPUTE NORMALIZED OUTPUT (UNO) 
* 
MPY 1ABSY * IABSY * T REG(OUTPUT) -> P REG 
PAC * P REG (UNO) -> ACC 
* 
. SATURATE NORMALIZED OUTPUT (UNO) AT +/- 1.0 
* 
BGEZ  POSUNO * IF UNO > 0 THEN GO TO POSUNO 
NEGUNO ADD SONE * ACC + SONE -> ACC 
BGEZ § SMLUNO * IF -1.0 < UNO < 0 THEN NO SATR 
ZAC * 0 -> ACC 
SUB SONE * ACC - SONE -> ACC 
B SAVUNO 
POSUNO SUB SONE * ACC - SONE -> ACC 
BLEZ § SMLUNO “ 1F 0 < UNO < 1.0 THEN NO SATR 
LAC SONE * SONE -> ACC 
8 SAVUNO 
SMLUNO PAC * P REG (UNO) -> ACC 


OZOZESWLL & YIM saTJaOURD OYDY sd10A [eUsIG “C1 


Leb 


EC 128 


0642 O107 6011 
0643 0108 
0644 0108 6080 


SAVUNO SACL 


SACL 
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CUNO 


° 


1.0 865.157 


*" ACC -> CUNO. 


*" ACC -> UN(0) 


14:10:03 


11-19-85 ECci28 


PAGE 0015 
0646 


0647. 


0648 
0649 
0650 
06S!1 

0652 
0653 
0654 
0655 
0656 
0657 
0658 
0659 
0660 
0661 

0662 
0663 
0664 
0665 
0666 
0667 
0668 
0669 
0670 
067! 

0672 
0673 
0674 
0675 
0676 
0677 
0678 
0679 
0680 
0681 

0682 
0683 
0684 
0685 
0686 


0687 
0688 
0689 
0690 


0691 
0692 
0693 
0694 
0695 
0696 
0697 
0698 
0699 
0700 


0109 
0109 
OL10A 


OL10A 
O10A 
0108 
0108 
010C 
o1oc 
0:00 
0100 
O10E 


010E 
010E 
O1oF 
O10F 
O110 
o110 
Oli! 
Olli 
O12 


O112 
0112 
0113 
0113 
O1l4 
0114 
O1ls 
O1l6 
0116 
O117 
0117 
0118 
O1lg 


0119 
0119 
OLA 
OLA 
O1riB 
0118 
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C807 


4060 
1860 
0868 
6860 


4064 
1B64 
0c63 
6864 


2060 
0012 


0004 
oooFr 


6060 


F180 
O1IF 


3117 
CB07 
5690 
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SMHS HSHHRHHSEHHHHHHARHKEKHHHHHHHSEHKHHHHEHEKRHEHAHHHHHHHHHRHRHEHEKNHEA DO 


« 
° 
2 


NEAR-END SPEECH DETECTION ROUTINE 


AMAR HHKRHROHHHKSHEHHHHHKRHKHHKHKHHHHEKRHHHKHHHHREKHHEHHOKRHRHRRHHHHE EHOW 


NESP 


LOPK 7 


UPDATE SHORT TAU REFERENCE POWER ESTIMATE (ABSYOF ) 


ZALH ABSYOF id 
suB ABSYOF ,STAU = 
ADO ABSYO,STAU * 


SACH ABSYOF ® 


UPDATE SHORT TAU NEAR END 


ZALH ABSSOF * 
SUB ABSSOF ,STAU * 
ADD ABSSO,STAU+NER ° 


SACH ABSSOF sed 


ABSYOF * 2**°16 -> ACC 
ACC - ABSYOF * 2**STAU ~-> ACC 
ACC + ABSYO * 2**STAU -> ACC 


HIGH ACC -> ABSYOF 
POWER ESTIMATE (ABSSOF ) 


ABSSOF * 2**16 -> ACC 

ACC - ABSSOF * 2**STAU ~> ACC 
ACC + ABSSO*2°"STAU+NER -> ACC 
HIGH ACC .-> ABSSOF 


UPDATE MODULO 16 COUNTER (H) 


LAC H *. 
ADD AONE ° 
ANOK >OOOF © 
SACL H * 
BGZ NESP 1 id 


MOVE MO,MI,...+M7 TO NEXT 


LAR AR1,ADM7 of 


RPTK 7 * 


H -> ACC 
ACC + 1 => ACC 


IF ACC = 16 THEN 0 -> ACC 


AcC -> H 


IF H > O THEN GO TO NESPI 


HIGHER MEMORY LOCATION 


AOM7 -> ARI 
K=7,6,...50 


M(K) -> M(K+1) 


8br 


OZOZESWLL & YM sa]J20URD OYDg sd10A [eNZIG “ST 


EC128 


0701 
0702 
0703 
0704 


0705 
0706 
0707 
0708 
0709 
0710 
o711 
O712 
0713 
0714 


0715 
O716 
0717 
0718 
0719 
0720 
0721 
0722 
0723 
0724 
0725 
0726 


0727 
0728 
0729 
0730 
0731 
0732 
0733 


0734 
0735 
0736 
0737 
0738 
0739 
0740 


0741 
0742 
0743 
0744 
0745 
0746 
0747 


0748 
0749 
0750 
0751 


arc 
OL1Ic 
o1t0 
0110 
Olle 
O1lIF 


O1LF 
OLIF 
0120 
0120 
0121 
0121 
0122 
0123 
0123 
0124 


O124 
0124 
0125 
0125 
0126 
0126 
0127 
0128 
0128 
0128 
0129 
0129 
O12A 
012A 
012B 
o12Cc 
o1rec 
o12C 
0120 
0120 
O12E 
012E 
O12F 
0130 
0130 
0130 
O31 

0131 

0132 
0132 
0133 
0134 
0134 
0134 
0135 


32020 FAMILY MACRO ASSEMBLER PC !.0 85.157 


5660 


FF8O 
0149 


2060 
106E 


F280 
0124 


566D 


2064 
106E 
F280 
0149 
2064 
106F 
F280 
0149 
2064 
1070 
F280 
0149 
2064 
1071 

F280 


0149 


2064 


NESP1 


NESP2 


OMOV 


8 


UPDATE MOST RECENT 


LAC 
SUB 


BLEZ 


DMOV 


COMPARE 


LAC 
SUB 


BLEZ 


LAC 
SUB 


BLEZ 


LAC 
SUB 


BLEZ 


LAC 


SUB 


BLEZ 


LAC 


ABSYOF 


NESP3 


ABSYOF 
MO 


NESP2 


ABSYOF 


REFERENCE POWER 


ABSSOF 
MO 


NESP3 


ABSSOF 
MO+1 


NESP3 


ABSSOF 
MO+2 


NESP3 


ABSSOF 


M0+3 


NESP3 


ABSSOF 


$4:10:03 11~19-85 
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“ ABSYOF -> MO 


" ON MEMORY MOVES SKIP DETECTION 


LOCAL MAXIMA (MO) 


ABSYOF -> ACC 
ACC - MO ~> ACC 


IF MO > ABSYOF THEN NO UPDATE 


ABSYOF -> MO 


TO NEAR-END POWER 


ABSSOF -> ACC 

ACC - MO -> ACC 

NO N.E. SPEECH IF MO > ABSSOF 
ABSSOF -> ACC 

ACC - MI -> ACC 
NO N.E. SPEECH IF M1 > ABSSOF 
ABSSOF -> ACC 
ACC - M2 -> ACC 
NO N.E. SPEECH IF M2 > ABSSOF 
ABSSOF -> ACC 

AGC - M3 -> ACC 

NO N.E. SPEECH IF M3 > ABSSOF 


ABSSOF -> ACC 


EC128 


0752 
0753 
0754 


0755 
0756 
0757 
0758 
0759 
0760 
0761 


0762 
0763 
0764 
0765 
0766 
0767 
0768 


0769 
0770 
0771 
0772 
0773 
0774 
0775 


0776 
0777 
0778 
0779 
0780 
0781 
0782 


0783 
0784 
0785 
0786 
0787 
0788 
0789 
0790 
0791 
0792 
0793 
0794 
0795 
0796 


0797 
0798 
0799 
0800 
oso! 
0802 


0135 
0136 
0136 
0137 
0138 
0138 
0138 
0139 
0139 
O13A 
O13A 
0138 
O13C 
013C 
013C 
0130 
0130 
OI3E 
O13E 
013F 
0140 
0140 
0140 
O14t 
0141 
O142 
0142 
0143 
0144 
0144 
0144 
0145 
0145 
0146 
0146 
0147 
0148 


0148 
0148 
0149 


0149 
0149 
O14A 
014A 
0148 
014C 
014C 
0140 
0140 
O14E 
O14E 
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1072 
F280 
0149 
2064 
1073 
F280 
0149 
2064 
1074 
F280 
0149 
2064 
1075 
F280 
0149 
2064 
1076 


F280 
0149 


566! 


2062 


F680 
0150 


1012 
6062 


FF8O 


NESP3 


SUB MO+4 * 
BLEZ NESP3 . 
LAC ABSSOF bd 
SuB MO+5 * 
BLEZ NESP3 7 
LAC ABSSOF = 
SUB MO+6 © 
BLEZ NESP3 ” 
LAC ABSSOF = 
suB MO+7 : 
BLEZ NESP3 ia 
LAC ABSSOF : 
SUB MO0+8 ‘of 
BLEZ NESP3 * 


NEAR-END SPEECH DETECTED 
OMOV HANGT * 


CHECK AND UPDATE HANGOVER 


LAC HCNTR - 
BZ NESP4 = 
SUB AONE a 
SACL HCNTR © 
B LOOP = 


14:10:03 11-19-85 
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ACC - M4 -> ACC 


NO N.E. SPEECH IF M4 > ABSSOF 


ABSSOF -> ACC 
ACC - M5 -> ACC 


NO N.E. SPEECH IF M5 > ABSSOF 


ABSSOF -> ACC 
ACC - M6 -> ACC 


NO N.E. SPEECH IF M6 > ABSSOF 


ABSSOF -> ACC 


ACC - M7 -> ACC 


NO N.E. SPEECH IF M7 > ABSSOF 
ABSSOF -> ACC 

ACC - M8 -> ACC 

NO N.E. SPEECH IF M8 > ABSSOF 


SET HANGOVER COUNTER (HCNTR) 


HANGT -> HCNTR 


COUNTER 


HCNTR -> ACC 


IF HCNTR = 0 THEN GO TO NESP4 


ACC - 1 -> ACC 
ACC -> HCNTR 


GO TO CYCLE END 


OZOZESWL & IM JoTa0ueD OYdY so10A [eNSIG “CT 


6bb 


EC128 


0803 
0804 
0805 
0806 
0807 
0808 
0809 
0810 
o8tt 
0812 


O1L4F 
0150 


0150 
0150 
O151 
ors! 
0152 
0152 
0153 
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O:BE 
« 
ig CHECK IF LTAU REFERENCE POWER ESTIMATE 1S BELOW CUTOFF 
* 
2069 NESP4 LAC ABSY * ABSY -> ACC 
106C SUB CUTOFF * ACC - CUTOFF -> ACC 
F280 BLEZ LOOP * IF ABSY < CUTOFF THEN LOOP 
O1BE 


EC128 


0814 
0815 
0816 
0617 
0818 
0819 
0820 
0821 

0822 
0823 
0824 
0825 
0826 
0827 
0828 
0829 
0830 
0831 

0832 
0833 
0834 
0835 
0836 
0837 
0838 
0839 


0840 
0841 
0842 
0843 
0844 
0845 
0846 
0847 
0848 
0849 
0850 
0851 
0852 


0853 
0854 
0855 
0856 
0857 
0858 
0859 
0860 
0861 
0862 
0863 
0864 
0865 


0866 
0867 


0154 
0154 
0155 
0155 
0156 
0156 
0157 
0157 
0158 
0158 
0159 
0159 
OSA 
O1SA 
015A 
0158 
0158 
o015c 
o1sc 
0150 
0150 
O1SE 
O1SF 
O1SF 
0160 
0160 
0161 
0161 
0161 
0162 
0162 
0163 
0163 
0164 
0164 
0165 
0166 
0166 
0167 
0167 
0168 
0168 
0168 
0169 
0169 
016A 
O16A 
0168 
0168 
oL1é6éc 
0160 
0160 
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2015 
0060 
6010 
3110 
CEOS 


3C11 


2Fi2 
38A0 
CBOE 


5DA0 
FFO} 


3011 


6878 


2Fi2 
38A0 
CBOE 


5DA0 
FFO!t 


3011 


6879 


2Ft2 
38A0 
CBOE 


5DA0 
FFOI 


3011 
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PHHHHHHERA HAHAH HHO HHH RE HOHE HHH HHEHHHOHHHHEHRDME HE EHD Oe ee 


af 


. COEFFICIENT INCREMENT UPDATE ROUTINE 


* 
(RE RE REP E REAR E RE REET ERS EEE SRSA REREAD 


UPINC LAC ADY1 * ADYI1 -> ACC (YO IS NOW IN YI) 


AOD H * ACC + H -> ACC 


SACL TEMP3 


LAR AR1,TEMP3 * ADY! + H -> ARI 
CNFP 
LT CUNO * UNO -> T REG 
LAC AONE, 15 * ROUND-OFF OFFSET -> ACC 
MPY + * UN(O) * Y(O+H) -> P REG 
RPTK 14 "“K = 1,2,...,15 
e 

MAC UNOPM+1,*+ * UN(K) * Y(K+H) + AGC -> ACC 
LTA CUNO * P REG + ACC -> ACC UNO -> T 
SACH INCO * HIGH ACC -> INC(O) 
LAC AONE, 15 * ROUND-OFF OFFSET -> ACC 
MPY "+ = UN(O) * Y(164+H) -> P REG 

2s 
RPTK 14 OR 162525 15 ° 
MAC UNOPM+1, "+ * UN(K) ® Y(K+164+H) + ACC —> ACC 
LTA CUNO * P REG + ACC -> ACC UNO -> T 
SACH INCO+1 ® HIGH ACC -> INC(1) 
LAC AONE, 15 * ROUND-OFF OFFSET -> ACC 
MPY “+ * UN(O) © Y(32+H) -> P REG 
RPTK 14 *K = 1,2,...+15 
MAC UNOPM+1, *+ * UN(K) *® Y(K+32+H) + ACC -> ACC 
LTA CUNO * P REG + ACC -> ACC UNO -> T 


OSP 


OZOZESWLL & YUM JoT[ooURD OYA SdI0A [ENSIG “CT 


Ec1l28 


0868 OI6E 
0869 OI16E 
0870 OL6F 
0871 O16F 
0872 O16F 
0873 0170 
0874 0170 
0875 0171 
0876 0171 
0877 O172 
0878 0172 

0173 
0879 0174 
0880 0174 
068! 0175 
0882 0175 
0883 0176 
0884 0176 
088s 0176 
0886 0177 
0887 0177 
0888 0178 
0ss9 0178 
0890 0179 
0891 0179 

OL7A 
0892 0178 
0893 0178 
0894 O17C 
0895 O017C 
0896 0170 
0897 0170 
0898 017D 
0899 OL7E 
0900 OI7E 
0901 OL7F 
0902 O17F 
0903 0180 
0904 0180 

0181 
0905 0182 
0906 0182 
0907 0183 
0908 0183 
0909 0164 
o910 0184 
0911 0184 
0912 0185 
0913 0185 
0914 0186 
0915 0186 
0916 0167 
0917 0187 

0188 
0918 0189 
0919 0189 
0920 O16A 
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687A 


2Fi2 
38A0 
CBOE 


50A0 
FFOI 


3011 


687B 


2F 12 
38A0 
CBOE 


5DAO 
FFOL 


3011 


687C 


2F 12 
38A0 
CBOE 


S5DA0 
FFO) 


3011 


6870 


2F 12 
3B8A0 
CBOE 


5D0A0 
FFOI 


3011 


SACH 


LAC 
MPY 
RPTK 


MAC 


LTA 


SACH 


LAC 
MPY 
RPTK 


MAC 


LTA 


SACH 


LAC 
MPY 
RPTK 


MAC 


LTA 


SACH 


LAC 
MPY 
RPTK 


MAC 


LTA 


14:10:03 11-19-65 EC128 
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0921 
INCO+2 ® HIGH ACC -> INC(2) 0922 
0923 
0924 
AONE , 15 0925 
0926 
"+ 0927 
0928 
14 0929 
0930 
UNOPM+1 , *+ 
0931 
0932 
CUNO ' 0933 
0934 
INCO+3 0935 
0936 
0937 
AONE, 15 
+ 
14 


UNOPM+1, *+ 


CUNO 


INCO+4 


AONE, 15 
“+ 
14 


UNOPM+1,*%+ 


CUNO 


INCO+5 


AONE, 15 
“+ 
14 


UNOPM+1, "+ 


CUNO 


O1BA 
0188 
01868 
0188 
0186C 
018C 
0:80 
0180 
Ose 
O18E 
O18F 
0190 
0190 
0191 
0191 
0192 
0192 
0192 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 


687E 


2F 12 
38A0 
CBOE 


5DA0 
FFO! 


3011 


687F 


CE04 


SACH 


LAC 
MPY 
RPTK 


MAC 


LTA 


SACH 


CNFDO 


INCO+6 


AONE, 15 
*+ 
14 


UNOPM+1,°%+ 


CUNO 


INCO+7 
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Isp 


EC128 


0939 
0940 
0941 
0942 
0943 
0944 
0945 
0946 
0947 
0948 
0949 
0950 
0951 
0952 
0953 
0954 
0955 
0956 
0957 
0958 
0959 
0960 
0961 
0962 
0963 
0964 
0965 
0966 
0967 
0968 
0969 
0970 
0971 
0972 
0973 
0974 
0975 
0976 
0977 
0978 
0979 
0980 
0981 
0982 
0983 
0984 
0985 
0986 
0987 
0988 
0989 
0990 
0991 
0992 
0993 
0994 
0995 


0193 
0193 
0194 
0194 
0195 
0195 
0196 
0196 
0197 
0197 
0198 
0198 
0199 
0199 
QISA 
O19A 
0196 
0198 
O!9C 
019¢C 
0:90 
0190 
019D 
OI9E 
O19E 
O19F 
O19F 
O1A0 
OLAO 
OLAI 

O1Al 

O1A1 

O1A2 
O1A2 
OLA3 
O1A3 
O1A4 
O1A4 
O1A5 
OLAS 
O1A5 
O1LA6 
OLAG 
OLAT 
O1A7 
O1A8 
01A8 
O1A9 
Q1A9 
O1A9 
O1AA 
OLAA 
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co10 
3116 
2014 
1060 
6010 
3210 
CEOA 
236B 
6010 


3C10 


38AA 
4080 
CEIS 


68D9 


38AA 
4080 
GEIS 


6809 


38AA 
4080 
CEIS 


6809 


38AA 


4080 
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See eSHSR HHH HHH HHHHHHHHHHHHEHE RHO HHH H HHO ew ee 


* COEFFICIENT UPDATE ROUTINE 


HHMHRAHHHOHRRAHHHHE HEHEHE MEER HEHEHE 


LARK 
LAR 
LAC 
SUB 
SACL 
LAR 
SPM 
LAC 
SACL 


LT 


MPY 
ZALH 
APAC 


SACH 


MPY 
ZALH 
APAC 


SACH 


MPY 
ZALH 
APAC 


SACH 


MPY 


ZALH 


ARO,16 
AR1,ADINCO 
ADAO 

H 

TEMP3 
AR2,TEMP3 
2 

IABSY ,GAIN 
TEMP3 


TEMP3 


*+,AR2 


*0-,0,AR1 


*+,AR2 


*O0-,0,ARI 


*+,AR2 


* 


“0-,0,AR1 


*+,AR2 


16 -> ARO (AR2 INCREMENT) 
ADINCO -> ARI 
ADAO -> ACC 


ACC - H -—> ACC 


ADAO - H -> AR2 

SET 4 BIT LEFT SHIFT OF P REG 
IABSY * 2**GAIN -> ACC 

ACC -> TEMP3 


TEMP3 -> T REG 


INC(O) * T REG -> P REG 
ACH) © 2**16 -> ACC 
P REG + ACC ~> ACC 


HIGH ACC -> A(H) 


INC(!) * T REG -> P REG 
AC(164+H) * 2*°"16 -> ACC 
P REG + ACC -> ACC 


HIGH ACC -> A(16+H) 


INC(2) * T REG -> P REG 
A(32+H) * 2*°*16 -> ACC 
P REG + ACC ->» ACC 


HIGH ACC -> A(32+H) 


EC128 


0996 
0997 
0998 
0999 
1000 
1001 
1002 
1003 
1004 
1005 
1006 
1007 
1008 
1009 
1010 
1011 
1012 
1013 
1014 
1015 
1016 
1017 
1018 
1019 
1020 
1021 
1022 
1023 
1024 
1025 
1026 
1027 
1028 
1029 
1030 
1031 
1032 
1033 
1034 
1035 
1036 
1037 
1038 


O1A8 
O1AB 
O1AC 
OLAC 
O1AD 
OLAD 
OLAD 
OLAE 
O1AE 
OLAF 
OLAF 
0160 
0160 
0181 
0181 
0181 
0162 
0:62 
0183 
01B3 
01B4 
01B4 
0165 
0185 
0185 
O1B6 
01B6 
O1B7 
O1B7 
0168 
0188 
01B9 
01B9 
0189 
OIBA 
OLBA 
0188 
0188 
O1BCc 
01BC 
0180 
01BD 
01B0 
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CE15 


6809 


38AA 
4080 
CEIt5 


6809 


38AA 
4080 
CE1S 


6809 


3BAA 
4080 
CEIS 
6809 


3BAA 
4080 
CEIS 


6809 


CE08 


“0-,0,AR1 


*0-,0,ARI1 


*+,AR2 


*“0-,0,AR1 


*+,AR2 


“0-,0,ARI1 
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* SET NO SHIFT OF P REG 


11-19-85 


CSP 


OZOZESWL & YUM JaTJIURD OYSY 2dI0A jeUsIC “CT 


ECI28 


1040 
1041 
1042 
1043 
1044 
1045 
1046 
1047 
1048 
1049 
1050 
1051 
1052 
1053 
1054 
1055 
1056 
1057 
1058 
1059 
1060 
106] 
1062 
1063 
1064 
1065 
1066 
1067 
1068 
1069 
1070 


1071 
1072 
1073 
1074 
1075 
1076 
1077 
1078 
1079 
1080 
1081 
1082 
1083 
1084 
1085 
1086 
1087 
1088 
1089 
1090 
1091 
1092 
1093 


OBE 
O1BE 
O1BF 
O1BF 
01C0 
o1co 
o1cl 


O1C1 
O1CI 
01Cc2 
01C2 
01C3 
01C3 
01C4 
01C4 
01C5 
o1cs 
01C6 
01C6 
01C7 
O1C7 
o1cs 
01c9 
01c9 
OICA 
O1CA 
01C8 
01CB 
o1cC 
o01CcC 
oicbd 
01CcD 
O1CE 
O1CE 
O1CF 
01CF 


OLCF 
O1CF 
0100 
0100 
o101 
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CE IF 
CEItF 


5500 


7862 
cso0 
6863 
6064 
5660 
4100 


0004 
OOFF 


6060 
4164 
4863 
5062 
Cceoo 


CE26 


CEOO 


FF8O 
OO7F 
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SRSHSSHHHPHEHHHHHKHHKHEHHHHSHHKEHSHHOHHHEHKHEEHHHHHEHHHHHKH HEHEHE Hene 
J 

* CYCLE ENO ROUTINE 

° 


SRSKHHRHSHHHHEHEHHAHHHRHKHHHEHHHRHHEHHHEHHHHHHHHEHHHHR DHE ERO DOH 


Loop IDOLE * WAIT IN LOOP UNITL RINT/XINT 
IOLE * EXTRA IDLE FOR TWO RINT 
NOP 
AHMNNHREEHEREEEHEHEHEERRON HEHEHE HERRERA NHER RHEE RENEE OREN Ee 
« 
* RINT SERVICE ROUTINE 
RXRT SST TSTO * SAVE STO 
LDOPK 0 * 0 -> PAGE POINTER 
SACH TACCH * SAVE HIGH ACC 
SACL TACCL * SAVE LOW ACC 
OMOV ORR} * DRRI -> ORR2 
ZALS ORR “ DRR -> ACC 
ANDK >OOFF " MASK-OFF MSB BYTE 
SACL ORR! * ACC -> ORR! 
ZALS TACCL © RESTORE LOW ACC 
ADDOH TACCH * RESTORE HIGH ACC 
LST TSTO * RESTORE STO 
EINT * INTERRUPTS ENABLED 
RET * RETURN TO PROGRAM 
PEON HHRRHEHEHEHRAE HEHEHE RERHHHRRHEHEHH RHO H HERR EHH RREE HER 
. 
bo XINT SERVICE ROUTINE 
TXRT EINT * INTERRUPTS ENABLED 
B START * BRANCH TO PROGRAM START 


01D2 
0300 
0300 


0300 
0300 
0301 
0302 
0303 
0304 
0305 
0306 
0307 
0308 
0309 
030A 
0308 
030C 
030D 
O30E 
O30F 
0310 
0311 
0312 
0313 
0314 
0315 
0316 
0317 
0318 
0319 
O31A 
0318 
031C 
0310 
O31E 
O31F 
0320 
0321 
0322 
0323 
0324 
0325 
0326 
0327 
0328 
0329 
032A 
032B 
032C 
032D 
032E 
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0300 


EOAI 
EIAL 
E2Al 
E3A) 
E4Al 
ESA! 
E6A1 
E7AI 
EBA! 
E9AI 
EAA1 
EBA! 
ECAI 
EDA! 
EEA 
EFA! 
FO6! 
FOEI 
FI6l 
FIEI 
F261 
F2E1 
F361 
F3E1 
F461 
F4E1 
F561 
F5E1 
F661 
F6E! 
F761 
FTE1 
F841 
Fes! 
F&Cl 
F90) 
F941 
F961 
F9C1 
FAO! 
FA4] 
FA8! 
FAC] 
FBO! 
FB41 
FBS! 
FBCI 
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AHORA MHMREHRHERHHHHEHHHHENHEHARHHHEHESEROHRHHHEREHEEHOEHRANLODEE 
° 

bod MU-LAW EXPANSION LOOKUP TABLE 

® 

HARE HHHERERAHARHHRARERHHEHRRERREHEEHHEHHHHHEREEHORREREDOREE 


AORG »>300 
xXTBL EQU $ 

DATA >EDAI * NEGATIVE VALUES FIRST 
DATA EIA ° (FF, FE, ETC.) 
DATA >E2A1 
DATA >E3AI 
DATA >E4AI 
DATA >ESA! 
DATA >E6A1 
DATA >ETAI 
DATA >ES8AI 
DATA >E9A! 
DATA >EAAI 
DATA >EBAI 
DATA >ECAI 
DATA >EDA! 
DATA >EEA 
DATA >EFAI 
DATA >F061 
DATA >FOE!1 
DATA >F161 
DATA >FIE!I 
DATA >F26)1 
DATA >F2EI 
DATA >F361 
DATA >F3E1 
DATA »>F 46! 
DATA >F4E) 
DATA >F561 
DATA »>F5E! 
DATA >F661 
DATA >F6E! 
DATA >F761 
DATA >FTEI 
DATA >F841 
DATA »>F88) 
DATA »>F8C1 
DATA >F901 
DATA >F941 
DATA >F981 
DATA >F9C1 
DATA >FAO! 
DATA >FA4! 
DATA >FABI 
DATA »>FACI 
DATA >FBOI 
DATA >FB4I1 
DATA >FB81 
DATA >FBCI 


€Sp 


OZOZESILL & WM JoTJIIUB CYS SIOA peudiq “St 


EC128 


1152 
1153 
1154 
1155 
1156 
1157 
1158 
1159 
1160 
1161 
1162 
1163 
1164 
1165 
1166 
1167 
1168 
1169 
1170 
1171 

1172 
1173 
1174 
1175 
1176 
1177 
1178 
1179 
1180 
1181 

1182 
1183 
1184 
1185 
1166 
1187 
1188 
1189 
1190 
1191 

1192 
1193 
1194 
1195 
1196 
1197 
1198 
1199 
1200 
1201 

1202 
1203 
1204 
1205 
1206 
1207 
1208 


032F 
0330 
0331 
0332 
0333 
0334 
0335 
0336 
0337 
0338 
0339 
033A 
0338 
033¢C 
0330 
033€ 
033F 
0340 
0341 
0342 
0343 
0344 
0345 
0346 
0347 
0348 
0349 
O34A 
0348 
034C 
0340 
034€ 
O34F 
0350 
0351 
0352 
0353 
0354 
0355 
0356 
0357 
0358 
0359 
035A 
0358 
035C 
0350 
O35€ 
O35F 
0360 
0361 
0362 
0363 
0364 
0365 
0366 
0367 
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FCO! 
FC31 
FCS51 
FC71 
FC9I 
FCBI 
FCO! 
FCF 1 
FOI 
FO31 
FOS! 
FO71 
FO9I 
FOB! 
FOD1 
FOF | 
FEI 
FE29 
FE39 
FE49 
FES9 
FE69 
FE79 
FE89 
FES9 
FEA 
FEBS 
FEC9 
FEDS 
FEES 
FEF9 
FFO9 
FF19 
FF25 
FF20 
FF35 
FF 3D 
FF45 
FF 40 
FFSS 
FF5D 
FF65 
FF60 
FF75 
FF7D 
FF8S 
FF8D 
FF9S 
FF9D 
FFA3 
FFA7 
FFAB 
FFAF 
FFB3 
FF87 
FFBB 
FFBF 


DATA 
DATA 
DATA 
DATA 
DATA 
OATA 
DATA 
DATA 
OATA 
OATA 
DATA 
DATA 
DATA 
DATA 
OATA 
DATA 
OATA 
DATA 
OATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
OATA 
DATA 
DATA 
DATA 
DATA 
DATA 
OATA 
OATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


>FCOI 
>FC31 
>FCS! 
>FCTI 
>FC9I 
>FCBI 
>FCO! 
>FCF I 
>FDII 
>FO31 
>FDSI 
>FO71 
>FD9! 
>F DB! 
>FO01 
>F OF | 
>FE1I 
>»FE29 
>FEIR9 
>FEAS 
>FE59 
>FE69 
>FET79 
>FE89S 
>»FESS 
>FEAS 
>FEBS 
>FECS 
>FEDS 
>FEES 
>FEFS 
>FFOS 
>FFIS 
>FF25 
>FF2D 
>FF35 
>FF3D0 
>FF 45 
>FF 40 
>FF55 
>FFSO 
>FF6S 
>FF6D 
>FFT75 
>FFID 
>FF85 
>FF8O 
>FF9S 
>FF9D 
>FFA3 
>FFAT 
>FFAB 
>FFAF 
>FFB3 
>FFB7 
>FFBB 
>FFBF 
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1209 
1210 
12i1 
1212 
1213 
1214 
1215 
1216 
1217 
1218 
1219 
1220 
1221 
1222 
$223 
1224 
1225 
1226 
1227 
1228 
1229 
1230 
1231 
1232 
1233 
1234 
1235 
1236 
1237 
1238 
1239 
1240 
1241 
1242 
1243 
1244 
1245 
1246 
1247 
1248 
1249 
1250 
1251 
1252 
1253 
1254 
1255 
1256 
1257 
1258 
1259 
1260 
1261 
1262 
1263 
1264 
1265 
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FFC3 
FFCT 
FFCB 
FFCF 
FFD3 
FFD? 
FFOB 
FFDF 
FFE2 
FFE4 
FFE6 
FFE8 
FFEA 
FFEC 
FFEE 
FFFO 
FFF2 
FFF4 
FFF6 
FFF8 
FFFA 
FFFC 
FFFE 
0000 


IF SF 
1E5F 
1DSF 
ICSF 
1BSF 
1A5F 
195F 
185F 
175F 
165F 
155F 
145F 
135F 
125F 
L1ISF 
105F 
OF OF 
OF IF 
OE SF 
OE IF 
OD9F 
oO1F 
OC9OF 
OCIF 
OB9SF 
OBIF 
OASF 
OAIF 
099F 
OS1F 
089F 
OBIF 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


ASSEMBLER PC 


>FFC3 
>FFCT 
>FFCB 
>FFCF 
>FFD3 
>FFO7 
>FFOB 
>FFOF 
>FFE2 
>FFEA 
>FFE6 
>FFEB 
>FFEA 
>FFEC 
>FFEE 
>FFFO 
>FFF2 
>FFF4 
>FFF6 
>FFFS 
>FFFA 
>FFFC 
>FFFE 
>0 


> 1 FSF 
> LESF 
>105F 
>1CSF 
>1BSF 
>LASF 
>195F 
>165F 
>175F 
>165F 
>155F 
>145F 
>135F 
>125F 
>115F 
>105F 
>F OF 
>F IF 
>E SF 
DEIF 
>O9OF 
>OI1F 
>C9OF 
>CIF 
>BOF 
>BIF 
>ASF 
D>AIF 
>99F 
>SIF 
>89F 
>BIF 
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* POSITIVE VALUES NEXT 


(POLARITY BIT 


1) 


St 


OZOZESWLL & WIM Ja[foouRD oyog ad10A feNsIC “SI 


03A0 
O3Al 
03A2 
03A3 
Q3A4 
O3A5 
03A6 
03A7 
03A8 
03A9 
O3AA 
03AB 
Q3AC 
03AD 
O3AE 
O3AF 
03B0 
0361 
0362 
0383 
0384 
03865 
03B6 
0387 
0388 
0389 
O3BA 
0388 
03BC 
038) 
038E 
O03BF 
03CO0 


“03C1 


03C2 
03C3 
03C4 
03C5 
03C6 
03C7 
03Cc8 
03C9 
03CA 
03CB 
03CC 
03C0 
O3CE 
O3CF 
03D0 
0301 

0302 
0303 
03D4 
0305 
0306 
0307 
03D8 
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o78F 
O77F 
073F 
O6FF 
O6BF 
067F 
063F 
OSFF 
O58F 
057F 
053F 
O4FF 
O4BF 
047F 
043F 
O3FF 
O3CF 
O3AF 
036F 
036F 
034F 
032F 
030F 
O2EF 
O2CF 
O2AF 
028F 
O26F 
024F 
022F 
020F 
O1EF 
0107 
01C7 
0187 
O1A7 
0197 
0187 
0177 
0167 
0157 
0147 
0137 
0127 
0117 
0107 
OOF 7 
00E7 
0008 
0003 
00cB 
00c3 
0088 
0083 
00AB 
00A3 
009B 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
OATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


> 7BF 
>77F 
>73F 
>6FF 
> 6BF 
>67F 
>63F 
>5SFF 
>S8F 
>S7F 
>5S3F 
>4FF 
> 4BF 
>47F 
>43F 
>3FF 
>3CF 
>3AF 
>38F 
>36F 
>34F 
>32F 
>30F 
>2EF 
>2CF 
>2AF 
> 28F 
>26F 
>24F 
>22F 
>20F 
>1EF 
>107 
>1C7 
>187 
>1A7 
>197 
>187 
>177 
>167 
>157 
>147 
>137 
>127 
>i? 
>107 
FT 

>E7 

>0B 

>D3 

>cB 

>c3 

>BS 

>B3 

>AB 

>A3 

>9B 
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1323 
1324 
1325 
1326 
1327 
1328 
1329 
1330 
1331 

1332 
1333 
1334 
1335 
1336 
1337 
1338 
1339 
1340 
1341 

1342 
1343 
1344 
1345 
1346 
1347 
1348 
1349 
1350 
1351 

1352 
1353 
1354 
1355 
1356 
1357 
1358 
1359 
1360 
1361 

1364 
1363 


0309 
03DA 
0308 
030C 
0300 
O3DE 
O3DF 
03E0 
O3€1 
03E€2 
03E3 
0364 
O3E5 
O3E6 
03E7 
03E8 
O3E9 
O3EA 
O3EB 
O3EC 
Q3ED 
O3EE 
O3EF 
03F0 
O3F 1 
O3F2 
O3F3 
O3F 4 
O3F5 
O3F6 
O3F7 
03F8 
O3F9 
O3FA 
O3FB 
O3FC 
O3FD 
O3FE 
O3FF 
0400 


NO ERRORS, 
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0093 
0088 
0083 
007B 
0073 
006B 
0063 
005D 
0059 
0055 
0051 
004D 
0049 
0045 
004! 
0030 
0039 
0035 
0031 
002D 
0629 
0025 
002) 
OO1E 
ooic 
COLA 
0018 
0016 
0014 
0012 
0010 
OOO0E 
oooc 
OOO0A 
0008 
0006 
0004 
0002 
coa0 


END 
NO WARNINGS 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


»93 
>8B 
»83 
>7B 
>73 
>6B 
»63 
>50 
>»59 
>55 
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Telecommunications 


16. Implementation of the Data Encryption 
Standard Using the TMS32010 


Panos Papamichalis and Jay Reimer 
Digital Signal Processing - Semiconductor Group 
Texas Instruments 


455 


456 


INTRODUCTION 


The programmability of the TMS320 family of digital signal 
processors makes possible the implementation of different 
signal processing algorithms on the same device, rather than 
on several different custom chips. As an example, this 
application report describes the implementation of a data 
encryption method on the TMS32010. The encryption 
scheme operates on a stream of bits that represents text, 
computer files, or anything else presented in binary form. 
In particular, the encryption method is considered in 
conjunction with speech coding to achieve secure 
communication over voice channels. Another example 
consists of the encryption of data supplied to a transmitting 
modem and decryption of the corresponding data stream 
coming out of the receiving modem. The algorithm chosen 
for the encryption is the Data Encryption Standard (DES) 
as extablished by the National Bureau of Standards. ! 

The following paragraph has been taken from reference 
(1] and is brought to the attention of the readers of this report. 
For more information, please contact the national Bureau of 
Standards. 


‘‘EXPORT CONTROL: Cryptographic devices 
and technical data regarding them are subject to 
Federal Government export controls as specified 
in Title 22, Code of Federal Regulations, Parts 
121 through 128. Cryptographic devices 
implementing [the DES] and technical data 
regarding them must comply with these Federal 
regulations.”’ 


THE DATA ENCRYPTION STANDARD (DES) 


Encryption/Decryption Algorithm 

The Data Encryption Standard (DES), the algorithm 
of data encryption selected by the United States federal 
government, is described in detail in the NBS publications. ! 
The description is summarized here in order to associate it 
with the implementation on the TMS32010. 

Figure 1 shows a flowchart of the enciphering portion 
of the algorithm. The stream of bits to be encrypted (the 
plaintext) is segmented into blocks of 64 bits. Each 64-bit 
block is submitted to an initial permutation, and then is split 
into two 32-bit blocks (a left one Lo and a right one Ro), 
which are input to the transformation section. The 
transformation section consists of 16 stages, where the data 
is scrambled by the use of the encryption key. At each stage 
i, the inputs are the left block Lj.; and right block Rj-) of 
the previous stage, and the outputs are the left block Lj and 
right block Rj of this stage. The outputs Lj and Rj of each 
stage are computed from Lj-;, Rj-1, and a subkey Kj; that 
is generated from the encryption key. Note that the 16th stage 
is different from the other stages. The output of the last stage 
is subjected to a permutation, which is the inverse of the 
initial permutation. The resulting 64 bits are the ciphertext. 


64-BIT INPUT PLAINTEXT 


32 7 32 
Ry =p @f(Ro,K1) 


R2 =L4 @f(Ry.K2) 
R15 =L44 9 f(R74,.K15) 


L146 =115 ¢ f(R15.K16) Re=R15| 
64 


INVERSE PERMUTATION: 
MATRIX IP-1 
64/ 
OUTPUT CIPHERTEXT 


Figure 1. Flowchart of the DES Encryption 


The initial permutation and inverse initial permutation 
matrices IP and IP-! in Figure 1 are shown in Figures 2 and 
3. Where there is bit manipulation according to a matrix, 
the matrix should be read from left to right and row by row. 
Each entry of the matrix corresponds to a bit in the output 
block. In other words, the information presented in the matrix 
is sequential and not two-dimensional. The matrix 
arrangement is used here only for convenience of 
presentation. | 
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Each entry of the matrix indicates the order of the bit 40 8 48 16 56 24 64 32 
in the input bit stream, starting from the leftmost bit. For 39 7 47 #+=15 55 23 #463 ~ «31 
example, during the initial permutation, the leftmost bit after 38 6 46 14 54 22 62 = 30 
the permutation is the 58th bit from the left in the input 37 5 45 13 53 21 #4261 = 29 
stream; the second leftmost bit is the SOth bit, etc. If the 36 4 44 12 52 20 60 28 
matrix has as many entries as the number of bits of the input 35 3 43 11 #51 #19 59 27 
block, the input and output blocks from the bit manipulation 34 2 42 10 50 18 58 26 
have the same number of bits. If, however, the matrix has 33 1 41 9 49 17 457 25 
more entries (by repeating some of the bits of the input block) 
or fewer entries (by skipping some input-block bits), the Figure 3. Matrix IP-! 
output of the bit manipulation is longer or shorter, 
respectively, than the input. Such examples, shown later in The function f(Rj-1,Kj) of Figure 1, which combines 
this report, are matrix E that transforms 32 bits into 48 bits, the right block Rj.; of the previous stage and the subkey Kj 
and matrix PC-2 that transforms 56 bits to 48 bits. of the present stage i, is shown in detail in Figure 4. The 


input to the function is a block of 32 bits, and the output 


58 50 42 34 26 18 ~= «#10 2 is also a block of 32 bits. Figures 5, 6, and 7 give the matrices 
60 52 44 36 28 20 12 4 E, P, and S1 through S8 of Figure 4. Kj’s are subkeys 
62 54 46 38 30 22 14 6 generated from the main key, as described in the next section. 
64 56 48 40 32 24 16 8 The S-boxes, which convert a 6-bit sequence to a 4-bit 
57 49 41 33 25 17 9 1 sequence, are used as follows: the first and last bits of the 
59 51 43 35 27 #219 = 11 3 sequence, taken together, represent a number I between 0 
61 53 45 37 #29 21+ 13 5 and 3, while the middle 4 bits represent a number J between 
63 55 47 39 31 23 = 15 7 0 and 15. The S-boxes are 4 x 16 matrices whose entries 
take values between 0 and 15. Each of these entries can be 
Figure 2. Matrix IP represented by a 4-bit number. For each S-box, the I and 
32 
BIT-SELECTION TABLE: 
MATRIX E 
48) 
(=)=4 
48 
6 v ’ ig ’ g 
Ps} Le} Ls} Ls} Ls} Ls Le 
4 r, 
32 


PERMUTATION 
FUNCTION: 


MATRIX P 


Figure 4. Computation of the Function f(Rj,Kj +1) of Figure 1 
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J are computed, and the corresponding 4-bit number in the 
matrix is the output. For example, if the input to S1 box is 
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001010, then I = 0, J = 5, and the output is the decimal 
15, which corresponds to the 4-bit binary number 1111. 
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Figure 7. S Matrices (continued) 


For decrypting the ciphertext, the same procedure is 


used as for encrypting, except that the transformation block 64 
in Figure 1 is modified as shown in Figure 8. The left and 
right 32-bit word locations are exchanged, and the algorithm 
starts with subkey 16 and ends with subkey 1. The 
implementation for decryption is similar to encryption. The 
TMS32010 code (see the appendix) includes a flag to signal 
if encryption or decryption is performed. In decryption, the 
subkeys are read backwards, but everything else remains the 
same. 


Generation of the Subkeys 

The encryption key is 64 bits long, out of which only 
56 bits are used for the encryption. In every group of eight 
bits, the eighth bit is such that the byte has odd parity. This 
provides some guarantee of key integrity during transmission. 
From the 64-bit key, 16 subkeys, each 48-bits long, are 
generated as shown in Figure 9. The matrices for permuted 
choices 1 and 2, PC-1 and PC-2, are given in Figures 10 
and 11, respectively. Table 1 shows the number of left shifts 
at each stage. The indicated left shifts are circular left shifts 
for each half of keys Cj and Dj. 


Table 1. Left Shifts for Subkey Generation Rya=115 


iteration Number of ; : ‘ 
Number Left Shifts 


Lo =Rq @ fiL4.K4) 


INVERSE PERMUTATION: 
MATRIX iP ~ 1 
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OUTPUT DECIPHERED TEXT 


Figure 8. Flowchart of the DES Decryption 
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Figure 10. Matrix of PC-1 
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Figure 11. Matrix PC-2 
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MODES OF OPERATION 


The DES algorithm can be implemented in different 
modes of operation2, two of which are the Electronic 
CodeBook (ECB) mode and the Output FeedBack (OFB) 
mode. Selection of a mode determines the kind of plaintext 
to serve as input to the algorithm and how the ciphertext is 
used. 

Figure 12 is another representation of Figure 1 and 
corresponds to the most straightforward application of the 
algorithm, the Electronic CodeBook (ECB) mode. The 
plaintext is the actual bit stream to be encrypted, and the 
ciphertext is what is actually transmitted over the 
communications medium. 

Figure 13 depicts the 64-bit Output FeedBack (OFB) 
mode. In this case, the 64 bits of the output of the DES 
algorithm are exclusive-ORed with 64 bits of the actual bit 
stream to generate the encrypted stream for transmission. 
The same 64 bits of the DES output are then fed back to the 
input of the DES as the new input to the algorithm. The whole 
process is initialized by inputting a predetermined block of 
64 bits into the DES algorithm. The receiver has a similar 
operation. As long as it has the same initial 64-bit block as 
input, the receiver operates synchronously with the 
transmitter. Note that in this mode there is no need to 
implement the decrypting portion of the algorithm, because 
both the transmitter and the receiver are using only the 
encrypting portion of the DES. The OFB mode has the 


64 


DES 
ALGORITHM 


SCRAMBLED DATA 


CHANNEL 


advantage over the ECB that it confines any transmission 
errors to single bits, rather than a block of 64 bits. 


TMS32010 IMPLEMENTATION OF THE DES 


In addition to implementing the main 
encryption/decryption algorithm, it is necessary to process 
the encryption key in order to generate the subkeys used in 
the DES. The key consists of 64 bits, of which only 56 are 
active. In every byte of the key, the eighth bit is an odd- 
parity bit. To avoid any mistakes, the user is asked to supply 
the 56 bits. The device then generates the correct 64-bit 
version that can be used for storage and transmission. This 
function is realized by the routine DESKEY. The whole 
implementation of the Data Encryption Standard in 
TMS32010 assembly language can be found in the appendix. 
Figure 14 is a flowchart illustrating the arrangement of the 
TMS32010 programs that implement the DES. 

The largest part of the code consists of a repetitive 
construction, which implements the permutation of a block 
of bits according to a matrix. This construction is discussed 
here to provide a better understanding when implementing 
the code. 

Assume that a certain step of the algorithm requires 
the scrambling of 64 bits according to a matrix. The 64 input 
bits are stored in four words, WI1 to WI4. The 
scrambled-bit output is also stored in four words, WO] to 
WO4. The scrambling is accomplished by constructing the 


KEY DECIPHERED 
BIT 


INVERSE DES 
ALGORITHM 


Figure 12. Electronic CodeBook (ECB) Encryption Mode 
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Figure 13. Output FeedBack (OFB) Encryption Mode 
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INPUT KEY AND ENCRYPT/DECRYPT FLAG 


PROGRAM ‘DESKEY’ FOR KEY MANAGEMENT 


SUBROUTINE ‘PARITY’ CALLED IN ‘DESKEY’ TO ADD ODD PARITY 


PROGRAM ‘KEYGEN’ TO GENERATE 16 SUBKEYS 


SUBROUTINES ‘PRMCH1,’ ‘PRMCH2,’ ‘SHIFT’ CALLED 


IN ‘KEYGEN’ TO GENERATE SUBKEY 


PROGRAM ‘DES’ TO SELECT ENCRYPT/DECRYPT MODE 


DATA TO BE 
ENCRYPTED/DECRYPTED 


SUBROUTINE ‘INITP’ CALLED IN ‘DES’ FOR INITIAL PERMUTATION 


SUBROUTINE ‘PERM’ CALLED IN ‘INITP’ 


SUBROUTINE ‘FUNC’ CALLED IN ‘DES’ TO COMPUTE FUNCTION f. 
SUBROUTINE ‘SMTRX* CALLED IN ‘FUNC’ TO COMPUTE S-BOXES 


SUBROUTINE ‘INVPR’ CALLED IN ‘DES’ FOR INVERSE INITIAL 
PERMUTATION. SUBROUTINE ‘IPRM’ CALLED IN ‘INVPR’ 


OUTPUT CIPHERTEXT 


Figure 14. Flowchart of TMS32010 Programs Implementing the DES 


WO1 to WO4, one bit at a time from left to right. For 
example, let the current entry of the matrix be the 35th entry 
and have the value 53. This means that WO1 and WO2 are 
already filled (containing 2 x 16 = 32 bits), and now the 
third bit of WO3 is being determined. Since that bit is the 
53rd bit of the input stream, it occupies the fifth location 
in WI4 (53 = 3 x 16 + 5). If M1 contains >8000 
(hexadecimal value), the transfer of the bit from the input 
stream to the scrambled output stream is performed by the 
following code: 


LAC Wi14,4 
AND M1 
ADDH WO3 
SACH WO3,1 


In other words, the bit of interest is positioned at the 
leftmost location of the accumulator’s low word, and then 
masked out by M1. The current contents of WO3 are added 
to the accumulator’s high word, which is then stored back 
to WO3 shifting by one. In this way, the bit of interest is 
also picked up. 


In order to run the main algorithm, 16 subkeys, each 
48-bits long, must be generated, to be used at the 16 stages 
of the algorithm. The algorithm for the generation of the 
subkeys involves permutations and shifts of the original key. 
The method of generating the 16 subkeys is implemented in 
the routine KEYGEN. For convenience, each subkey is 
stored in four 16-bit words so that only the top 12 bits are 
active in each word. 

Having generated the subkeys, an incoming stream of 
bits can be encrypted or decrypted. The body of the DES 
algorithm is implemented in the routine DES. This routine 
implements both encryption and decryption, and the 
appropriate choice is flagged by a bit. In the present 
implementation, this bit is supplied together with the key. 

Table 2 shows the program storage in words and the 
execution time in instruction cycles and milliseconds, as 
required by the DES algorithm. The information given for 
the routine DES is the time required per 64 input data bits. 
The execution time of the DES algorithm corresponds to a 
loading of 100 percent of the device for a bit rate of about 
42.5 kbit/s. This makes the TMS32010 a faster encryption 
device than some of the currently available dedicated 
devices.9 
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Table 2. TMS32010 Memory and Speed for the DES 


Program 
Memory 
(words) 


Key Management and 

Subkey Generation 
1102 
(both) 


Execution Time 
(instruction (ms) 
cycles) 


4527 


0.905 


7464 
7542 


1.493 
1.508 


Encryption 
Decryption 


NOTE: The encryption/decryption execution time is an average for 
the half-duplex operation and for processing 64 bits, based 
on a 200-ns instruction cycle. The corresponding actual 
bit rates are: 

Encryption: 
Decryption: 


42.86 kbit/s 
42.42 kbit/s 


The data memory required for the implementation of the DES 
includes 64 locations for the subkeys and another 51 locations 
as scratch registers. If an application running concurrently 
requires more data memory, the minimum memory 
requirement is four words, which contain the DES key. The 
subkeys can be regenerated at the beginning of every 
encrypting (or decrypting) session (but with a corresponding 
penalty in execution time). 

Note that no attempt of optimization is made in this 
implementation. Because of tradeoffs between data memory, 
program memory, and execution speed, the designer must 
optimize the implementation of the algorithm to fit the needs 
of the application. 


ENCRYPTION OF CODED SPEECH 


The TMS32010 Digital Signal Processor has been applied 
to encoding of speech signals in a variety of approaches. Its 
architecture and instruction set are particularly convenient 
for easy implementation and fast execution of speech 
algorithms, such as those used in vocoders. Several 
algorithms have been implemented on the TMS32010 to 


LPC-10 
ANALYSIS 


LPC-10 
SYNTHESIS 


encode speech data at 2.4, 9.6, 16, and 32 kbit/s.3-8 These 
speech-encoding algorithms can be readily combined with 
a data encryption method to provide security both in 
store-and-forward applications and in telecommunications 
applications. In this section, a review of the requirements 
for the speech coding algorithms is presented to show how 
the DES can be incorporated in a speech coding application. 
Figure 15 illustrates how the DES may be implemented on 
the TMS32010 processor for either data or speech 
encryption. 

Table 3 presents the different speech-coding algorithms 
that have been implemented on the TMS32010. Table 4 
summarizes the loading of the TMS32010, when running the 
DES, for several bit rates corresponding to the bit rates of 
Table 3. Note that these loadings are for half-duplex 
operation, i.e., for either encryption or decryption only. 


Table 3. Speech-Coding Algorithms Implemented on the 
T™S32010 


Bit Rate 
(kbit/s) 


TMS32010 Loading 


Coding Method (percent realtime) 


Analysis 


Synthesis 


LPC 

RELP 

APC 

SBC 

ADPCM 
ADPCM-CCITT 


NOTE: LPC Linear Predicitive Coding, 10th-order model 
RELP = Residual Excited Linear Predictive vocoder 


tl 


APC = Adaptive Predictive Coding 
SBC = Sub-Band Coding 
ADPCM = Adaptive Differential PCM (CCITT algorithm, not 


bit-by-bit compatible) 
ADPCM-CCITT = CCITT ADPCM (bit-by-bit compatible) 


CHANNEL 


CHANNEL 


Figure 15. Speech or Data Encryption/Decryption with the TMS32010 
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Table 4. TMS32010 Loading Time for the DES 
(Half-Duplex) 


TMS32010 Loading 
(percent realtime) 


Bit Rate 
(kbit/s) 


NOTE: The loading (for encryption) is given as a percentage of 42.8 
kbit/s. The other bit rates were selected to match the bit 
rates in Table 3. 


A comparison of Tables 3 and 4 shows that the 
2.4-kbit/s full-duplex system can be implemented in realtime, 
including the encryption, in only one TMS32010. However, 
for the other coding schemes, it is necessary to use a second 
chip. The speech algorithms used have been selected without 
consideration for any other processing of the speech signal. 
It may be possible to restructure the marginal cases in order 
to implement the combined speech and encryption algorithms 
with fewer chips. Other speech coding algorithms with lighter 
computational load may also be considered. 


CONCLUSION 


The TMS32010 Digital Signal Processor can be used 
to implement algorithms, such as the DES, which in the past 
have been implemented on dedicated devices. The 200-ns 
cycle time of the TMS32010 makes possible a half-duplex 
rate of 42.5 kbit/s for the DES. This rate is superior to that 
achieved by some of the available devices. The single-cycle 
multiplication is used in the computation of the S-boxes and 
helps increase encryption speed. Applications of the 
encryption include not only speech, as discussed in this 
report, but any kind of digital data. For example, 


implementation of a V.22 bis modem can be combined with 
the DES to achieve security of the transmitted information. 
An important point to be considered is that, having designed 
the TMS32010 in a system, the data encryption function can 
be added with minimal system cost because of .the 
programmability of the device. This feature is a natural 
extension in the functionality of a digital signal processor. 
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Appendix 


TMS32010 Assembly Language Programs 


The TMS32010 code implementing the DES is not published in this appendix because it falls within the U.S. Department 
of State Export Control Regulations. If a copy of the code is desired, please contact your local TI representative. 
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INTRODUCTION 


Digital voice communication is typically transmitted in a 
64-kbit/s PCM bit stream. Voice and data communications 
demand increasing capacities for signal transmission without 
significant degradation in the quality of the transmitted signal. 
One of the recommended solutions for accomplishing this 
task is that of Adaptative Differential Pulse Code Modulation 
(ADPCM). This solution has been reviewed by CCITT 
(International Telegraph and Telephone Consultative 
Committee), and a specific standard* has been 
recommended. Two solutions, a full-duplex solution and a 
half-duplex solution, are discussed in this application report. 
Both follow the model recommended by CCITT for 32-kbit/s 
ADPCM, although only the half-duplex solution provides 
a bit-for-bit compatible data stream as required by the 
recommendation. At 32 kbit/s, the ADPCM solution provides 
double the channel] capacity of the current 64-kbit/s PCM 
technique. Each solution has been totally incorporated in the 
internal memory space of the Texas Instruments TMS32010 
microprocessor. 

This application report presents a brief review of the 
basic principles of PCM and ADPCM. Hardware 
requirements, software logic flow,.and key features of the 
TMS32010 microprocessor for the implementation of 
ADPCM are also given. Source code is provided for the 
implementation and creation of an ADPCM transmission 
channel. 


DIGITIZATION 


Over the past 20 years, the telecommunications industry 
has changed from totally analog circuits to networks which 
integrate both analog and digital circuits. Digital signal 
encoding has the advantages of greater noise immunity, 
efficient regeneration, easy and effective encryption, and 
uniformity in transmitting voice and data signals. Increased 
bandwidth is required to transmit digital signals while 
maintaining a given analog signal quality at the receiver. 

Voice store and forward systems have been changing 
from totally analog storage media, such as audio tape, to 
digitized storage which allows random access of stored data, 
but with the tradeoff of increased storage media 
requirements. 

Signal quality begins with the digitization of the original 
analog signal. The process of digitization and coding 
introduces a distortion associated with the quantization of 
the digitized signal; as shown in Figure 1. This signal 
distortion or noise is different from the channel noise 
normally associated with a transmitted signal. After a signal 


*Recommendation G.721, 32 kbits/s Adaptive Differential Pulse Code 
Modulation’, CCITT, 1984. 
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has been digitized, the signal is much less susceptible to 
channel noise since the signal can be regenerated as well as 
amplified along the way, thus reducing the possibility of 
being corrupted by the transmission system. The overall 
quality of digital transmission is then limited by the 
digitization process in an error-free transmission system. 

Figures 2 and 3 show general representations of a 
digital communication channel. The actual transmission (and 
storage of a digital waveform) uses an analog channel. The 
outside points of the communications channel are the 
transmitter and receiver, as shown in Figure 2. These are 
commonly combined in a single device known as a combo- 
CODEC (CODing and DECoding device). The codec 
supplies, on the coding or transmitting side, the necessary 
filtering to bandlimit the analog signal and avoid signal alias 
and A/D conversion. On the decoding or receiving side, the 
codec performs a D/A conversion and then interpolates or 
smooths the resultant signal. 

Figure 3 shows the digitized signal modulated for 
transmission in the network and then demodulated at the 
receiving end to retrieve the transmitted digital signal. 


PCM 


Digitization and coding of the analog signal at the 
transmitter can be performed in several ways. The 
complexity of the chosen method is related to availability 
of encoder memory and to the resultant delay in the encoding 
process. 

When digital signal transmission is implemented, 
memory and the resultant delay dictate that a simple scheme, 
such as Pulse Code Modulation (PCM), be implemented. 


PCM codes each sampled analog value of the input waveform 


to a unique or discrete value. The digital quantization 
introduces distortion into the signal waveform, as shown in 
Figure 1. 

A nonuniform quantization scheme may be used to 
COMPAND (COMpress and exPAND) the signal in the 
waveform coding and decoding blocks in the system, 
generating log-PCM. By using larger quantization steps for 
large amplitude signals and smaller steps for small amplitude 
signals, efficient use is made of the data bits for digital 
transmission while maintaining specific signal-to-quantization 
noise thresholds. With the two current methods of 
COMPANDING (A-law and p-law), the signal quality of a 
13-bit digitized signal is maintained while transmitting only 
8 bits per sample. 

While quantizers remove the irrelevancy in a signal, 
coders remove the redundancy. In PCM encoding, each 
sample of the input waveform is independent of all previous 
samples; no encoder memory is required. 
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Figure 2. Digital Communication of Waveforms 
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Figure 3. Digital Channel 


ADPCM 


Analysis of speech waveforms shows a high sample-to- 
sample correlation. By taking advantage of this property in 
speech signals, more efficient coding techniques have been 
designed to further reduce the transmission bit rate while 
preserving the overall signal quality. 


APCM 

Adaptive PCM (APCM) is a method that may be 
applied to both uniform and nonuniform quantizers. It adapts 
the stepsize of the coder as the signal changes. This 
accommodates amplitude variations in a speech signal 
between one speaker and the next, or even between voiced 
and unvoiced segments of a continuous signal. The adaptation 


TRANSMITTER 


QUANTIZER 
STEPSIZE 
ADAPTER 


q(k) 


may be instantaneous, taking place every few samples. 
Alternatively, it may occur over a longer period of time, 
taking advantage of more slowly varying features. This is 
known as syllabic adaptation. 

The basic concept for an adaptive feedback system, 
APCM, is shown in Figure 4. An input signal, s(k), in the 
transmitter is quantized and coded to an output, I(k). This 
output is also processed by stepsize adaptation logic to create 
a signal, q(k), that adapts the stepsize in the quantizer. 
Correspondingly, in the receiver, the received signal, I(k), 
is processed by an inverse quantizer (i.e., decoded), 
producing the reconstructed signal, s;(k). Like the 
transmitter, the quantized signal, I(k), is processed by 
adaptation logic to create a stepsize control signal, q(k), for 
the inverse quantizer. 


RECEIVER 


INVERSE 
QUANTIZER 
STEPSIZE 
ADAPTER 


q(k) 


Figure 4. APCM Block Diagram 
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DPCM 

The method of using the sample-to-sample 
redundancies in the signal is known as differential PCM 
(DPCM). The overall level of high correlation on a sample- 
by-sample basis indicates that the difference between adjacent 
samples produces a waveform with a much lower dynamic 
range. Correspondingly, an even lower variance can be 
expected between samples in the difference signal. A signal 
with a smaller dynamic range may be quantized to a specific 
signal-to-noise ratio with fewer bits. 

A differential PCM system, DPCM, is shown in Figure 
5. In Figure 5, the signal difference, d(k), is determined using 
a signal estimate, sp(k), rather than the actual previous 
sample. By using a signal estimate, se(k), the transmitter uses 
the same information available to the receiver. Each 
successive coding actually compensates for the quantization 
error in the previous coding. In this way, the reconstructed 
signal, s;(k), can be prevented from drifting from the input 
signal, s(k), as a result of an accumulation of quantization 
errors. The reconstructed signal, s,(k), is formed by adding 
the quantized difference signal, dg(k), to the previous signal 
estimate, Se(k). The sum is the input to predictor logic which 
determines the next signal estimate. A decoding process is 
used in both the transmitter and receiver to determine the 
quantized difference signal, dg(k), from the transmitted 
signal, I(k). 


ADPCM 

ADPCM combines the features of both the APCM and 
DPCM systems. Figure 6 shows the basic blocks combining 
adaptation and differencing features in an ADPCM system. 


TRANSMITTER 


QUANTIZER 


INVERSE 
QUANTIZER 
a PREDICTOR “ 


S_(k) 


Both quantizer adaptation and signal differencing require the 
storage (in memory) of one or more samples in both the 
transmitter and receiver. Furthermore, the transmitter must 
use some method to ensure that the receiver is operating 
synchronously. This is accomplished by using only the 
transmitted signal, I(k), to determine stepsize adaptation in 
the quantizer and inverse quantizer and to predict the next 
signal estimate. In this way, the blocks in the receiver can 
be identical to those in the transmitter. Additionally, the 
specific adaptation techniques are designed to be convergent 
and thereby help provide quick recovery following 
transmission errors. 

The ADPCM system, as used in digital telephony, is 
not an original signal coding system, but is actually a 
transcoder, converting between log-PCM and ADPCM 
codes. Currently there are a large number of systems using 
log-PCM for transmission. The ADPCM system incorporates 
both an adaptive quantizer and an adaptive predictor. The 
adaptive quantizer contains speed-control and scale-factor 
adaptation. A measure of the rate-of-change of the difference 
signal provides a means of determining the speed control. 
The scale factor adjustments to the difference signal adapt 
the fit of the quantization levels to minimize the signal-to- 
noise ratio. With speed control, the system can take 
advantage of both the instantaneous and syllabic adaptation 
rates, thereby adapting better to both speech and data signals. 
In the adaptive predictor, the prediction filter coefficients 
are updated by a gradient algorithm. Predictor adaptation 
improves the performance of the predictor for nonstationary 
signals (e.g., speech). 


RECEIVER 


INVERSE 
QUANTIZER 


PREDICTOR 


Figure 5. DPCM Block Diagram 
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ADAPTIVE 
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Figure 6. ADPCM Block Diagram 


THE ADPCM ALGORITHM 


The ADPCM algorithm has a receiver imbedded in the 
transmitter. This is important since, if the signal feedback 
used to determine the signal estimate, se(k), and consequently 
the quantized difference signal, dg(k), is the same as in the 
decoder, then the compensation for quantization errors can 
be made with subsequent difference samples. Since the 
decoder is actually imbedded in the encoder, each of the 
common blocks for transmitting and receiving is discussed 
in the following paragraphs. 

Figures 7 and 8 show block diagrams of an ADPCM 
transmitter and receiver as specified by CCITT. 


Encoder 

The function of the encoder or transmitter, shown in 
Figure 7, is to receive a 64-kbit/s log-PCM signal and 
transcode it to a 32-kbit/s ADPCM signal. This is 
accomplished by converting the log-PCM signal, s(k), to a 
linear signal, sj(k), from which an estimate, se(k), of the 
signal is subtracted to obtain a difference signal, d(k). The 
next step is to adaptively quantize this difference signal, d(k), 
by first taking the log (base 2), then normalizing by the 
quantization scale factor, y(k), and finally coding the result, 
I(k). A more uniform signal-to-noise ratio can be achieved 
by coding the log of the signal rather than the linear 
representation. The normalization provides the adaptation to 
the quantization and is based on past coded samples. 
Adaptation is controlled bimodally, being comprised of a fast 
adaptation factor for signals with large amplitude fluctuations 
(i.e., speech) and a slow adaptation factor for signals which 
vary more slowly (i.e., data). A speed-control factor, aj(k), 
weights the fast and the slow adaptation factors to form a 
single quantization scale factor, y(k). 
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The inverse adaptive quantizer uses the same signal, 
I(k), that has been transmitted to reconstruct a quantized 
version of the difference, dg(k), and the same adaptive 
quantization characteristics as the adaptive quantizer section. 

The quantized difference signal, dg(k), is input to an 
adaptive predictor which uses this input to compute a signal 
estimate, se(k). The signal estimate, se(k), is combined with 
the difference signal, dg(k), to determine a reconstructed 
signal, s;(k), which is the output in the decoder. This output 
is then subtracted from the next input sample to complete 
the feedback loop. 

The adaptive predictor makes use of both an all-pole 
filter and an all-zero filter. The all-pole filter is a second- 
order filter with constrained adaptive coefficient values 
designed to match the slowly varying aspects of the speech 
signal. Since an all-pole predictor is particularly sensitive 
to errors, the predictor makes use of a sixth-order all-zero 
filter to offer signal stability even with transmission errors. 


Decoder 

The function of the decoder or receiver, shown in 
Figure 8, is to receive a 32-kbit/s ADPCM signal and 
transcode it to a 64-kbit/s log-PCM signal. To accomplish 
this, the decoder utilizes many of the elements used by the 
encoder. The received data, I(k), is processed by an inverse 
adaptive quantizer, identical to the one in the corresponding 
encoder, to determine a quantized difference signal, dg(k). 
By filtering the difference signal, dg(k), through the adaptive 
predictor together with the previously reconstructed signal, 
Sr(k), a signal estimate, se(k), is obtained. The signal 
estimate, se(k), is added to the difference signal, dg(k), to 
compute the reconstructed signal, s,;(k). The reconstructed 
signal, s,(k), is converted from a linear-PCM to a log-PCM 
signal, Sp(k), which is then output following a synchronous 
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ADAPTATION 
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Figure 7. ADPCM Encoder Block Diagram 
(Diagram taken from CCITT Recommendation G.721) 


coding adjustment. The coding adjustment limits the errors 
in tandem codings of a signal. 

Note that the algorithm design achieves a convergence 
of the states of the encoder and decoder in spite of 
transmission errors. This convergence is a part of each of 
the adaptation computations and is demonstrated equationally 
in the following sections. The convergence is brought about 
by the inclusion of (1-2-N) terms which provide a finiteness 
to the memory of the adaptation parameters. 


Adaptive Quantization 
Adaptive quantization, a multistage process, is used to 
determine the quantization scale factor and the speed control 
that controls the rate at which the scale factor is adapted. 
Quantization is actually a four-bit quantization (a sign bit plus 
three-bit magnitude), since a four-bit signal is the transmitted 
output of the ADPCM transcoder. The adaptive quantizer 
block can be noted in Figure 7. 

The difference signal, d(k), an input to the quantization 
process, is calculated by subtracting the signal estimate, 
Se(k), from the linear-PCM signal, s)(k). 


d(k) = sj(k) — se(k) (1) 


This difference signal is normalized by taking the log 
(base 2) and subtracting from it the quantizer scale factor, 
y(k). 


[I(k)| —logz |d(k)| — y(k) (2) 


Table 1 is used to provide the magnitude of the 
quantization result, |I(k)|, from this normalized input. The 
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sign bit of the ADPCM output value, I(k), is the sign of the 
difference signal, d(k). 

The quantizer scale factor, y(k), is comprised of two 
parts, and therefore bimodal in nature. The two parts, y)(k) 
and yy(k), are weighted by the speed-control factor, aj(k). 
For speech signals, aj(k) will tend toward a value of one; 
for voiceband data, aj(k) will tend toward zero. Refer to both 
Figures 7 and 8 for the inclusion of the quantizer scale factor 
and speed-control factor adaptation blocks. 


y(k) = ay(kK)yu(k—1) + [1 — aj(k)] yi(k—1) (3) 


where 0 < aj(k) < 1 


One of the factors, yy(k), is considered to be unlocked, 
since it can adapt quickly to rapidly changing signals (e.g., 
speech) and has a relatively short-term memory. This factor, 
Yu(k), is recursively determined from the quantizer factor, 
y(k), and the discrete function, W(I). 


Yuk) = [1 — 2-5] y(k) + 2-5WII(k)] (4) 


where 1.06< yy(k) < 10.00 


The factor, W(1), found in Table 2, is a function of 
I which causes y,(k) to adapt by larger steps for larger values 
of I. This gives yy(k) the freedom to track a signal almost 
instantaneously. Since y(k) is in the logarithmic domain, W(1) 
is effectively a multiplier of the scale factor. 
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Figure 8. ADPCM Decoder Block Diagram 
(Diagram taken from CCITT Recommendation G.721) 


Table 1. I/O Characteristics of the Normalized Quantizer 


Normalized Quantizer 
Output 
logaldg(k)| - y(k) 


Normalized Quantizer 
Input Range 
logo|d(k)| — y(k) 


7 
6 
5 
4 
3 
2 
1 
(@) 


The other factor, y}(k), adapts more slowly and tracks 
signals which change slowly (e.g., voiceband data). This 
factor includes a lowpass filtering of the unlocked factor, 
Yu(k). By including yy(k) in the manner shown, yj(k) is 
implicitly limited to the same range of values as the explicit 
limit placed on yy(k). Furthermore, the unity limit of aj(k) 
provides the same limit implicitly for y(k) as for y,(k) and 
Yuk). 


A speed-control factor, aj(k), adjusts the relative 
weighting of these two scale factors by making use of the 
short- and long-term averages, dms(k) and d,j(k), 
respectively, of the coded output to determine how rapidly 
the signal is changing. The combined scale factor, y(k), 
cannot be larger than either the unlocked, yy(k), or locked 
y}(k), terms. Therefore, aj(k) is limited to one even if the 
predicted speed control, ap(k), is larger than one. 


1 ,if ap(k-1) > 1 
aj(k) = (6) 
ap(k—1) ,if ap(k-1) < 1 


Note that ap(k) is implicitly limited to a maximum value 
of 2, while the speed-control factor used to mix the two scale 
factors is capped at a value of 1. In determining ap(k), an 
additional term of 1/8 is added each time if the difference 
in the short- and long-term averages becomes too large (i.e., 
| dms(k) — dmi(k) | >2—3dmi(k) ) or if there is an idle 
channel (i.e., y(k) < 3 ). Where neither of these conditions 
exist, a uniform, slowly varying signal can be assumed, such 
as occurs in data transmission. 


yi(k) = [1 — 2—6] yy(k—-1) + 2—®yy(k) (5) 


Table 2. Scale-Factor Multipliers 
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[1 — 2-4] ap(k—-1) + 2-3, if 
ldms(k) — dmik)| > 2—3dmu(k) 


ap(k) = & [1 — 2-4] ag(k—-1) + 2-3, if yk) < 3 


[1 — 2-4] ap(k—1), otherwise (7) 
The short-, dms(k), and long-term, dmj(k), averages 
of the transmitted ADPCM signal, I(k), are actually 


determined by averaging a weighted function, F(I), of the 
transmitted I, shown in Table 3. 


dms(k) = [1 — 273] dms(k-1) + 2-9F [I(k)] (8) 


[1 — 2-7] dmik—1) + 2-7F [I(k)] (9) 


dmi(k) 


The scale-factor and speed-control adaptations are a 
part of both the encoder and decoder logic. The adaptive 
quantization block has been specifically included in Figure 
7, showing the encoder. For the decoder, the adaptive 
quantizer is included as part of the synchronization block to 
aid in the reduction of errors in tandem codings. 


Table 3. Rate-of-Change Weighting Function 


P| 6 oe | 4) IL | 0) 
et 7 3 | a af 6] 30 20: | 


Inverse Adaptive Quantization 

Inverse adaptive quantization is a process in which the 
four-bit ADPCM signal, I(k), is used to determine the 
normalized log of the difference signal from Table 1. The 
result is actually a quantized version of the difference signal, 
dg(k), determined by adding the scale factor, y(k), to the 
value specified by Table 1 and calculating, the inverse log 
(base 2) of this sum. 


dg(k) =logy—! [floga ldg(k)l — yk} + yK)] = (10) 


For both the encoder and decoder, this quantized 
difference signal is the input to the reconstruction signal 
calculator and the adaptive predictor, as shown in Figures 
7 and 8. 


Adaptive Prediction 

The adaptive predictive filter is a two-pole, six-zero 
filter used to determine the signal estimate. The combination 
of both poles and zeroes allows the filter to model more 
effectively any general input signal. The sixth-order all-zero 
section helps to stabilize the filter and prevent it from drifting 
into oscillation. For both the poles and the zeroes, the 
coefficients, aj(k) and b;(k), respectively, are adapted. This 
adaptation is based upon a gradient algorithm to further adjust 
the filter model to the input signal. Figures 9 and 10 show 
the sixth-order and second-order filters, respectively. 

The signal estimate, se(k), represents the sum of the 
all-pole filter and the all-zero filter. Since the sum of the all- 
zero filter is used to aid the determination of the pole 
coefficients, it is also extracted as a separate sum, Sez(k). 
The reconstructed signal, the output in the receiver, is the 
sum determined by the quantized difference signal dg(k), and 
the signal estimate, se(k). 


2 
Se(k) = ) aj(k—1)s-(k—i) + Sez(k) (11) 
i=] 
6 
Sez(k) = )) bi(k—1)dg(k—i) (12) 
i=1 
Sr(k—i) = Se(k—1) + dg(k—i) (13) 


The adaptation of the pole coefficients, aj(k), is shown 
in the equations below. The gradient function is determined 
from a signal, p(k), that is equivalent to the reconstructed 
signal minus the contribution of the pole filter output. 
Stability of the filter is further provided by explicitly limiting 
the coefficients. 


$ez(k) 


ACCUMULATOR 


Figure 9. Sixth-Order All-Zero (FIR) Filter 
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dg(k) + 


ACCUMULATOR 
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Figure 10. Second-Order IIR Filter 


ay(k) = [1 — 2-8] ay(k-1) 
+ 362-8sgn [p(k)] sgn[p(k—1)] (14) 


where |aj(k)| < 1 — 2-4 — ag(k) 
a2(k) = [1 — 2-7] ag(k—1) 
+ 2—7{sgn [p(k)] sgn [p(k—2] 
— flay(k—1)] sgn [p(k)] sgn [p(k—1]} (15) 


where |a2(k)| < 0.75 


p(k) = dg(k) + Sez(k) (16) 


4a; __—z if|ay| < 1/2 
f(a) = (17) 
2sgn(aj), if|ay| > 1/2 


where sgn(O) = +1 


For the coefficients, bj(k), of the sixth-order all-zero 
filter, the adaptation procedure is similar, but the limit is 
implicit in the equations to a maximum of +2. The gradient 
function, in this case, is determined by the current difference 
signal, dg(k), and corresponding difference signal, dg(k —i), 
at the specific filter tap. 


bi(k) = [1 — 2-8] b(k—-1) 
+ 2—7sgn [dg(k)] sgn [dg(k—i)] (18) 


where i= 1,2, ..6 and —2 < bj(k) < +2 
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Signal Conversion 

Signal conversion consists or ue conversion from an 
8-bit log-PCM representation of a signal to a 13-bit linear 
PCM representation (note Figure 7), or the reverse (note 
Figure 8). Signal conversions of this type are described in 
the application report on COMPANDING ROUTINES FOR 
THE TMS32010. In the encoder, the log-PCM signal, s(k), 
is expanded to create the linear-PCM value, s}(k). The 
decoder, on the other hand, compresses the reconstructed 
signal, s,;(k), to create the log-PCM signal, Sp(k). 


Reconstructed Signal 
Synchronization 
To avoid a cumulative distortion in synchronous tandem 
codings, an adjustment to the reconstructed signal is 
specified. The adjustment block, shown in Figure 8, estimates 
the quantization of the encoder by determining a difference 
signal and executing the adaptive quantization logic. The 
quantization result is an estimate of the received value of I(k). 
The difference signal, dx(k), is determined by 
subtracting the signal estimate, se(k), from the linear-PCM 
signal, s}x(k), which is itself determined by expanding the 
log-PCM signal, sp(k). 


dx(k) = sjx(k) — Se(k) (19) 


The adaptive quantization process produces the estimate 
of the ADPCM code value, Iy(k). If the estimate implies a 
difference signal that is lower than the received interval 
boundary, the log-PCM code is changed to the next most 
positive value. An estimate implying a difference signal 
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larger than the received interval boundary requires the log- full-duplex transcoder. The TMS320M10 is a masked ROM, 
PCM code to be changed to the next most negative value; microcomputer version of the TMS32010, which requires 
otherwise, the log-PCM value is left unchanged. The adjusted no external program memories. A full-duplex transcoder 
log-PCM value is denoted as sg(k) in the following equation provides transmission in both directions simultaneously. Such 
to differentiate it from the input value, sp(k). a transcoder is depicted in Figure 11. A complete system 

diagram of a full-duplex communications channel is shown 

in Figure 12. In comparison to current systems that modulate 


Sp (k), dx(k) < lower interval boundary a 64-kbit/s A-law or p-law PCM signal on a carrier for 
transmission, the described system transcodes the 64-kbit/s 
Sa(k) = § Sp(k), dx(k) > upper interval boundary code to a 32-kbit/s code. This 32-kbit/s code, which requires 
correspondingly less bandwidth, is modulated on the carrier 

Sp(k), otherwise (20) for transmission. 


where 
Sq(k) = output PCM of the decoder 


Sp (k) = next more positive PCM level (if sp(k) is the 
most positive level, then sp (k) = Sp(k) ) 


Sp (k) = next more negative PCM level (if Sp(k) is the 


A 
most negative level, then sp (k) = Sp(k 64-KBIT/S PCM 
st negative Sp (k) = Sp(k) ) (LAW LAW 


32-KBIT/S 
a 


FULL-DUPLEX IMPLEMENTATION OF 
ADPCM ON A TMS32010 


[arf 
= : 
The specific implementation of ADPCM presented here 


involves the use of a single TMS320M10 to accomplish a Figure 11. Full-Duplex ADPCM Transcoder 


Figure 12. Full-Duplex Telecommunications Channel 
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Hardware Logic and 1/O. 

The hardware required to implement the ADPCM 
system consists of an addition to an existing circuit. As shown 
in Figure 13, the TMS32010 addresses the external I/O 
blocks through its port addressing structure. The lower three 
address lines, A2-A0O, form a port address that can be 
decoded by port decode logic to provide specific enable lines 
(e¢.g., WRTENI and RDEN1) to the various 
peripheral blocks. The TMS32010 reads and writes the 
64-kbit/s data through the codec interface eight bits at a time. 
The sampling frequency is 8 kHz. For this full-duplex 
implementation, one sample is written and one sample is read 
every 125 ys. 

Figure 13 also shows the serial interface to the codec 
that provides the »-law companded PCM data, although this 
is not part of the transcoding system itself. The log-PCM 
signal may already be available (e.g., in existing digital 
telecom networks) and, as such, may be interfaced to the 
TMS32010 either directly as parallel data or serially through 
conversion logic. Parallel codecs are also becoming available 
to reduce the hardware logic and interface required for those 
systems which do not already include a codec. The 
TMS32010 is available at crystal and clock input rates of 
20.5 MHz which may be divided down to provide the codec 
timing and further reduce the logic requirement. 


¢D15 

+D14 

;BI0 | DEN 
™S32010 

$INT 


T Half-duplex, CCITT bit-compatible, version only 
+ Full-duplex version only 


RDEN2 


WRTEN1 Dx 


At the other end of the transcoder function, the 
TMS32010 reads and writes the 32-kbit/s ADPCM data 
through the ADPCM interface four bits at a time for each 
125-ys period. This interface provides four-bit parallel data 
which may be serialized, if required, for transmission or 
storage. 


Software Logic and Flow 

Tables 4 and 5 list the various blocks in the algorithm, 
directly relating them to Figures 7 and 8 by the signal names 
given in the description and function. The blocks are listed 
in the order in which they are executed. Also listed is 
processor demand or loading which consists of the amount 
of program memory used to implement the given function 
and the number of instruction cycles executed in worst case. 
There are more blocks in the table than are shown in the 
figures (e.g., the algorithm uses the adaptive predictor at one 
point to produce the signal estimate, and later returns to 
update or adapt the predictor coefficients). Each block has 
been implemented using the equations given in previous 
sections concerning the ADPCM algorithm. For 
convenience, the equations implemented in each block are 
listed in the description section for the block. A more detailed 
description of the TMS32010 implementation is given in the 
next section. 


ANALOG OUT 


ANALOG IN 


ADPCM OUT 


ADPCM 
INTERFACE 


ADPCM IN 


Figure 13. System Interface of a TMS32010 ADPCM Transcoder 
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Table 4. Full-Duplex Transmitter 


Program 
Memory 


Order Description CPU 
Clocks (Words) 
INPUT Read an 8 bit p-law PCM sample [s(k)] and linearize it to 7 0004 
PCM a 12-bit sample [s;(k)]. 


COMPUTE Calculate the signal estimate [s,(k)] from the previous 
SIGNAL data samples [dg(k)] and reconstructed samples [s,(k)] 
ESTIMATE through the predictor filter. 

(12),(11) 


COMPUTE Calculate speed control [a;(k)] and quantizer scale factor 
ADAPTIVE [y(k)] from past quantizer output [I(k)]. 
QUANTIZER (6),(3) 


COMPUTE Calculate the difference signal [d(k)] from the current 
DIFFERENCE sample [s)(k)] and signal estimate [s.(k)]. 
SIGNAL (1) 


COMPUTE Caiculate the log of the difference signal [d(k)] and 46 OOAD 
QUANTIZED adaptively quantize the result to yield the ADPCM 
OUTPUT output [I(K)]. 

(2) 


OUTPUT Write the ADPCM output [I(k)]. 2 
ADPCM 


COMPUTE Calculate the inverse of the adaptively quantized signal 
RECON- 

STRUCTED 
SIGNAL 


7. 


[dg(k)] and the reconstructed signal difference [s,(k)]. 
(10),(13) 


COMPUTE 
SCALE 
FACTOR 


Calculate the updates for the scale-factor adaptation. 
(4),(5) 


COMPUTE 
SPEED 
CONTROL 


Calculate the update for the speed-control adaptation. 
(8),(9),(7) 


COMPUTE 
PREDICTOR 
ADAPTATION 


Calculate the updates for the adaptive predictor filter 
coefficients. 
(18),(16),(17),(14),(15) 
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COMPUTE 
SIGNAL 
ESTIMATE 


COMPUTE 
ADAPTIVE 
QUANTIZER 


COMPUTE 
QUANTIZED 
DIFFERENCE 


COMPUTE 
SCALE 
FACTOR 


COMPUTE 
SPEED 
CONTROL 


COMPUTE 
RECON- 
STRUCTED 
SIGNAL 


COMPUTE 
PREDICTOR 
ADAPTATION 


COMPUTE 
LOG-PCM 


OUTPUT 
PCM 


Table 5. Full-Duplex Receiver 


Read the ADPCM input [I{k)]. 


Calculate the signal estimate [s,(k)] from the previous 
data samples [dg(k)] and reconstructed samples [s,(k)] 
through the predictor filter. 

(12),(11) 


Calculate speed control [a,(k)] and quantizer scale factor 
{y(k)] from the past quantizer output [I(k)). 
(6), (3) 


Calculate the inverse of the adaptively quantized signal 
[dg(k)]. 
(10) 


Calculate the updates for the scale-factor adaptation. 
(4),(5) 


Calculate the update for the speed-contro! adaptation. 
(8),(9),(7) 


Calculate the reconstructed signal [s,(k)]. 
(13) 


Calculate the updates for the adaptive predictor filter 
coefficients. 
(18),(16),(17),(14),(15) 


Convert the reconstructed linear-PCM signal [s,(k)] to a 
p-law PCM signal [s,(k)). 


Write the y-law output [sp)k)]. 


Implementation and Advantages of 
TMS32010 Architecture 

This implementation is only concerned with p-law 
PCM, although A-law PCM may also be used. Additional 
information on log-PCM companding is found in an 
application report, COMPANDING ROUTINES FOR THE 
TMS32010. The implementation is simplified here so that 
the expansion is a simple table lookup which saves 21 
instruction cycles over the algorithmic approach. 

The processing of the signal through the predictor filter 
is similar to the processing discussed in the application report, 
IMPLEMENTATION OF FIR/HR FILTERS WITH THE 
TMS32010. The filter used in this ADPCM algorithm is a 
combination of a second-order IIR filter and a sixth-order 
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all-zero or FIR filter. The filters are shown in Figures 9 and 
10, respectively, with the system interaction shown in Figure 
14. 

Several manipulations of data format occur in adapting 
the predictor coefficients. In updating the coefficients of the 
all-zero filter (the B;’s), the coefficients that are normally 
Q14 numbers are loaded with a shift allowing the calculations 
to be done in a Q29 representation. This greatly simplifies 
the subtraction of the leakage term and the prediction gain. 
The leakage term, which occurs here in the predictor 
coefficient adaptation and also in the speed-control and scale- 
factor adaptation, controls the rate of change of the parameter 
away from zero and towards the absolute maximum limits 
of the particular parameter. The prediction gain also uses 
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FILTER 


SIXTH-ORDER 


SECOND-ORDER 


FILTER 


Figure 14. Predictor Filter 


an approach whereby the signs are actually stored as a signed 
Q11 value. In this way, the product is a Q22 value of the 
correct sign and can be added to the B value, equivalent to 
a Q29 value times 2—7. As with the filter process itself, the 
signs of the Dq values are propagated through each filter tap 


The adaptive quantization section requires that the log 
(base 2) of the difference signal be taken, the result 
normalized, and the normalized value quantized. Taking the 
log (base 2) of a number is accomplished by using the 
approximation 


delay with the LTD instruction. An example for one of the 
B; values is shown in Figure 15. 

A similar process takes place in adapting the prediction 
coefficients (Aj’s) in the second-order filter, although the 
fixed-point representation of the coefficients is Q26. The 
remaining requirement is to limit-check the Aj values. 


loga(1 + x) = x (21) 


ie Te EE I MIE IE IE TEE IE A Te ee ee Ie ene ee Ae Ree, Pee eae ee ee ee ae Pee 
He 
*; COMPUTE COEFFICIENTS OF THE 6TH-ORDER PREDICTOR 
He 
*; Bi(k) = {1 - 2**-8] * Bi(k-1) 
*; + 2**-7 * SGN[DQ(k)] * SGN[DQ(k-i)] 
He 
*; FOR i =1... 6 
e AND Bi IS IMPLICITLY LIMITED TO +/- 2 
% es 
*; NOTATION: Bn  -- 16b TC (Q14) 
*; SDQn -- +2048 IF DQn POSITIVE (QI!) 
- -2048 IF DQn NEGATIVE (Q11) 
Ke 
PSCC TECCCTOTCT TOSCO OCS STCCTOSCCCOCTTTrr errr erere errr teers 
Me 
GETB6 LT S0Q6 * (Q11) 
LAC B6,15 * (Q29) 
SUB B6,7 * BE * 2**-8 (Q29) 
MPY SDQ * SGN(SDQ) *SGN(SDQ6)*2**~7 (Q29) 
LTD S0Q5 * (Q11) 
SACH B6,1 * (Q14) 


Figure 15. Predictor Coefficient Adaptation Code 
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The characteristic of the result is the bit position of the 
most significant one digit in the absolute value. The result 
can be represented as a Q7 value. Finding the most significant 
digit is most efficiently done by a binary search technique. 
This technique is discussed in the application report, 
FLOATING-POINT ARITHMETIC WITH THE 
TMS32010. Since the exponent is part of the number instead 
of being stored in a separate register, one of the auxiliary 
registers is loaded with the exponent value. The auxiliary 
register stores it in memory and adds it to the mantissa in 
the accumulator. A short example of this is shown in the 
excerpted code in Figure 16 where the signal has an assumed 
exponent value of 9. 


SACL D 
SACH DS 


ADAPTIVE QUANTIZER 


ye See) ee ee ee ee, ee eo 2 


x «x & & &K KK KK KK KK KK HK HR KK KK K KK KK KR XR 


ee ~2~e 2e we @8@ we 2O we we 


AQUAN ABS 
SACL TEMP1 
GETEXP SUB ONE,8 
EXP9 LARK 0,9 * EXP = 9; 


LAC TEMP1,14 


Normalization of this log value is simply a subtraction 
of a scale factor which may be as large in fixed magnitude 
as the largest logarithmic value represented in Q7 notation. 
The result of the subtraction may be a negative value. Since 
the normalized result is to be quantized in a nonuniform 
manner and one of the quantization levels could contain both 
positive and negative values, the normalized result is scaled 
by adding a fixed value of 2048. Nonuniform quantization 
can be performed by a binary-type search technique. The 
normalization and quantization are included in the program 
shown in Figure 16. 


* STORE THE DIFFERENCE SIGNAL 
* SAVE THE SIGN OF D 


3e Fe HE HE HE HE HE TE HE IE HE DE HE HE TE HE HE HE EE HE FE OE HE FE FE HE IE TE OE HE OE HE HE OE HE HE FE HE HE OE OE HE HE HE HE HH HE HE HE 


INPUTS: DIFFERENCED PCM SAMPLE -- D 
QUANTIZER SCALE FACTOR -- Y 
OUTPUT: 4-BIT QUANTIZED, NORMALIZED DIFFERENCE -—- | 
NOTATION: OD -- 16b TC (QO) 
Y -- 13b SM (Q9) POSITIVE VALUE ONLY 


He -9E 96 3b FE OE OE FE HE 9b 9G 3b OE OE FE FE FE FE FE FE HE SE OF HEE HE HE HE HE HE OE HE HE HE UE EE HE FE OE FE FE FE FE SE HE FE FE HE FE FEE TE EE EEE 


SSS SoS + 
V 
--+ DLN +-------- + I 
—o >! QUAN }------ > 
——+ +-------- + 


(eee eee eee ee eee eee ERE ERE SESE ERR RRR ERR RRR RRR RRR SRE ee 


COMPUTE THE LOG OF THE DIFFERENCE SIGNAL 


* LOGS OF NEGATIVES ARE UNREAL. 
* SAVE THE ABSOLUTE DIFFERENCE. 
* BEGIN BINARY SEARCH. 


STORE IN ARO. 


* LOAD TO ISOLATE 8 MSB’S. 


Figure 16. Adaptive Quantization Code 
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SACH TEMP] 
LAC TEMPI " 
B  GETMAN 
GETMAN AND M127 " 
SAR 0,TEMP1 . 
ADD TEMP1,7  * 
We 
*; SCALE BY SUBTRACTION 
He 
SUBTB ADD ONE, 11 


SUB TEMP3 * 


; 4-BIT QUANTIZER 


ITAB! EQU 2041 
ITAB2 EQU 2171 
ITAB3 EQU 2250 
ITAB4 EQU 2309 
ITABS EQU 2358 
ITAB6 EQU 2404 
ITAB7 EQU 2453 


% 


QUAN SUB K2309 * 


BGEZ CI4T0O7 
C10TO3 ADD K138 * 
BGEZ CI2TO3 
C1loTo!l ADD K130 ee 
BGEZ IEQ! 
TEQO LACK O 
B GETIM 
IEQ] LACK 1 
B GETIM 
C1I2TO3 SUB K79 Mg 
BGEZ IEQ3 
1EQ2 LACK 2 
B GETIM 
1EQ3 LACK 3 
B GETIM 
Ci14TO7 SUB K95 * 
BGEZ CI6TO7 
CI5TO6 ADD K46 ” 
BGEZ IEQ5 
TEQ4 LACK 4 
B GETIM 
1EQS LACK 5 
B GETIM 
Cl6éTO7 SUB K49 " 
BGEZ IEQ7 


* SAVE MANTISSA. 
RELOAD FOR MANTISSA RECOMBINATION. 


MASK TO RETAIN ONLY SEVEN BITS. 
MOVE EXPONENT TO MEMORY FROM ARO. 
ADD EXPONENT TO MANTISSA FOR LOG VALUE. 


TEMP3 


ITAB4 
ITAB2 


ITAB1 


ITAB3 


I TAB6 


ITAB5 


I TAB6 


* ADD AN OFFSET OF 2048. 


QUANTIZATION TABLE FOR 32KB OUTPUT (OFFSET: 


Figure 16. Adaptive Quantization Code (Continued) 


17. 32-kbit/s ADPCM with the TMS32010 


value involves another technique. The code to complete this 
task is much shorter in program memory requirement and 
somewhat faster in execution time. The same type of 
approximation is involved in determining the antilog as used 
in taking the log, 


IEQ6 LACK 6 


B GETIM 
IEQ7 LACK 7 
GETIM SACL IM * ACCUM = iI} 
XOR DS * ADD SIGN BIT AND FLIP IF NECESSARY. 
AND M15 * MASK FINAL FOUR-BIT VALUE. 
SACL I * SAVE ADPCM OUTPUT VALUE. 


Figure 16. Adaptive Quantization Code (Concluded) 


Determining the inverse of the 4-bit quantized ADPCM After separating the exponent from the mantissa in the 


log representation, the quantized difference signal may be 
recovered by using the exponent to select a scaling factor. 
The scaling factor or multiplier is used to shift the mantissa 
to the proper representation, either right or left. Some of the 
multipliers may be stored as negative values rather than 


positive values, using the sign of the result to determine 
whether the answer is obtained from the high half of the 


logg—! (x) = 1 +x (22) accumulator (effectively a right shift) or from the low half 


of the accumulator (a left shift). The program for this process 


is shown in Figure 17. 


Ke 
+ 


Peg eae aE OE Re Oe Pe Ae Pe eee eee Le Oe ie ee EE ee ee ee ee ee ee ae 


*; INVERSE ADAPTIVE QUANTIZER 


He 

*5 INPUTS: 4-BIT QUANTIZED SAMPLES -- IM 

ae QUANTIZER SCALE FACTOR -- Y 

He 

"% OUTPUT: RECONSTRUCTED DIFFERENCE SIGNAL -- DQ 
% eo 

ie NOTATION: Y -- 13b SM (Q9) POSITIVE VALUE ONLY 
es DQ -- 166 TC (QO) 

a SDQ -- +2048 IF DQ POSITIVE (Q11) 
*5 -2048 IF DQ NEGATIVE (Q11) 
He 


Ho HHH HH HH HH HM HH HE HH MH TE TH HE HE HE ME TE OE TE TE HE HE HE HE TE OE HE HE EE HE EE Oe HE HE Ee 


Ho 


7 DQSs 


2 ae +---+---+ DQLN +------- + DQL +-------- + DQ 


Ho HHH HHH HE HH HH HH HE HE EE HE EE HE TE HE EE HE TE HE He HE HE ee EE EH EE EE HE EE He EE HO 


% eo 


*; CONVERT QUANTIZED DIFFERENCE BACK TO LOG DOMAIN 


Re 
? 


Figure 17. Inverse Adaptive Quantization Code 
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TAQUAN LAC IM 
ADD INQTAB * RECONSTRUCTION TABLE 
TBLR TEMP! * READ NORMALIZED VALUE. 
ADD NORMALIZING SCALE FACTOR BACK IN 


ADDA LAC TEMP1 


ADD TEMP3 *Y >> 2 
AND M2047 
SACL TEMP2 
He 
*; CONVERT THE LOG VALUE TO THE LINEAR DOMAIN 
% eo 
am 
ALOG LAC TEMP2,9 * EXTRACT EXPONENT. 
SACH TEMP * SAVE EXPONENT VALUE. 
LACK 127 
AND TEMP2 * MASK FOR LOG MANTISSA ONLY. 
ADD ONE,7 * 14x 
SACL TEMP2 * EXTRACT MANTISSA. 
LT  TEMP2 * PREPARE TO SHIFT. 
LAC TEMP! 
ADD SHIFT * LOOK UP MULTIPLIER. 
TBLR TEMP3 
MPY TEMP3 * MULTIPLY MANTISSA BY SHIFT FACTOR. 
PAC 
BLZ LEFTSF * NEGATIVE VALUES CORRESPOND TO LEFT SHIFT. 
SACH DQ, 1 * RIGHT SHIFT; SAVE MAGNITUDE OF DQ. 
B ADDSGN 
LEFTSF ABS * LEFT SHIFT; RESTORE MAGNITUDE. 
SACL DQ * SAVE MAGNITUDE OF 0Q. 
ADDSGN LAC ONE,11! * ASSUME POSITIVE AND SAVE THE SIGN. 
SACL SDQ * (SIGN IS Qliz; REMEMBER FILTER.) 
LAC I * CHECK SIGN OF SAMPLE. 
SUB ONE,3 
BLZ QSFA * FINISHED FOR POSITIVE VALUES (1<8). 
ZAC 
SUB 0Q * COMPUTE TWO’S COMPLEMENT OF THE MAGNITUDE. 
SACL DQ * SAVE NEGATIVE DQ VALUE. 
LAC MINUS,11 * SIGN IS Qli; REMEMBER FILTER. 
* 


SACL SDQ SAVE SIGN. 


Hoe 

*; INVERSE QUANTIZING TABLE 

*%e 

IQTAB BSS 0 
DATA 65401 
DATA 68 
DATA 165 
DATA 232 
DATA 285 
DATA 332 
DATA 377 
DATA 428 


Figure 17. Inverse Adaptive Quantization Code (Continued) 


488 17. 32-kbit/s ADPCM with the TMS32010 


He 
’ 


*; SHIFT MULTIPLIER TABLE 


ee 
? 


SHF T BSS 0 
DATA 256 
DATA 512 
DATA 1024 
DATA 2048 
DATA 4096 
DATA 8192 
DATA 16384 
DATA -1 
DATA -—2 
DATA —-4 
DATA -8 
DATA —16 
DATA -32 
DATA -64 
DATA -128 


Figure 17. Inverse Adaptive Quantization Code (Concluded) 


The adaptation of the speed-control and the scale-factor 
parameters, used to adapt the stepsize in the adaptive 
quantizer and inverse adaptive quantizer, requires multiple 
uses of the technique of adjusting the fixed-point 
representation. The Q point is adjusted for convenience of 
the table constants which are part of the adaptation process 
and for saving the output value from the accumulator. Some 
limit-checking must also take place in calculating the 
unlocked-scale factor and the speed-control parameter. 
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In the calculation of the locked-scale factor and its 
inclusion in the mixing process for determining the overall 
scale factor used for stepsize quantization, the parameter is 
maintained with a greater resolution (19 bits of value plus 
its sign) than can be stored in a single memory. Calculations 
involving this parameter must then become two stage, both 
in terms of accumulations and in determining products. The 
code involving this parameter is listed in Figures 18 and 19. 
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Se ME HE IE FE FE HE HE FE 2 ME HE TE HE OE HE 9 HE HE HE EE HE HE HE TE I HE HE HE HE Oe HE HE 9 FE HE HE HE HE HE HE HE EOE HE EE OE He HE EO EH HH 


QUANTIZER SCALE FACTOR ADAPTATION 
INPUT: I : 32KB CODED SAMPLES 


OUTPUT: YU,YL : NEXT SAMPLE SCALE FACTOR 


NOTATION: Y a 
YU = 
YL = 


13b SM (Q9) POSITIVE VALUE ONLY 
136 SM (Q9) POSITIVE VALUE ONLY 
19b SM (Q1I5) POSITIVE VALUE ONLY 


3 He HE HE HE HE HE ME HE HE HE TE TE TE OE HE HE HE HE FE TE HE OE OE HE HE EE HE HE Ee EE Ee HE HE HE HE HE He EE HE 


x * *®* kK K KK KOK KOK KOK OK OK 
we we we te Oe ee we ee we ee ee ee we we we 


He 
*; UPDATE SLOW ADAPTATION SCALE FACTOR -—- CONSTANT = 1/64 
% oe 
*; YL(kK) = (1-2**-6)*YL(k-1) + 2**-6 * YU(k) 
Cy 
FILTE LAC YLH,6 * SHIFT YL LEFT BY 6. 
SACL TEMP] * TEMPI = YLH * 2**6 
LAC YLL,6 
SACL TEMP2 
SACH TEMP3 * TEMP3 ! TEMP2 = YLL * 2**6 
LAC TEMP3 * SUPPRESS SIGN EXTENSION. 
AND M63 
SACL TEMP3 
ZALH TEMP1 
ADOH TEMP3 
ADDS TEMP2 * ACCUM = YL * 2**6 
SUBH YLH 
SUBS YLL * ACCUM = YL * 2**6 -— YL 
ADD YU,6 * ACCUM = YL * 2**6 -— YL + YU 
SACL TEMP1 
SACH TEMP2 * RESULT = YL (SHIFTED LEFT BY 6) 
LAC TEMP1,10 * SHIFT RESULT RIGHT 6 -—-> ql15 
SACH TEMP! 
LAC TEMP1 
AND M1023 * MASK SIGN EXTENSION. 
ADD TEMP2,10 
SACL YLL * SAVE YLL. 
SACH YLH 
LACK 7 * MASK UPPER 13 BITS. 
AND YLH 
SACL YLH * SAVE YLH. 


Figure 18. Quantizer Scale-Factor Adaptation: Locked-Factor Calculation 
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= * «x kK K 
— Ce we @e we we 


FORM LINEAR COMBINATION OF FAST AND SLOW SCALE FACTORS 


Y(kK) = (1-AL(kK))*YL(kK-1) + AL(kK)*YU(k-1) 


Xx LAC YLL,10 * SHIFT YL RIGHT BY 6. 
SACH TEMP3 * (ITE SCALE YL TO MATCH YU SINCE YL 
LAC TEMP3 * CONTAINS 6 MORE LSB’S) 
AND M1023 
ADD YLH,10 
SACL TEMP3 * LOW HALF 
LAC YU 
SUB TEMP3 * YU=(YEED>>6) 

SACL TEMP3 

ZALH YLH 

ADDS YLL 

LT AL * AL IS IN 1.Q6 

MPY TEMP3 

APAC * YL + AL*(YU-(YLL>>6) ) 
SACL -TEMP3 

SACH TEMP2 = TEMPZ TEMPS Si S296 
LAC TEMP3,10 * SHIFT RIGHT BY 6. 
SACH TEMP3 

LAC TEMP3 

AND M1023 * MASK SIGN EXTENSION. 
ADD TEMP2,10 

AND M8191 

SACL Y * SAVE Y. 

LAC Y,14 

SACH TEMP3 * SAVE Y >> 2. 


Figure 19. Quantizer Scale-Factor Adaptation: Mixing 


CCITT IMPLEMENTATION OF ADPCM ON 
A TMS32010 


The implementation of ADPCM that produces a bit- 
for-bit compatible solution with the CCITT test vectors uses 
a single TMS320M 10 to accomplish a half-duplex transcoder. 
This solution can provide capability as either a transmitter 
or a receiver using either A-law or p-law companding. 


Hardware Logic and I/O 

The hardware system for this transcoder 
implementation differs from Figure 13 in that data pins D15 
and D14 are used to determine the mode of operation. Table 
6 shows the operating mode for the various combined states 
of the data pin inputs. 

Additionally, as has been noted in Figure 13, the 
interrupt or sample timing is an input to the INT pin in 
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Table 6. Operating Mode Selection 


Operating Mode 


p-law transmitter 


p-law receiver 


A-law transmitter 
A-law receiver 


*H = High logic level 
L = Low logic level 


the full-duplex implementation; here it is an input to the BIO 
pin. Each 125-ys period, the TMS32010 reads a 64-kbit/s 
sample from the codec and writes a 32-kbit/s sample to the 
ADPCM interface, or it reads the 4-bit ADPCM sample and 
writes an 8-bit PCM sample to the codec. 

For real-time execution, the TMS32010 requires the 
use of a 25-MHz clock input. 
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Software Logic and Flow 

Tables 7 and 8 list the various blocks in the algorithm, 
directly relating them to Figures 7 and 8 by the signal names 
given in the description and function. No differentiation is 
made between the transmitter or receiver using A-law or p- 
law. The blocks are listed in the order in which they are 
executed. Also listed is processor demand or loading which 
consists of the amount of program memory used to implement 
the given function and the number of instruction cycles 
executed in worst case. There are more blocks in the tables 
than are shown in the figures (e.g., the algorithm uses the 


adaptive predictor at one point to produce the signal estimate, 
and later returns to update or adapt the predictor coefficients). 
Each block has been implemented using the equations given 
in previous sections concerning the ADPCM algorithm. For 
convenience, the equations implemented in each block are 
listed in the description section for the block. Additional 
details of the TMS32010 implementation are given in the next 
section, especially as they differ from the full-duplex 
implementation. The appendix contains a complete listing 
of the code. 


COMPUTE 
SIGNAL 
ESTIMATE 


COMPUTE 
ADAPTIVE 
QUANTIZER 


COMPUTE 
DIFFERENCE 
SIGNAL 


COMPUTE 
QUANTIZED 
OUTPUT 


STRUCTED 
SIGNAL 


COMPUTE 
SCALE 
FACTOR 


COMPUTE 
SPEED 
CONTROL 


COMPUTE 
PREDICTOR 
ADAPTATION 


Table 7. CCITT Transmitter 


Program 
Memory 
(Words) 


Description 


Read an 8 bit log-PCM sample [s({k)] and linearize it to a 
12-bit sample [s,(k)]. 


Calculate the signal estimate [s,(k)] from the previous 
data samples [dg(k)] and reconstructed samples [s,(k)] 
through the predictor filter. 

(12), (11) 


Calculate speed control [a;(k)] and quantizer scale factor 
[y(k)] from past quantizer output [I(k)]. 
(6), (3) 


Calculate the difference signal [d(k)] from the current 
sample [s,(k)] and signal estimate [s,(k)]. 
(1) 


Calculate the log of the difference signal [d(k)} and 
adaptively quantize the result to yield the ADPCM 
output [I(k)]. 

(2) 


Calculate the inverse of the adaptively quantized signal 


[dg(k)] and the reconstructed signal difference [s,(k)]. 
(10), (13) 


Calculate the updates for the scale-factor adaptation. 
(4), (5) 


Calculate the update for the speed-control adaptation. 
(8), (9), (7) 


Calculate the updates for the adaptive predictor filter 
coefficients. 
(18), (1 6), (17), (14), (15) 
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Table 8. CCITT Receiver 


Description CPU 
Clocks 
1. INPUT Read the ADPCM input [I{k)]. 
ADPCM 


Program 
Memory 
(Words) 


COMPUTE Calculate the signal estimate [sg(k)] from the previous 396 
SIGNAL data samples [dg(k)] and reconstructed samples [s,(k)] 
ESTIMATE through the predictor filter. 
(12), (11) 
COMPUTE Calculate speed control [a;(k)] and quantizer scale factor 
ADAPTIVE {y(k)] from past quantizer output [I(k)]. 
QUANTIZER (6), (3) 
COMPUTE Calculate the inverse of the adaptively quantized signal 
QUANTIZED [dg(k)] and the reconstructed signal [s,(k)]. 
DIFFERENCE (10), (13) 
AND RECON- 
STRUCTED 
SIGNAL 
COMPUTE Calculate the updates for the scale-factor adaptation. 33 
SCALE (4), (5) 
FACTOR 
COMPUTE . Calculate the update for the speed-control adaptation. 30 
SPEED (8), (9), (7) 
CONTROL 
COMPUTE Calculate the updates for the adaptive predictor filter 
PREDICTOR coefficients. 
ADAPTATION (18), (16), (17), (14), (15) 
COMPUTE Convert the reconstructed linear-PCM signal [s,(k)] to a 35 
LOG-PCM log-PCM signal [sp(k)]. p-law 
33 
A-law 
SYNCHRON- Calculate an ADPCM signal from the output [s,(k)] and 63 
OUS CODING adjust to create [sg(k)] if it differs from [I(k)]. 
ADJUSTMENT 
2+ 
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Implementation and Advantages of 
TMS32010 Architecture 

Many of the same features are used in the bit- 
compatible implementation as were discussed in the full- 
duplex implementation. Some changes are imperative, since 
performance to the recommended specification requires 
executing certain calculations in a floating-point 
representation. These changes or additions require further 
modifications in order to limit the required amount of 
program memory to the internal memory space of the 
T™MS32010. 

One of the first observed requirements is that the 
processor must be capable of doing either A-law or p-law 
companding and function as either a transmitter or a receiver. 


The burden of determining the mode of operation is 
simplified by selecting one of the four modes from 
information available at the time of reset, and then executing 
from one of the four control loops until the next reset. Each 
loop, therefore, tests the BIO pin to determine when the 
next input sample is ready, rather than depending on the 
hardware interrupt. 

The requirement of selecting either A-law of y-law 
companding also means that a table lookup approach is 
beyond the program memory capacity. The conversion must 
be done algorithmically to reduce the amount of memory. 
Figures 20 and 21 illustrate p-law companding as it is 
implemented in this algorithm. 


XMTMU IN SCRACH, ADC 

EXPNDU LAC SCRACH,8 
XOR KFFOO 
SACL TEMP 1 
AND M32767 
SACH _TEMP2,4 
AND M4095 ; 
ADD BIAS,7 
SACL SCRACH 
LAC TEMP 1 
SACH TEMP! 
LACK  SBASE 
ADD TEMP2, 1 
CALA 
SUB BIAS, 12 
SACH SAMPLE, 4 
LAC SAMPLE ; 
XOR TEMP 1 ; 
SUB TEMP 1 ; 
SACL SAMPLE 

7 : 

SBASE LAC SCRACH,5 
RET 
LAC SCRACH,6 
RET 
LAC SCRACH,7 ; 
RET 
LAC SCRACH,8 
RET 
LAC SCRACH,9 ; 
RET 
LAC SCRACH,10_— ; 
RET 
LAC SCRACH,11 — ; 
RET 
LAC SCRACH,12.— ; 
RET 
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SEEE MMMM 0000 0000 
INVERT FROM TRANSMISSION FORMAT 


SAVE 
OEEE 
SAVE 
0000 
0001 


SIGN = 


CALCULATE PCM SHIFT ADDRESS 


0000000xX 
OOOXXXXX 


POS - DO 
POS -— DO 


00000000 


00000000 


00000000 


00000000 


00000000 


00000000 


00000000 


00000001 


VALUE FOR PCM SIGN 
MMMM 0000 OO00 
EXPONENT VALUE 
MMMM 0000 0000 
MMMM 


1000 0000 


FFFF OR 0000 


XXXXKXXXX XXXXOOO0O OOOCOONON 


KXKXKXKKKXKXK 
NOTHING 
NOTHING 


OOO0001M 


O000001MM 


00001MMM 


0001MMMM 


001MMMM1 


01MMMM10 


1MMMM100 


MMMM1000 


Figure 20. »-Law Expansion Code 
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NEG = 


NEG - 2’ 


MMM10000 
MM100000 
M1000000 
10000000 
00000000 
00000000 
00000000 


00000000 


1’s COMP 


s COMP 


00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 


00000000 


*; COMPRESS--CONVERT TO PCM 


He 
? 


LAC 


CMPRSU SACH 


SCLO23 
SCLO21 


SCALEO 


SCALE 1 


FINI 


CLNUP 


ABS 


SR 


TEMP4 


BIAS 
SCRACH 
ONE ,9 
SCL 427 
THREE, 7 
SCL223 
ONE ,6 
SCALE 1 
M15,1 
SCRACH 
SCRACH 
BIAS 
SAMPLE 
SCRACH, 15 
0,0 
FINI 
M15,2 
SCRACH 
SCRACH 
BIAS, 1 
SAMPLE 
SCRACH, 14 
0,1 
FINI 


SCRACH 
SCRACH 
0,TEMP 1 
TEMP1,4 
TEMP4,7 
M255 
SCRACH 
SAMPLE 
BIAS 
TEMP4 
TEMP4 
SAMPLE 


AQUAN 
SYNC 


M255 
SCRACH 


SCRACH ,DAC 


ee 


we 


@e C@e we 


we we 


se we «we 


we we 


ee we 


. oT) 


ae 


GET RECONSTRUCTED SIGNAL 


SAVE SIGN OF SR 


ADD BIAS 
SAVE BIASED PCM VALUE 
EXP = 7 - 4 OR 3 - O 


EXP = 3 - 2 OR 1 - O 
EXP = 1 OR O 
EXP = 0 


MASK FOR MANTISSA 


BIASED QUANTIZED VALUE 


EXP = 1 
MASK FOR MANTISSA 


BIASED QUANTIZED VALUE 


SAVE NORMALIZED MANTISSA 


ADD EXPONENT 


2°s COMPLEMENT OF MULAW-PCM 
REMOVE BIAS FROM QUANTIZED VALUE 


2°s COMPLEMENT OF QUANTIZED SAMPLE 


FLIP BITS FOR TRANSMISSION 


Figure 21. p»-Law Compression Code 
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The predictor filter implementation is also modified 
from what has been previously presented. In the CCITT 
recommendation, the processing of the signal through the 
predictor filter is performed in a floating-point format. This 
requirement leads to several modifications. First, all input 
signals to the filter, dg(k) and s,(k), must be converted to 
a floating-point notation. The conversion to this notation is 
accomplished by a binary search of the original fixed-point 
word. As previously mentioned, this technique is explained 
in some detail in the application report, FLOATING-POINT 
ARITHMETIC WITH THE TMS32010. Second, the filter 
coefficients, a;(k) and b;(k), must also be floated for each 
sample so that a floating-point multiply can be executed for 
each filter tap. 


Accumulation of the filter taps is carried out in fixed- 
point notation. Fixing a floating-point number is equivalent 
to the scaling presented for taking the anti-log of a number. 
Some of the floating-point results must be left-shifted, while 
others need to be right-shifted. The shift is accomplished by 
use of a scaling factor or multiplier, selected by the exponent 
sum of the floating-point multiply. Positive multipliers are 
used to indicate what is effectively a right shift with the result 
being stored from the high half of the accumulator. Negative 
multipliers indicate that the result is in the low half of the 
accumulator and is used for values which have been left 
shifted. 

The process of a single filter tap, not including the code 
to float the signal and the coefficient, is shown in Figure 22. 


eR NE ES a ee ee ea ee ee ee ee 


SEZ (kK) Bl (k-1)*DQ(k-1) + 


MULTIPLIES ARE DONE IN 


COMPUTE SEZ —--— PARTIAL SIGNAL ESTIMATE 


+ B6(k-1)*DQ(k-6) 


FLOATING POINT 


DQ’s ARE STORED IN FLOATING-POINT NOTATION 


*; B’s ARE FLOATED EACH PASS 
# oe 
ms NOTATION: DQnEXP —-- 4 bits + OFFSET 
*5 DQnNMAN*8 -- 9 bits 
*; Bn -- 16 b TC 3; Ql4 
we SEZ -- 16 b TC 3 QO 
% : 
EST rT rr rrr rrr ret rr rr rrr i rrrt iret itr rr rt Serr t te rrr rr Te 
He 
SIGDIF LAC B6,14 ; COMPUTE B6*DQS5. 
CALL FLOAT 3 RET/W MANTISSA IN TEMP1; EXP IN ACC. 
ADD DQ5EXP 
SACL SUM 1 
LAR O,SUMI1 ; EXP OF PRODUCT. 
LT DQ5MAN ; DQnMAN SCALED BY 2**3. 
LAC THREE,7 ; PRODUCT FUDGE FACTOR (48*8). 
MPY TEMP 1 
LTA *,0 ; BOEMAN* (DQ5MAN*8) +(48*8) 
AND KFF 80 3 SAVE ONLY 8 MSB’S. 
SACL TEMP 1 
MPY TEMP 1 ; APPLY SHIFT FACTOR. 
PAC 
BLZ RS1 3 EXP >= 26 
SACH SUM1,1 5; EXP < 26 
CHK 1 ZALS B6 ; CHECK SIGN OF PRODUCT. 
XOR SDQ6 
AND K32768 
BZ POS] 
NEG! ZAC ; NEGATE IF NECESSARY. 
SUB SUM 1 


Figure 22. Predictor Filter Execution 
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COMPUTE B5*DQ4. 


MAKE POSITIVE BEFORE MASK. 
KEEP LOWER 15 BITS. 
SAVE RESULT. 


Figure 22. Predictor Filter Execution (Concluded) 


SACL SUM 1 
POS 1 LAC B5,14 ; 
RS1 ABS ; 
AND M32767 3; 
SACL SUM 1 3 
B CHK 1 
SUMMARY 


The TMS32010 provides an efficient solution to 
transcoding a 64-kbit/s PCM signal to a 32-kbit/s bit stream. 
Transcoding, as described in this application report, is an 
effective way to maintain the signal quality provided by 7-bit 
PCM while reducing the data rate. 

The basic ADPCM algorithm has been implemented 
in two slightly different ways. One solution provides CCITT 
bit-for-bit compatibility. Using this algorithm, a half-duplex 
transcoder is created that can transcode either A-law or p- 
law signals as either a transmitter or a receiver. No external 
program memory is required for this implementation, 
although it does require the use of a 25-MHz TMS32010 
microprocessor. The second described solution is particularly 
attractive since it uses a single, 20.5-MHz TMS32010 
microprocessor that requires no external program memory 
to perform a real-time full-duplex (non-CCITT) channel 
transcoding. 

In selecting one of these two solutions, the primary 
consideration is the network interfacing requirement. For 
systems that only have analog interfaces to other parts of the 
network, the full-duplex solution will provide the best choice. 
On the other hand, a network that may include a digital 
interface to other ADPCM transcoders will probably require 
the CCITT bit-compatible solution. Both solutions provide 
high-quality signal transcoding. 

A complete assembled code listing is provided in the 
appendix of this report and is also available in 1600-BPI 
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VAX/VMS tape format. The software may be purchased by 
ordering the TMS32010 Software Exchange Library, 
TMDC3240212-18, from Texas Instruments. For further 
information, please contact your nearest TI sales 
representative. 
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Appendix 
ADPCM Assembly Language Programs 
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00S 


OLOCESWL 94) WM WOddV S/Nq4-ZE “LI 


CCITT 


000] 
A000! 
A0002 
A0003 
A0004 
A0005 
A0006 
A0007 
A0008 
AQ009 
A0010 
A001! 
A0NCI2 
A0013 
A0014 
A0QO0O15 
AQ016 
A0017 
A0018 
A0019 
A0020 
A002 1 
A0022 
A0023 
A0024 
A0025 
A0026 
A0027 
A0028 
A0029 
A0030 
A0031 
A0032 
A0033 
A0034 
A0035 
A0036 
A0037 
A0038 
A0039 
A0040 
A0041 
A0042 
A0043 


A0044 
A0045 
A0046 
A0047 
A0048 
A0049 
A0050 


A005 1 


0000 


0000 
ooo! 


0002 
0003 


32010 FAMILY MACRO ASSEMBLER PC2.1 84.107 16:36:03 03-20-85 
PAGE 0001 
COPY INPUT.ASM 
1OT “CCITT” 
OPTION XREF 


0001 
ooo! 
0002 
0000 


F900 
0542 


F900 
0002 


oo ee eS SEL TES SPSS STSCI SCT TT TST TT TT TST TTT Tee Te ee eee 


i ae hee ee eC eRe PSS eS SSCL eS eS CS 2 ee oe 


This is the source module for a half-duplex CCITT 
compatible 32-kbps ADPCM speech system. The transmitter 
assumes that log-PCM data will be available at each 
interrupt (every 125 microseconds) in the lower 8 bits of 
the data bus via !/0 port 1! and it supplies ADPCM data on 


the lower 4 bits of the bus via port 2. The recefver does 
the inverse. An interrupt in this case is determined by 
polling the BIO line, with a low signal level signifying 


the presence of a new data sample. 


The “R’ reset function in the CCITT spec is implemented 
with a hardware reset. At the time of reset, it is 
assumed that the operating mode has been established and 
input via the upper two bits of the data bus. The bit 
condition is read from port 0 so as not to disrupt any 
pending data sample on either of the other two ports. 
Since it is anticipated that the mode pins will be 
selected and maintained in a manner similar to a hardwire 
selection, the actual port from which the mode is read 

is arbitrary. 


i eee ee ee PEP ESS ES SSeS eS SoS 2 2 ee 2 es 


Ce ee a ee ee a a ee a oe 2 ee ae] 
HH te we we ee we ee ee et we ee te we ee ws we we ee 8 ee et ee we we et we we 


System [1/0 channel assignments 

*. 

? 
ADC EQU ! 3 codec input 
DAC EQU t 3 codec output 
CCITT EQU 2 3 adpem output/ input 
CTL EQuU 0 + control input to select mode 
*; + 0000 = mulaw transmitter 
ae 3 4000 = mulaw receiver 
"| + 8000 = alaw transmitter 
*; ; C000 = slaw receiver 
* 
ETE CEE CET TEC TET CET Te eee 

AORG 0 
*; 
B RESET 3 Power-up reset 

*; 
Lal allele Metalic Retetetetetetehetetehel deletet teeta tet ded Add haath eked tht ket E tk tlk 
*"; INTERRUPT HANOLING ROUTINE -~ SYSTEM HANDLES CODEC 
*; SAMPLES ON A SAMPLE BY SAMPLE BASIS. 
5 HW WT MO OE OF HF TE TE OE OF OF Hf OF OF He OF te OOF OF OE eo HE He HO EA HOH nee 
He 
INTRPT B INTRPT 


CCITT 


A0053 
A0054 
A0055 
A0056 
40057 
A0058 
AO0O0S9 
A0060 
A006! 
A0062 
A0063 
A0064 
A0065 
A0066 
A0067 
A0068 
A0069 
A0070 
A0O71 
A0072 
A0073 
A0074 
A0075 
A0076 
A0077 
A0078 
A0079 
A0080 
A008 1 
A0082 
A0083 
A0084 
A0085 
A0086 
A0087 
A0088 
A0089 
A0090 
A0091 
A0092 
A0693 
A0094 
A0095 
A0096 
A0097 
A0098 


A0099 
A0100 
A0101 
A0102 
A0103 


A0104 
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0004 411B 


0005s 
0006 
0007 
0008 
0009 
QOOA 
0008 
oooc 
0000 
OOOE 
OOOF 
0010 
oo! 
0012 
0013 
0014 
oo15 
0016 
0017 


0018 
0019 
OO1A 
0018 
ooIc 
001D 
OOIE 
OOIF 
0020 
0021 
0022 
0023 


2818 
7875 
5021 
7974 
5C22 
7972 
0740 
5018 
2021 
5821 
TE24 
0122 
7F8C 
1C4D 
5C26 
2026 
782) 
1021 
5026 


F800 
0183 
F800 
O2AA 
5001 
4A01 
F800 
0355 
F600 
0004 
F900 
0020 


16:36:03 03-20-85 
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PC2.1 84.107 


MU-LAW TRANSMITTER 


IN SCRACH,AODC ; 


Xv ws 
o 
=x 
Cc 


input mu-ltaw PCM 


oe RHHEKHKHEHHHHHRHHHHERHEEHHKRHHHHRHRHEHEKRHHRHH HEHEHE KRHA Henaen a 


MU-LAW TO LINEAR PCM EXPANSION 


INPUT: MU-LAW PCM SAMPLE -- S  (SCRACH) 
OUTPUT: LINEAR PCM SAMPLE -- SL (SAMPLE) 
NOTATION: S -- 8b SM (Q4) % 


SL -- 14b TC (QO) 


90 46 Oe OE TE OF HE OE OF OE Ob OE Oe SE 4 Ot Oe OE HE ME aE OF Ot Ot Ot Ob OE OF Ot OF OF OF OE UF ME OF te Oe Oe Oe me te ee Oe ee Oe te 


o 1 a 3 0 te Ub OE OE Mt Ot UF OE OE FE Ot Oe OE OF Ot Ot Ot OF OE Oe ae OE aE Ot Ot Mb OE Oe ae OE OE ee oe Ot te ae ee te oe te te 


se & © ke we RE eK Ke Ee eee EE EK KE 


. 
’ 
* 
. 
* 
. 
% 
. 
° 
. 
, 
° 
’ 
. 
’ 
. 
’ 
. 
, 
. 
, 
. 
, 
* 
, 
* 
, 
° 
, 
’ 
’ 
. 
? 


EXPNDU LAC SCRACH,8 ; seee mmmm 0000 0000 
XOR KFFOO ; SEEE MMMM O000 0000 
SACL TEMP! ; save value for PCM sign 
AND M32767 ; OEEE MMMM 0000 0000 
SACH TEMP2,4 ; save exponent value 
AND M4095 ; 0000 MMMM 0000 0000 
ADD BIAS,7 ; 0001 MMMM 1000 0000 
SACL SCRACH 
LAC TEMP}! 
SACH TEMP 1 +; sign = FFFF or 0000 
LACK SBASE 
ADD TEMP2,1 + calculate PCM shift address 
CALA 
SUB BIAS,12 3; GOOOOOO0X XXXXXXXX XXXxX0C000 00000000 
SACH SAMPLE ,4 
LAC SAMPLE 3 OOOX XXXX XXXX XXXX 
XOR TEMP 1 $ Pos - no change : neg - 1’s comp} 
SUB TEMP 1 ; pos —- no change : neg - 2’s comp! 
SACL SAMPLE 
He . 
*:; Now convert PCM vatue fn SAMPLE to ADPCM vatue in 1 
He 
GETI CALL SIGDIF 
CALL AQUAN 
SACL I 
OUT I,CCITT ; output ADPCM 
CALL PRDICT 
MULAWX BIOZ XMTMU 3; wait for next sample 
B MUL AWX 


OLOZESWLL OM YUM WOddV S/Alqy-ZEe “LI 


10s 


CCITT 


A0105 
A0106 
A0107 
A0108 
A0109 
A0110 
AO!11 

AOI12 
A0113 
AO114 
AO115 
A0116 
AO1L17 
A0118 
AO1L19 
A0120 
A01i21 


0024 
0025 
0026 
0027 
0028 
0029 
002A 
0028 
002Cc 
0020 
002E 
002F 
0030 
0031 
0032 
0033 
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2518 
7F80 
2618 
7F8D 
2718 
7F80 
2818 
7F80 
2918 
7F8D0 
2A18 
7F80 
2818 
7F80 
2ciB 
7F80 


Re 
? 


SBASE LAC 


SCRACH,5 
SCRACH,6 
SCRACH,7 
SCRACH,8 
SCRACH,9 
SCRACH, 10 
SCRACH, 11 


SCRACH, 12 


PC2.1 84.1 


00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 


00000001 


07 16: 


0000001M 
OO0001MM 
00001MMM 
0001MMMM 
001MMMM1 
O1MMMM1O 
1MMMM100 


MMMM1000 
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MMM10000 
MM100000 
M!1000000 
10000000 
00000000 
00000000 
00000000 


00000000 


00000000 
00000000 
00000000 
00000000 
00000000 
00000000: 
00000000 


00000000 


CCITT 


A0123 
A0124 
A0125 
A0126 
A0127 
A0128 
A0129 
A0130 
A0131 
A0132 
A0133 
AQ\34 
A0135 
A0136 
A0137 
A0138 
A0139 
A0140 
A014! 
A0142 
A0143 
A0144 
AQ145 
A0146 
AQ147 
A0148 
A0149 
A0150 
AO151 
A0152 
A0IL53 
A0154 
AO155S 
A0156 
A0Q157 
A0158 
A0159 
A0160 
A0161 
A0162 
A0163 
A0164 
A0165 
AN 166 
A0167 


A0168 
A0169 
A0170 
A0O1!71 
A0172 
A0173 


A0174 
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ms 
*; A-LAW TRANSMITTER 
0034 411B XMTA IN SCRACH, ADC 3 {nput A-law PCM 
POPU PPePeIerrrrrrrrrrr Tre rrrrrr err rirrrrrrr rrr TTT errr irri r ys 
*; A-LAW TO LINEAR PCM EXPANSION 
Me 
"3 INPUT: A-LAW PCM SAMPLE -- S_ (SCRACH) 
Be 
“3 OUTPUT: LINEAR PCM SAMPLE -- SL (SAMPLE) 
Re 
*; NOTATION: S -— 8b SM (Q4) 
“3 SL -- 14b TC (QO) 
He 
POPU PPeeerirerr rr errrrr rere rereer rrr rerrrrrrr rrr rere r irri y 
Re 
“; Ss +-------- + SL 
tara >! EXPAND |------- > 
ws tore + 
*; 
SERS SEA ARES ERLE AER ESAS ERA EE AS ORE AER ERE OOOH LARS Oe 
. 
0035 281B EXPNDA LAC SCRACH,8 3; sEEE MMMM 0000 0000 
0036 7848 XOR K32768 ; SEEE MMMM 0000 0000 
0037 5021 SACL TEMPI ; save value for PCM sifgn 
0038 7974 ANO M32767 ; OEEE MMMM 0000 0000 
0039 5C22 SACH TEMP2,4 3; Save exponent value 
003A 7972 AND M4095 ; 0000 MMMM 0000 0000 
0038 501B SACL SCRACH 
003C 2021 LAC TEMP! 
0030 5821 SACH TEMP1 + sign = FFFF or 0000 
QOO3E 7ES52 LACK SBASEA 
O0O3F 0222 ADD TEMP2,2 3; calculate PCM shift address 
0040 7F8C CALA 
0041 5C26 SACH SAMPLE, 4 
0042 2026 LAC SAMPLE 3; OOOX XXXX XXKX XXKXK 
0043 7821 XOR TEMP 1 + pos - no change : neg - 1’S comp! 
0044 1021 SUB TEMP 1 $ pos —- no change : neg - 2’s comp! 
0045 5026 SACL SAMPLE 
* 
baht 
*; Now convert PCM value in SAMPLE to ADPCM value in I 
Be 
0046 F800 CALL SIGOIF 
0047 01863 
0048 F800 CALL AQUAN 
0049 O2AA 
OO4A 5001 SACL 1 
004B 4A01 OUT 1,CCITT 3 output ADPCM 
004C F800 CALL PROICT 
0040 0355 
OO4E F600 ALAWX BI0Z XMTA ; wait -for next sample 
OO4F 0034 
0050 F900 B ALAWX 
0051 OO4E 
He 


7OS 


OLOZESWLL OW YUM WOddv S/lQy-ze “LT 


CCITT 


A0175 
A0176 
AQ!77 
A0178 
A0179 
A0180 
A0181 

A0!82 
A0183 
A0184 
A0185 
A0186 
A0187 
A0188 
A0189 
A0190 
A0191 

A0192 
A0193 
A0194 
A0195 
A0196 
A0197 
A0198 
A0199 
A0200 
A0201 

A0202 

A0203 
A0204 
A0205 


0052 
0053 
0054 
0055 
0056 
0057 
0058 
0059 
OO5A 
0058 
oosc 
0050 
OO5E 
OO5F 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 
0068 
0069 
OO6A 
0068 
006C 
0060 
OO6E 
OO6F 
0070 
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261B SBASEA LAC 


004C ADO 
7F80 RET 
7F80 NOP 
2618 LAC 
0671 AOD 
7F80 RET 
7F80 NOP 
2718 LAC 
0771 AOD 
7F80 RET 
7F80 NOP 
2818 LAC 
0871 ADO 
7F80 RET 
7F80 NOP 
2918 LAC 
0971 ADO 
7F 80 RET 
7F80 NOP 
2A18 LAC 
OA71 AOO 
7F 8D RET 
7F80 NOP 
2816 LAC 
0871 AOD 
7F80 RET 
7F80 NOP 
2C18 LAC 
0c71 ADD 
7F8D RET 


SCRACH,6 
ONE ,!3 


SCRACH,6 
BIASA,6 


SCRACH,7 
BIASA,7 


SCRACH,8 
BIASA,8 


SCRACH,9 
BIASA,9 


SCRACH, 10 
BIASA,10 


SCRACH, I 1 
BIASA,}1 


SCRACH, 12 
BIASA, 12 


, 
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00000000 
00000000 


00000000 
00600000 


00000000 
00000000 


00000000 
00000000 


00000000 
00000000 


00000000 
00000000 


00000000 
00000000 


00000000 
00000001 


000000MM 
000000MM 


000001MM 
000001MM 


00000MMM 
00001MMM 


0000MMMM 
0001MMMM 


OO0OMMMMO 
001MMMM! 


OOMMMMOO 
01MMMMIO 


OMMMMOO0O 
1MMMM100 


MMMMO0O00 
MMMM1000 
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MMO00000 
MM100000 


MM100000 
MM100000 


m00o0000 
M1000000 


00000000 
10000000 


00000000 
00000000 


00000000 
00000000 


00000000 
00000000 


00000000 
00000000 


06000000 
00000000 


00000000 
00000000 


00000000: 
00000000 


00000000 
00000000 


00000000 
00000000 


00000000 
00000000 


00000000 
00000000 


00000000 
00000000 


CCITT 


A0207 
A0208 
A0209 
A0210 
A02Z11 
A0212 
A021!3 
A021 4 
A0215 


A0216 
A0217 
A0218 
A0219 
A0220 
AQ221 
A0222 


A0223 


A02Z24 
A0225 
A0226 
A0227 
A0228 
A0229 
A0230 
A0231 

A0232 
A0233 
A0234 
A0235 
A0236 
A0237 
A0238 
A0239 
A0240 
A0241 

A0242 
A0243 
A0244 
A0N245 
A0246 
A0247 
A0248 
A0249 
A0250 
A0251 
A0252 
A0253 
A0254 
A0255 
A0256 


A0257 
A0258 


0071 


0072 
0073 
0074 
0075 
0076 
0077 
0078 
0079 


OO7A 
007B 
o0o7C 
0070 


OO7E 


OO7F 
0080 
0081 
00862 
0083 
0084 
0085 
0086 
0087 
0088 
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*; 
*; MU-LAW RECEIVER 
Re 
4201 RCVMU IN 1,CCITT ; Input ADPCM 
@. 
200! LAC I 3; determine magnitude of ADPCM 
5002 SACL IM 
134C SUB ONE ,3 
FAOO BLZ DO32KU 
OO7TA 
2002 LAC IM 
7860 XOR M15 
5002 SACL IM 
Be 
*; compute pcm output 
Re 
F800 DO32KU CALL SIGDIF 
0183 
F800 CALL PROICT 
0355 
*; 
ha a la a Dh a a lb a lt a ic el le a Rl dhl tad 
*.; LINEAR TO U-LAW PCM COMPRESSION/U-LAW TO LINEAR EXPANSION 
Be 
*; INPUT: LINEAR PCM SAMPLE -—-- SR 
a. 
*; OUTPUT: A~LAW PCM SAMPLE -- SP  (SCRACH) 
*; LINEAR PCM SAMPLE -- SLX (SAMPLE) 
Re 
"3 NOTATION: SR -- 166 TC (QO) 
ais SP -- 8b SM (Q4) 
"3 SLX -- 14b TC (QO) 
He 
Pere errr rrr rrrrrrrr irri rrr rier rrr rrr 
eo 
, 
*; SR $-~-------- + SP tenn + SLX 
®, 00 wanna >! COMPRESS {----- *------ >! EXPAND }------- > 
*; +---------- + H +—-------- + 
"5 ! sP 
ms Balan tee eaeaentententeatententeeanietal > 
a 
a Nek a Na Nae a a ha aR a Nah a aaa a a ate Rielle 
Me 
2013 LAC SR 3 get reconstructed signal 
Re 
*"; compress-—-convert to pcm 
Re 
5824 CMPRSU SACH TEMP4 3; save sign of 3R 
7F88 ABS 
0040 ADD BIAS ; add bias 
5018 SACL SCRACH ; save biased PCM value 
194C SUB ONE ,9 ; exp = 7 - 4o0r3-0 
FoOoOo BGEZ SCL427 
0083 
0770 SCLO23 AOD THREE ,7 3; exp = 3 - 2 or 1 - O 
FoOoO BGEZ SCL223 
OO9E 


OLOZESWLL 2M YUM WOddV S/¥Q4-ZE “LI 


€0S 


CCITT 


A0259 
A0260 


A0261 
A0262 
A0263 
A0264 
A0265 
A0266 
A0267 
A0268 


A0269 
A0270 
A0271 
A0272 
A0273 
A0274 
A0275 
A0276 


A0277 
A0278 


A0279 
A0280 
A0281 
A0282 
A0283 
A0284 
A0285 
A0286 


A0287 
A0288 
A0289 
Ao290 
A0291 
A0292 
A0293 
A0294 


A0295 
A0296 


A0297 
A0298 


A0299 
A0300 
A0301 
A0302 
A0303 
A0304 
A0305 
A0306 
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064C 
FOOoO 
0035 
2160 
7918 
5018 
0040 
5026 
2F 18 
7000 
F900 
OOE6 
2260 
7918 
5018 
014D 
5026 
2E18 
7001 
F900 
OOE6 
174C 
FDOO 
OOAA 
2360 
7918 
5018 
0240 
5026 
2018 
7002 
F900 
QOES 
2460 
791B 
5018 
0340 
5026 
2C18 


7003 


F900 
OOE6 
1970 
FDO0O 
00cB 
OA4C 
FOOO 
00Cc2 
2560 
7918 
5018 
0440 
5026 
2818 
7004 
F900 
O0E6 


sclo2l 


SCALEO 


SCALE 1 


SCL223 


SCALE2 


SCALE3 


SCL427 


SCL425 


SCALE4 


ADD 
BGEZ 


LAC 
ANO 
SACL 
ADD 
SACL 
LAC 
LARK 
8 


LAC 
AND 
SACL 
ADD 
SACL 
LAC 
LARK 
B 


SUB 
BGEZ 


LAC 
AND 
SACL 
ADD 
SACL 
LAC 
LARK 
8 


LAC 
AND 
SACL 
ADD 
SACL 
LAC 
LARK 
8B 


SUB 
BGEZ 


AOD 
BGEZ 


LAC 
AND 
SACL 
AOD 
SACL 
LAC 
LARK 
8 


ONE ,6 
SCALE lt 


M15,1 
SCRACH 
SCRACH 
BIAS 
SAMPLE 
SCRACH, 15 
0,0 

FINI 


M15,2 
SCRACH 
SCRACH 
BIAS, 1 
SAMPLE 
SCRACH,14 
0,1 

FINI 


ONE ,7 
SCALE3 


M15,3 
SCRACH 
SCRACH 
BIAS,2 
SAMPLE 
SCRACH, 13 
0,2 

FINI 


M15,4 
SCRACH 
SCRACH 
BIAS,3 
SAMPLE 
SCRACH, 12 
0,3 

FINI 


THREE ,9 
SCL627 


ONE, 10 
SCALES 


M15,5 
SCRACH 
SCRACH 
BIAS,4 
SAMPLE 
SCRACH, 11 
0,4 

FINI 
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PAGE 0007 
; exp = | or O 
; exp = 0 


oo ee 


oo we 


a 


~ 


mask for mantissa 


biased quantized value 


exp = 1 


mask for mantissa 


biased quantized value 


exp = 3 or 2 


exp = 2 
mask for mantissa 


biased quantized value 


exp = 3 


mask for mantissa 


biased quantized value 


exp = 7 - 6o0r5 - 4 
exp = 5 or 4 
exp = 4 


mask for mantissa 


biased quantized value 


CCITT 


A0307 
A0308 
A0309 
A0310 
AO311 
A0312 
A0313 
A0314 


A0315 
A0316 


A0317 
A0318 
A0319 
A0320 
A0321 
A0322 
A0323 
A0324 


A0325 
A0326 


A0327 
A0328 
A0329 
A0330 


A0331 
A0332 
A0333 
A0334 
A0335 
A0336 
A0337 
A0338 
A0339 
A0340 
A0341 
A0342 
A0343 
A0344 
A0345 
A0346 
A0347 
A0348 
A0349 
A0350 
A0351 


A0352 
A0353 


A0354 
A0355 
A0356 


o0c2 
00c3 
00c4 
00cs 
oocé 
0oc7 
0oc8 
o0o0c9 
OOCA 
00cB 
oocc 
00cod 
Ooce 
OOCF 
ooD0 
0001 
o0d2 
00D3 
0004 
00D5 
0006 
0007 
0008 
0009 
OODA 
0008 
ooDC 
0000 
OODE 
OOOF 
QOEO 
OOE 1 
OOE2 
OOE3 
QO0E4 
OoE5 
OOE6 
O0E7 
ooes 
QOE9 
OOEA 
Oo0EB 
OOEC 
OOED 
OOEE 
OOEF 
OOFO 
OOF 1 


ooF2 
OOF3 


OOF 4 
OOFS 


OOF 6 
OOF7 
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266D 
7T91B 
5018 
0540 
5026 
2A1B 
7005 
F900 
OOE6 
1B84C 
FOOO 
0007 
2760 
791B 
501B 
064D 
5026 
2918 
7006 
F900 
OOE6 
1C4C 
FAOO 
OOOF 
2767 
5026 
TETF 
F900 
OOEA 
286D 
791B 
501B 
0740 
5026 
2818 
7007 
5818 
2018 
3021 
0421 
0724 
7947 
5016 
2026 
1040 
7824 
1024 
5026 


F800 
O2AA 


F800 
0188 


71847 
501B 


SCALES 


SCL627 


SCALE6 


SCALE7 


SATCH 


NORMAL 


FINI 


CLNUP 


LAC 
ANDO 
SACL 
ADD 
SACL 
LAC 
LARK 
B 


SUB 
BGEZ 


LAC 
AND 
SACL 
ADO 
SACL 
LAC 
LARK 
B 


SUB 
BLZ 


LAC 
SACL 
LACK 
B 


LAC 
ANO 
SACL 
AOD 
SACL 
LAC 
LARK 
SACH 
LAC 
SAR 
AOD 
ADO 
AND 
SACL 
LAC 
SUB 
xOR 
SuB 
SACL 


CALL 


CALL 


XOR 
SACL 


M15,6 
SCRACH 
SCRACH 
BIAS,5 
SAMPLE 
SCRACH, 10 
0.5 

FINI 


ONE, 11 
SCALE7 


M15,7 
SCRACH 
SCRACH 
BIAS,6 
SAMPLE 
SCRACH,9 
0.6 

FINI 


ONE, 12 
NORMAL 


K63,7 
SAMPLE 
127 
CLNUP 


MI5,8 
SCRACH 
SCRACH 
BIAS,7 
SAMPLE 
SCRACH,8 
0,7 
SCRACH 
SCRACH 
0,TEMPI 
TEMP1,4 
TEMP4,7 
M255 
SCRACH 
SAMPLE 
BIAS 
TEMP4 
TEMP4 
SAMPLE 


AQUAN 


SYNC 


M255 
SCRACH 


PC2.1 84.107 


we ee 


wo we 


m0 we o “ oo <0 we ee ee 


Byy 


16:36:03 03-20-85 
PAGE 0008 


exp = 5 


mask for mantissa 


biased quantized value 


exp = 7 


exp = 6 
mask for mantissa 


biased quantized value 


exp = 7 


mag > 8191 ? 


save max biased quantized value 
set maximum mulaw magnitude 


mask for mantissa 


biased quantized value 


save normalized mantissa 


add exponent 


signed magnitude of mulaw-PCM 
remove bias from quantized value 


2’s complement of quantized sample 


flip bits for transmission 


vos 
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PAGE 0009 PAGE 0010 
A0357 OOF8 4918 OUT SCRACH,DAC ; output mu-law PCM A0361 "3 
A0358 OOF9 F600 MULAWR BIOZ RCVMU 3; wait for next sample A0362 *; A-LAW RECEIVER 
OOFA 0071 A0363 *; 
A0359 OOFB F900 B MULAWR A0QN364 OOFD 4201 RCVA IN 1,CCITT 3 input ADPCM 
OOFC OOFS A0365 *s 
A0366 OOFE 2001 LAC i rans ; determine magnitude of ADPCM 
AOQ367 OOFF 5002 SACL IM 
A0368 0100 134C SUB ONE ,3 
A0369 010! FAOQO BLZ DO32KA 
0102 0106 
A0370 0103 2002 LAC IM 
A0371 0104 786D XOR MI5 
A0372 0105 5002 SACL IM 
A0373 ip 
A0374 *; compute pcm output 
A0375 a 
A0376 0106 F800 OO32KA CALL SIGDIF 
0107 01B3 
A0377 0108 F800 CALL PROICT 
0109 0355 
A0378 at 
A0379 SPER OSA AEE ARRON EAE EES EEE EE EE EER a Re 
A0380 *; LINEAR TO A-LAW PCM COMPRESSION/A-LAW TO LINEAR EXPANSION 
A038 1 "s 
A0382 7s INPUT: LINEAR PCM SAMPLE -- SR 
A0383 ia 
A0384 "3 OUTPUT: A-LAW PCM SAMPLE ~- SP (SCRACH) 
A0385 a5 LINEAR PCM SAMPLE -—-- SLX (SAMPLE) 
A0386 *; 
A0387 3 NOTATION: SR -- 165 TC (QO) 
A0Q388 me SP -- 8b SM (Q4) 
A0389 ws SLX -- 14b TC (QO) 
A0390 *; 
AO0391 eg OTH OF OE Oe OS OE ee Oe OO Oe ee Ot 
A0392 Hs 
A0393 he SR +---------- + SP +-~—~------ + SLX 
A0394 #300 see >! COMPRESS | ----- *—---~+-— >! EXPAND |-----~-- > 
A0395 "3 eS arts + ' tte + 
A0396 ai: H SP 
AQ397 #35 toss os Sc se > 
A0398 bate 
A0399 a a Ra i dl et le cli Ni Rh i aE ia li ial lel Si Bh dea ia lh ede Eh eid 
A0400 ia 
A040! O10A 2013 LAC SR 3; get reconstructed signal 
A0402 bat 
A0403 *; compress--convert to pcm 
A0404 *; 
A0405 0108 5824 CMPRSA SACH TEMP4 ; save sign of SR 
A0406 010C 7F88 ABS 
AQ407 0100 0024 ADD TEMP4 ; add 1 for negative vals 
A0408 O10E 501B SACL SCRACH ; save PCM value 
A0409 OLOF 194C SUB ONE ,9 3; exp = 7 - 4o0r3-0 
A0410 0110 FDOO BGEZ SCL4T7 
Oll1 O13F 
A04!1 0112 0770 SCLOT3 ADD THREE ,7 3; exp = 3 - 2 or !- 0 
A0412 0113 FDOO BGEZ SCL2T3 
OPES 
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PAGE 0011 PAGE 0012 
A0413 0115 064C SCLOT1 AOD ONE .6 ; exp = | or 0 A0461 OI1L4E 2660 SCALS5A LAC M15,6 ; exp = 5 
A0414 0116 FDOO BGEZ SCALIA A0462 O14F 7918 AND SCRACH 3; mask for mantissa 
0117 O121 : A0463 0150 501B SACL SCRACH 
AO415 0118 226D SCALOA LAC M15,2 ; exp = 0 A0464 0151 0540 ADO BIAS,5 
AO416 0119 7918 AND SCRACH : mask for mantissa A0465 0152 5026 SACL SAMPLE 3; Quantized value 
AO4SI7 GLIA 5S01B SACL SCRACH AQ466 0153 2A18 LAC SCRACH,10 
A0418 011B O14C ADD ONE, 1 A0467 0154 7005 LARK 0,5 
AO0419 O11C 5026 SACL SAMPLE 3; Quantized value A0468 0155 F900 B FINISH 
AG420 0110 2E18 LAC SCRACH, 14 0156 0172 
A042! O11€ 7000 LARK 0,0 A0469 0157 184C SCL6T7 SUB ONE, 11 3; exp = 7 or 6 
AQ422 OLIF F900 B FINISH A0470 0158 FOOO BGEZ SCAL7A 
0120 0172 0159 0163 
AQ423 0121 226D SCALIA LAC M15,2 3; exp = I A0471 015A 2760 SCAL6A LAC Mi5,7 ; exp = 6 
A0N424 0122 7918 ANDO SCRACH 3; mask for mantissa A0472 O15B 7918 AND SCRACH ; mask for mantissa 
AQ425 0123 501B SACL SCRACH A0473 015C 5018 SACL SCRACH 
A0426 0124 0140 ADD BIAS, 1 AQ474 0150 064D ADD BIAS,6 
AQ427 0125 5026 SACL SAMPLE +; Quantized value A0475 O1S5E 5026 SACL SAMPLE 3; Quantized value 
AQ428 0126 2E1B LAC SCRACH, 14 A0476 OISF 291B LAC SCRACH,9 
A0429 0127 7001 LARK 0,1 A0477 0160 7006 LARK 0,6 
A0430 0128 F900 B FINISH A0478 0:61 F900 B FINISH 
0129 O172 0162 0172 
A0431 012A 174C SCL2T3 SUB ONE , 7 ; exp = 3 or 2 A0479 0163 1C4C SCAL7A SUB ONE ,12 ; exp = 7 
AG432 0128 FDOO BGEZ SCAL3A AQ480 0164 FAOO BLZ NORMLA 3; mag > BI91 ? 
012C 0136 0165 016B 
AQ433 012D 2360 SCAL2A LAC “15,3 3; exp = 2 A0481 0166 2767 SATCHA LAC K63,7 
A0434 O12E 7918 AND SCRACH ; mask for mantissa A0482 0167 5026 SACL SAMPLE 3; save maximum quantized value 
A0435 O12F 5018 SACL SCRACH A0483 0168 TJE7F LACK 127 3; save maximum alaw magnitude 
AQ436 0130 0240 ADD BIAS,2 A0484 0169 F900 B CLNUPA 
A0437 0131 5026 SACL SAMPLE 3 Quantized value O16A 0176 
A0438 0132 2018 LAC SCRACH, 13 A0485 016B 2860 NORMLA LAC M15,8 
A0N439 0133 7002 LARK 0,2 AQ486 0!16C 7918 ANO SCRACH + mask for mantissa 
A0440 0134 F900 B FINISH A0487 0160 501B SACL SCRACH 
0135 0172 A0488 O16E 0740 ADO BIAS,7 
A044! 0136 2460 SCAL3A LAC M15,4 : exp = 3 A0489 OL6F 5026 SACL SAMPLE 3 Quanitzed value 
AQ442 0137 791B AND SCRACH : mask for mantissa A0490 0170 2818 LAC SCRACH,8 
A0443 0138 5018 SACL SCRACH A0491 0171 7007 LARK 0,7 
A0444 0139 0340 ADD BIAS,3 A0492 0172 581B FINISH SACH SCRACH 3 save normalized mantissa 
A0445 013A 5026 SACL SAMPLE $ quantized value A0493 0173 2018 LAC SCRACH 
A0Q0446 0138 2CiB LAC SCRACH, 12 A0494 0174 3021 SAR 0,TEMP1 
AQ447 013C 7003 LARK 0,3 A0495 0175 0421 AOD TEMPI,4 3; add exponent 
A0448 0130 F900 B FINISH A0496 0176 0724 CLNUPA ADD TEMP4,7 
O013E€ 0172 AQ497 0177 7947 ANDO M255 
AQ449 O13F 1970 SCL4T7 SUB THREE .9 ; exp = 7 - 6 or 5 - 4 A0498 0178 501B SACL SCRACH ; signed magnitude of alaw-PCM 
AQ450 0140 FDOO BGEZ SCL6T7 A0499 0179 2026 LAC SAMPLE 
0141 0157 AOQS500 OI7A 7824 XOR TEMP4 
A0451 0142 OA4C SCL4TS ADD ONE ,10 3; exp = 5 or 4 AOS501 01786 1024 suB TEMP4 
A0452 0143 FOOO BGEZ SCALSA AQ502 O17C 5026 SACL SAMPLE 3; 2’°s complement of quantized sample 
0144 O14E A0503 "5 
A0453 0145 2560 SCAL4A LAC M15,5 3; exp = 4 A0504 017D F800 CALL AQUAN 
A0454 0146 7918 AND SCRACH 3; mask for mantissa O17E O2ZAA 
A0455 0147 501B SACL SCRACH A0505 *3 
A0456 0148 0440 AOD BIAS,4 A0SO6 O|7F F800 CALL SYNC 
A0457 0149 5026 SACL SAMPLE + Quantized value 0180 0188 
AQ458 O14A 2B!1B LAC SCRACH, i} A0507 wis 
A0459 0148 7004 LARK 0,4 A0508 0181 787F XOR M0080 : flip bits for transmission 
A0460 014C F900 B FINISH A0509 0182 5S0I1B SACL SCRACH 
0140 0172 A0OSIO 0183 4918 OUT SCRACH,DAC ; output A-law PCM 


90S 
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A0511 0184 F600 
0185 OOFD 
A0512 0186 F900 
0187 0184 


ALAWR BIOZ 


B 
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PAGE 0013 


wait for next sample 
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AO514 
AOSI5 
AOS16 
AOS1I7 
A0518 
A0519 
A0520 
A0521 
A0522 
A0523 
A0524 
A0525 
A0526 
A0S27 
A0528 
A0529 
A0530 
A0531 
A0532 
A0533 
A0534 
A0535 
AQ0536 
A0537 
A0538 
A0539 
A0N540 
A0541 
A0542 
A0543 
A0S44 
A0545 
A0546 


A0S47 


A0548 
A0549 
A0550 


A055 1 


A0552 
A0553 
A0554 
A0555 
A0556 
A0557 


A0558 
A0559 
A0560 
A0561 
A0S62 


A0563 
A0564 


0188 
0189 
O18A 
0188 
018C 
018D 
O18E 
018F 
0190 
0191 
0192 
0193 
0194 
0195 
0196 
0197 
0198 
0199 
O19A 
0198 
019C 
019D 
O19E 
O19F 
01A0 
O1A1 
01A2 
O1A3 
O1A4 
OLAS 
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PAGE 0014 
ba 
He ERE EHRHA HEE ER EERE ENE EEO EER EE ee ee ee 
*; SYNCHRONOUS CODING ADJUSTMENT 
INPUT: LOG PCM SAMPLE -- SP  (SCRACH) 
RECEIVED ADPCM -~ I 
REGENERATED ADPCM -- ID (TEMP1) 
QUTPUT: ADJUSTED LOG PCM -- SO (SCRACH) 
NOTATION: 1 -- 46 SM (QO) 
ID -- 4b SM (QO) 
SP -- 8b SM (Q4) 
SO -- 8b SM (Q4) 


7838 
5021 
234C 
7801 
1021 
FAOO 
O1A0 
FFOO 
0181 
2018 
106F 
FCOO 
0198 
FFOO 
0199 
074C 
7F8D 
TETF 
7F80 
104C 
FFOO 
019F 
OO6F 
7F8D 
2018 
174C 
FOOO 
O1A8 
OO6F 
FAOO 


HO 40 Ob OO SE ME Ot HF AE Oe OF OE OF OE SE Oe AE Ht OO OE Oe Ot OE TE OF SE He Ot dE Ot Ob Ob OO OE OE HE te aE HE OF Ot oe Ot OE OE OE OE OE OE a te ot te te te ot te 


z2*e ee @ & ©& B&B Be HR KEE EE HE KEE RE HR HE 


Ke ee we we we we we we we ee we we we we we ewe we we we we te oe we 


+------ + 
a a pads a H 
1D H H so 
wooo >| SYNC |--------> 
SP ' H 
er eae pa H 
+------ + 
HAHRRHEHHHEHREHRHRHHRHHHRHHHEHHHHREHEHHHEEHRHEHAHRHHRHRHEEHHE EHH HH HH EE 
SYNC XOR EIGHT 
SACL TEMP 1 ; flip the polarity bit in ID 
LAC ONE ,3 
XOR I 3; flip the polarity bit in I 
SUB TEMP 1 ; IM - ID 
BLZ IDGTIM :; 10 > IM... 
BZ IDEQIM ; ID = IM 
IDLTIM LAC SCRACH >; ID < IM ... + 
SUB M127 
BGZ SUBONE 
BZ MAXPOS 
ADD ONE ,7 3; SD = SP + 1 QO <= SP < 127 
RET 
MAXPOS LACK 127 ; SD = 127 : SP = 127 
RET 
SUBONE SUB ONE 
BZ ANOMLE ; SD = 0 : SP = 128 
ADD M127 ; SD = SP + 1 : 255 >= SP > 128 
ANOMLE RET 
IOGTIM LAC SCRACH 
SUB ONE ,7 
BGEZ ADDONE 
ADD M127 ; SD = SP - 1: 0 ¢< SP ¢= 127 
BLZ ANOMLY 
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PAGE 0015 PAGE 0016 
O1A6 OLAF 0002 COPY SIGDIF.ASM 
A0565 O1A7 7F8D RET 80001 #5 
A0566 O1A8 1O6F ADDONE SUB MIL27 B0002 HHH HHHHHHHHKHHHKHHHHEEHH HH HHHHH LEAH HHEHHRHHHH KKH H HOH HH 
A0567 OLAS FDOO BGEZ MAXNEG BOO003 *; SIGDIF 
OLAA O1AD Bo004 "3 
“A0S68 01AB 084C ADO ONE ,8 ; SD = SP - 1 : 255 > SP >= 128 BoOo05 *; Implements the following modules (per CCITT spec): 
A0569 OI1AC 7F80 RET Bo006 *; 
A0570 O1AD 7VEFF MAXNEG LACK 255 ; SO = 255 : SP = 255 B0007 as DELAY D -- delay of 0Q and SR derivatives 
AO571 OLAE 7F80 RET B0008 "; FMULT -- Bn * DQn, An * SRn 
A0572 OLAF 7VE80 ANOMLY LACK 128 ; SO = 128 : SP = 0 BOO009 ws DELAY A -~ (implicft in use of last frames data) 
A0573 0!B0 7F8D RET Beant baat ACCUM -- Accumulate partial products for SEZ, SE 
A0574 0181 201B IDEQIM LAC SCRACH ; SO = SP 0011 me 
A0575 0182 7F80 RET B0012 "3 LIMA -- compute AL(k) 
BOO0!3 *; MIX -- compute Y(k) 
Bo014 "3 
B0015 be i a a a ia a a i i a aN alt a ial hall 
BOO16 ms 
BOoO!7 *; compute SEZ-- partial signal estimate 
B0018 "3 
Bo019 ®; SEZ(k) = B1(kK-1)*DQ(k-1) + ... + B6(k—-1)*0Q(k~-6) 
B0020 "5 
BO0021 "5 Multiplfies are done in floating pt 
BOO022 bolt) DQ’s are stored in f.p. notation 
B0023 be B’s are floated each pass 
B0024 bale 
B0025 Be a eh i eR cB aN tla ac el ih hin aR ial a th aad ae iets teak 
BO026 *; FLOATING POINT MULTIPLY (FMULT) 
B0027 *; 
B0028 bast INPUT: QUANTIZED DIFFERENCE -- DQn (DQNEXP/DQnMAN) 
g80029 bet PREDICTOR COEFFICIENTS -- Bn 
BO030 beet 
BoO03 1 ba OUTPUT: FILTER TAP OUTPUTS -- WBn (SUMn) 
B0032 "; 
BO033 a NOTATION: DQnEXP -- 46 + offset 
B0034 "3 DQnMAN*8 -~- 9b magnitude 
BOO35 baler Bn -- 16b TC (Q14) 
BO036 a SUMn -- 16b TC (Q)1) 
80037 "3 
B0038 He HORAK HHHHHHEHKAHKHHHHAHHREE HEHEHE HHHHEHEEHHHERR RHEE EHH E EHH 
B0039 "3 
B0040 3 -1 -1 | -1 -1 -1 
B0041 *; DQ z z z 2 z z 
80042 M80 O06 m H H - 0-H HH 0-H O- HOO-  D ---0O---)>---0 
BO043 sd H H ' ' H ' 
B0044 *; 
BOO045S bat vB1i(k) vB2(k) vB3(k) vB4(k) vB5(k) vB6(k) 
B0046 ba H H ' H ! H 
BO0047 "5 ' i ' ' H H 
B0048 baat O--- > --~-0--- > ---0-- — >) -- -0-- - > - -0-- - > - - 0-08 
BO0049 ws wB1 WB2 wB3 WB4 wB5S WwB6 SEZ 
B0050 wig 
B0O0S51 0 IRAE AEA EGE AO OPA IESE NA SOE A AE SE AEE AOE OE 
B0052 "3 
B0053 0183 2EOF SIGDIF LAC B6,14 3 compute 86*DQ5 
BO054 01B4 F800 CALL FLOAT 3; ret/w mantissa in TEMP!; exp in ac 
01B5 O4DE 


B0055 0186 OOILA ADD DQSEXP 


80S 
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BO0S6 
BOO57 
Bo058 
BO0059 
BO0060 
Boo6! 
Bo062 
B0063 
Boo64 
BO065 
BOO66 


BOO67 
B0068 
BO069 
BoOo70 
BO071 


B0072 
B0073 
BO074 
BOO75 
BOO76 


B0077 
B0078 
B0079 
B0080 
60081 
B0082 
B0083 
B0084 
Bo08ss 
B0086 
B0087 
80088 
B0089 


BOOS0 
BO009! 
B0092 
B0093 
BO094 


B0095 
BO096 
B0097 
B0098 
BOO039 


BO0100 
BO101 
BO102 
BO103 
B0104 
BO105 
BO106 


0187 
01B8 
01B9 
O1BA 
01BB 
01BC 
01BD 
O1BE 
0O1BF 
01co 
O1Cc! 

01C2 
01C3 
01Cc4 
01C5 
01C6 
01C7 
01Cc8 
01c9 
OI1CA 
01CcB 
01CC 
01CcOdD 
O1CE 
OLCF 
0100 
O101 

0102 
0103 
0104 
0105 
0106 
0107 
0108 
01D9 
O10A 
0108 
010C 
0100 
O1DE 
O1DF 
O1E0 
O1Et 

O1E2 
O1E3 
Q1E4 
O1ES 
O1E6 
O1E7 
O1E8 
OL1E9 
OIEA 
O1EB 
O1EC 
O!1ED 
O1EE 
O1EF 


32010 FAMILY MACRO ASSEMBLER 


5022 
3822 
6A60 
2770 
6021 
6C80 
T96E 
5021 
6021 
TF8E 
FAOO 
04B6 
5922 
660F 
785A 
7948 
FFOO 
01CcC 
7F89 
1022 
5022 
2E0E 
FBOO 
O4DE 
0019 
6919 
5023 
3823 
6B5F 
2770 
6021 
6C80 
796E 
5021 
6021 
7F8E 
FAQO 
04BB 
5923 
660E 
7859 
7948 
FFOO 
O1E6 
TF 89 
1023 
5023 
2E00 
F800 
04DE 
0018 
6918 
5025 
3825 
6B5E 
2770 
6021 


CHK 1 


NEGI 


POS | 


CHK2 


NEG2 


POS2 


SACL 
LAR 
LT 
LAC 
MPY 
LTA 
AND 
SACL 
MPY 
PAC 
BLZ 


SACH 
ZALS 
XOR 
AND 
BZ 


ZAC 
SUB 
SACL 
LAC 
CALL 


ADD 
OMOV 
SACL 
LAR 


SUM] 
0,SUM1 
DQSMAN 
THREE, 7 
TEMP 1 
#0 
KFF80 
TEMP 1 
TEMP 1 


RS1 


DQ4EXP 
DQ4EXP 
SUM2 

0,SUM2 
DQ4MAN 
THREE ,7 
TEMP} 


DQ3EXP 
DQ3EXP 
SUM3 

0,SUM3 
DQ3MAN 
THREE ,7 
TEMP! 


0 oe we 


we we 


we 


Po 


te ee we 


wee 40 ee 
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exp of product offset by table add 
scaled up by 2**3 

multiply fudge factor 

fetch shift fac 


mult mant, add 48, 


apply shift factor = f{(exp) 
exp >= 26 
exp < 26 


check sign of product 


negate if necessary 


compute B5*0Q4 


ret/w mantissa in TEMP|1; exp in ac 


exp of product offset by table add 
scaled up by 2**3 

multiply fudge factor 

fetch shift fac 


mult mant, add 48, 


apply shift factor = f(exp) 


exp >= 26 


exp < 26 
check sign of product 


negate if necessary 


compute B4*0Q3 


ret/w mantissa in TEMP1; exp in ac 


exp of product offset by table add 
scaled up by 2**3 
multiply fudge factor 
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BO107 
B0108 
BO109 
Bo110 
BOl11 
BO112 


BO113 
BO114 
BO115 
B0116 
BO117 


BO118 
BO119 
B0120 
BO121 
B0122 


BO123 
BO124 
BO125 
BO126 
BO127 
B0128 
BO129 
BO0130 
BO131 
B0132 
80133 
BO134 
B0135 


BO136 
BO137 
B0138 
BO0139 
BO140 


BO!I41 
B01 42 
BO143 
B0144 
BO145 


B0146 
BOl47 
B0148 
BO149 
BO150 
BOIS! 
BO152 
B0153 
BO154 
BOo155 
BO156 
BO157 


OIFO 
O1F 1 
O1F2 
O1F3 
OlF4 
O1FS 
O1F6 
OFT 
O1F8 
O1F9 
OIFA 
O1FB 
O1FC 
O1FD 
O1IFE 
O1FF 
0200 
0201 
0202 
0203 
0204 
0205 
0206 
0207 
0208 
0209 
O20A 
0208 
o20C 
020D 
O20E 
O20F 
0210 
0211 
0212 
0213 
0214 
0215 
0216 
0217 
0218 
0219 
O21A 
0218 
O21C 
0210 
O21E 
O21F 
0220 
0221 
0222 
0223 
0224 
0225 
0226 
0227 
0228 
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6C80 
7T96E 
5021 
6021 
TF8E 
FAOO 
04CO0 
5925 
6600 
7858 
7948 
FFOO 
0200 
7F89 
1025 
5025 
2E0C 
F800 
O4DE 
0017 
6917 
S5O01E 
381E 
6B5D 
2770 
6D21 
6C80 
796E 
5021 
6D21 
TF8E 
FAOO 
04C5 
5S9IE 
660C 
7857 
7948 
FFOO 
O21A 
7F89 
1O1E 
SOLE 
2E0B 
F800 
O4DE 
0016 
6916 
501F 
381F 
6B5C 
2770 
6021 
6C80 
796E 
5021 
6021 
TF8E 


CHK3 


NEG3 


POS3 


CHK4 


NEG4 


POS4 


LTA 
AND 
SACL 
MPY 
PAC 
BLZ 


SACH 
ZALS 
XOR 
AND 
BZ 


ZAC 
SUB 
SACL 
LAC 
CALL 


ADD 
DMOV 
SACL 
LAR 
LTD 
LAC 
MPY 
LTA 
AND 
SACL 
MPY 
PAC 
BLZ 


SACH 
ZALS 
XOR 
AND 
BZ 


ZAC 
SUB 
SACL 
LAC 
CALL 


ADD 
DMOV 
SACL 
LAR 
LTD 
LAC 
MPY 
LTA 
AND 
SACL 
MPY 
PAC 


*,0 

KFF80 
TEMP 1 
TEMP} 


RS3 


SUM3,1 
B4 
SD0Q4 
K32768 
POS3 


SUM3 
SUM3 
B3,14 
FLOAT 


DQ2EXP 
DQ2EXP 
SUM4 
0,SUM4 
DQ2MAN 
THREE, 7 
TEMP 1 
#20 
KFF80 
TEMP 1} 
TEMP 1 


DQIEXP 
DQIEXP 
SUM5 
0,SUM5 
DQ 1 MAN 
THREE, 7 
TEMP} 
",0 
KFF80 
TEMP 1 
TEMP 1 


’ 


a0 0 ee 


’ 
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mult mant, add 48, fetch shift fac 


apply shift factor = flexp) 


exP >= 26 


exp < 26 
check sign of product 


negate if necessary 


compute B3*DQ2 


ret/w mantissa in TEMP1; exp in ac 


exp of product offset by table add 
scated up by 2**3 

multiply fudge factor 

fetch shift fac 


mult mant, add 48, 


apply shift factor = f(exp) 


exp >= 26 


exp < 26 
check sign of product 


negate if necessary 


compute B2*0Q! 


ret/w mantissa in TEMP1; exp in ac 


exp of product offset by table add 
scaled up by 2**3 

multiply fudge factor 

fetch shift fac 


mult mant, add 48, 


apply shift factor = f(exp) 


OLOCESWL 94) YUM WOddV S/Q4-ZE “LI 
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CCITT 


B0158 


B0159 
BO160 
BO161 
BO162 
B0163 


B0O164 
BO165 
B0166 
BO167 
B0168 


BO169 
BO! 70 
BO17} 
BOl72 
B01 73 
B0O174 
BO175 
BO176 
BOI77 
BO178 
BO0179 
BO180 
B0181 


B0182 
B0183 
BO0184 
BO185 
BO186 


B0187 
60188 
B0189 
BO190 
BO!I9I 
BO192 
BOi93 
BO0i94 
BO195 
B0196 
BO197 
B0198 
B0199 
B0200 
BO201 
B0202 
B0203 
B0204 
B0205 
BO206 
BO207 
B0208 
BO209 


0229 
022A 
022B 
022C 
022D 
022E 
022F 
0230 
0231 
0232 
0233 
0234 
0235 
0236 
0237 
0238 
0239 
023A 
023B 
023C 
0230 
023E 
023F 
0240 
0241 
0242 
0243 
0244 
0245 
0246 
0247 
0248 
0249 
O24A 
024B 
024C 
024D 
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FAOO BLZ RS5 3; exp >= 26 
O4CA 
591F SACH SUM5, 1 ; exp < 26 
660B CHKS ZALS B2 ; check sign of product 
7856 XOR S0Q2 
7948 AND K32768 
FFOO BZ POS5 
0234 
7F89 NEG5S ZAC ; negate if necessary 
1O1F SUB SUM5 
SO1F SACL SUMS 
2E0A POSS LAC B1l,14 ; compute B1*DQ 
F800 CALL FLOAT ; ret/w mantissa in TEMP!; exp in accum 
O4DE 
0015 ADD DQE XP 
6915 DMOV OQE XP 
5020 SACL SUM6 
3820 LAR 0,SUM6 ; exp of product offset by table addr 
6B5B LTD DQMAN ; scaled up by 2**3 
2770 LAC THREE,7 3; multiply fudge factor 
6021 MPY TEMP 1 
6C80 LTA *,0 ; mult mant, add 48, fetch shift factor 
T96E AND KFF80 
5021 SACL TEMP! 
6021 MPY TEMP 1 ; apply shift factor = f(exp) 
TF BE PAC 
FAOQO BLZ RS6 ; exp >= 26 
O4CF 
5920 SACH SUM6, 1 ; exp < 26 
660A CHK6 ZALS Bl ; check sign of product 
7855 XOR spQi 
7948 AND K32768 
FFOO BZ POSG6 
O24E 
7F89 NEG6 ZAC ; negate if necessary 
1020 SUB SUM6 
5020 SACL SUM6 
O024E POS6 EQU $ 
Pee eee eee e Sere eit rite ee eer trier trier 
*; compute SE -- signal estimate 
Me 
*; SE = AL(k-1)*SR(k-1) + A2(k-1)*SR(k-1) + SEZ(k) 
wt. 
* Multiplies are dONE in floating pt 
bat SR’s are stored in f.p. notation 
© A’s are floated each pass 
Re 
POO OS ee STOOP eee reer rire ree iit irri 
*; FLOATING POINT MULTIPLY (FMULT) 
Re 
fis INPUT: RECONSTRUCTED SIGNAL -- SRn (SRNEXP/SRMMAN) 
*; PREDICTOR COEFFICIENTS -- An 
Me 
ws OUTPUT: FILTER TAP OUTPUTS -- WAn (SUMN+6) 


CCITT 


BO210 
BO211 
BO212 
B0213 
BO214 
BO0215 
BO216 
BO217 
B0218 
BO219 
B0220 
Bo221 
BO222 
BO0223 
B0224 
Bo0225 
BO226 
B0227 
B0228 
BO229 
BO0230 
B0231 


B0232 
BO0233 
B0234 
BO0235 
BO236 
B0237 
B0238 
B0239 
BO240 
B0241 
B0242 
B0243 


BO244 
BO0245 
B0246 
B0247 
B0248 


BO0249 
BO250 
BO0251 
BO252 
B0253 


BO254 
BO255 
Bo0256 
BO257 
BO0258 
BO259 
BO260 
B0261 
BO262 


O24E 
O24F 
0250 
0251 
0252 
0253 
0254 
0255 
0256 
0257 
0258 
0259 
O25A 
025B 
025C 
025D 
O25E 
O25F 
0260 
0261 
0262 
0263 
0264 
0265 
0266 
0267 
0268 
0269 
026A 
026B 
026C 
026D 
O26E 
O26F 
0270 
0271 
0272 
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2E12 
F800 
O4DE 
0010 
5027 
3827 
6A53 
277D 
6D21 
6C80 
796E 
5021 
6021 
7F BRE 
FAOO 
0404 
5927 
6612 
7814 
7948 
FFOO 
0267 
7F89 
1027 
5027 
2E11 
F800 
O4DE 
OO1C 
691C 
5028 
3828 
6B52 
2770 
6021 
6C80 
796E 
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Wis NOTATION: SRnEXP -~ 4b + offset 
"3 SRnNMAN*8 -- 9b magnitude 
#3 An -- 16b TC (Q14) 
5 SUMN+6 -- 16b TC (Q)) 
Me 
Pree rerereer errr rrrrr reir rer irr rrrrrer irri rrr rir 
He 
Rg -1 -i 
*; SR 2 z 
*; O~--->---0--->---o 
"5 ' H 
"; ' H 
"5 VAI (Kk) vA2(k) 
bag 4 H 
* ! t 
"; 0--~<---0---<~---0---<---o 
st SE WA! WA2 SEZ 
Re 
rer rre rrr rer errr rier irri rirt irr rriir iri i iri irrti iii r 
He 
GETSE LAC A2,14 3 compute A2*SR1 
CALL FLOAT 3; ret/w mantissa in TEMP!1; exp in accum 
ADD SRIEXP 
SACL SUM7 
LAR 0,SUM7 3; exp of product offset by table addr 
LT SRIMAN ; scaled up by 2**3 
LAC THREE,7 ; multiply fudge factor 
MPY TEMP! 
LTA *,0 3; mult mant, add 48, fetch shift factor 
AND KFF8O 
SACL TEMP 1 
MPY TEMP 1 ; apply shift factor = f(exp) 
PAC 
BLZ RS11 3; exp d= 26 
SACH SUM7,1 3; exp < 26 
CHK!I1 ZALS A2 3; check sign of product 
XOR SR1 
AND K32768 
BZ POS 11 
NEGI1 ZAC + negate if necessary 
SUB SUM7 
SACL SUM7 
POSI11 LAC A1l,14 3; compute Al*SR 
CALL FLOAT 3; ret/w mantissa in TEMPI; exp fn accum 
ADD SREXP 
OMOV SREXP 
SACL SUM8 
LAR 0,SUM8 ; exp of -product offset by table addr 
LTD SRMAN 3; scaled up by 2**3 
LAC THREE,7 3; multiply fudge factor 
MPY TEMP 1 
LTA *,0 s; mult mant, add 48, fetch shift factor 
AND KFF80 


Ors 
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B0263 
B0264 
B0265 
B0266 


B0267 
B0268 
BO0269 
B0270 
B0271 
BO0272 


BO0273 
BO274 
B0275 
B0276 
B0277 
B0278 
BO0279 
B0280 
B0281 
BO282 
80283 
BO284 
B0285 
B0286 
BO0287 
80288 
BO289 
B0290 
BO291 

B0292 
80293 
B0294 
B0295 
B0296 
B0297 
B0298 
B0299 
B0300 
B030) 

B0302 
BO0303 
B0304 
B0305 
B0306 
B0307 
B0308 
BO0309 
BO0310 
BO311 

BO312 
B0313 
BO314 
B0315 
BO316 
B0317 


0273 
0274 
0275 
0276 
0277 
0278 
0279 
O27A 
027B 
a27C 
0270 
O27E 
O27F 
0280 
0281 


0282 
0283 
0284 
0285 
0286 
0287 
0288 
0289 
O28A 
028B 
028C 
028D 
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5021 SACL TEMP! 
6D21 MPY TEMP 1 3; apply shift factor = fl(exp) 
7TF8E PAC 
FAOO BLZ RS21 3; exp >= 26 
0409 
5928 SACH SUM8,1 3 exp < 26 
6913 CHK21 DMOV SR 
6613 ZALS SR 3 check sign of product 
7811 XOR Al 
7948 AND K32768 
FFOO BZ POS2! 
0282 
7F89 NEG21 ZAC 3; negate if necessary 
1028 SUB SUM8 
5028 SACL SUM8 
0282 POS21 EQU $ 
Re 
MOTT TOSTT TOC Irrrryrrrrrr reer Tree rerrrrrry Terr rrrrr rT Titers 
*; ACCUMULATE FILTER TAP OUTPUTS (ACCUM) 
He 
*s INPUT: FILTER TAP OUTPUTS -- WAn & WBn (SUMm) 
te 
*; OUTPUT: PARTIAL SUM OF ZEROES FILTER -- SEZ 
ba SIGNAL ESTIMATE -- SE 
He 
#s NOTATION: SUMm -- 165 TC (Q1) 
es SEZ -- 15b TC (QQ) [sign extended} 
iat SE -- 15b TC (QO) [sign extended] 
Me 
HL OHHH HHH HEHEHE HEHEHE HHH ENE HHH HHH EHR HHH NHN 
Me 
, 
2F20 LAC SUM6,15 3; accumulate products 
OF 1F ADD SUM5,15 
OF IE ADD SuUM4,15 
OF 25 AOD SUM3,15 
OF 23 ADO SUM2,15 
OF 22 AOD SUM1,15 
5904 SACH SEZ,1} 
OF 27 ADO SUM7,15 
OF 28 ADO SUM8,15 
5903 SACH SE, 1 
2F03 LAC SE,15 
5803 SACH SE 
He 
POSE IO SITTS II ITTTITTTTrrrrryriTrrrirrirerrr rrr Trrrrerer iy 
*; limit speed contro! parameter: AL <= 1.0 
He 
"3 AL = 1 {ff APP > 1 
ex AL = APP if APP <= 1 
He 
bt} INPUT: UNLIMITED SPEED CONTROL -- AP (APP) 
Me 
"3 OUTPUT: LIMITED SPEED CONTROL -~ AL 
"3 NOTATION: APP -- unsigned 10b (Q8) 
*; AL -- unsigned 7b (Q6) 
Re 
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80318 
BO319 
B0320 
80321 
BO0322 
BO0323 
BO324 


B0325 
B0326 
B0327 
BO328 
B0329 
BO330 
BO0331 
B0332 
80333 
B0334 
B0335 
BO336 
B0337 
B0338 
B0339 
BO0340 
B0341 
B0342 
80343 
B0344 
B0345 
B0346 
80347 
B0348 
B0349 
BO0350 
BO0351 
BO352 
B0353 
B0354 
BO0355 
B0356 
80357 
80358 
BO0359 
B0360 
BO361 
B0362 
BO0363 
B0364 


80365 
B0366 
B0367 
BO0368 
BO369 
BO370 


028E 
O28F 
0290 
0291 
0292 
0293 
0294 
0295 
0296 


0297 
0298 
0299 
029A 
0298 
029C 
0290 
O29E 
O29F 
02A0 
O2A1 
O2A2 
02A3 
02A4 
O2A5 
O2A6 
O2A7 
02A8 
O2A9 
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2C4C 
5006 
2005 
184C 
FDOO 
0297 
2405 
7970 
5006 
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a a i a lala aa le Ra gi ae sh a ali adi i a ial i aol Rial Halla Mahia oie ateaeiid 


Me 
? 


LIMA LAC ONE ,12 
SACL AL 
LAC APP 3 check if APP >=1 
SUB ONE ,8 
BGEZ MIX ;APP >= 1 
LAC APP ,4 
ANO MF FCO 
SACL AL s;APP < 1 
Me 
Prrree errr errr rrirrer ere rrr ir rrrr retire err iii rrrrirr rr y 
*; MIX 
*; form linear combination of fast and slow scale factors 
He 
*; Y(K) = (1-AL(K))*YLCK-1) + AL(K)*YU(K-1) 
*s INPUT: SLOW QUANTIZER SCALE FACTOR -- YL (YLL/YLH) 
abr FAST QUANTIZER SCALE FACTOR -- YU 
Wie LIMITED SPEED CONTROL -- AL 
He 
*; OUTPUT: QUANTIZER SCALE FACTOR -- Y 
*s RESCALED QUANTIZER SCALE FACTOR -- YOVER4 
He 
*s NOTATION: YL -- 19b unsigned (Q15) 
i} stored as: 
e3 low 15b -- YLL 
m5 ht 4b -- YLH 
*; YU -~- 13b unsigned (Q9) 
* AL -- 7b unsigned (Q6) 
*; Y -- 13b unsigned (Q9) 
an YOVER4 -- lib unsigned (Q7) 
He 
PUTT TST ITT TIr err rrr reer rrr rrr errr rir irri rrr rrr reer errr y 
te 
MIX LAC YLL,10 ; shift yl right by 6 
SACH TEMP3 
LAC YLH,9 
ADD TEMP3 
SACL TEMP3 3 YL>>6 
LAC YU 
SUB TEMP3 3; YU-(YL>>6) 
SACL TEMP! 
LT AL 
MPY TEMP 1 
PAC 3 AL*(YU-(YL>>6)) 
BGEZ NONNEG 3; negative truncation 
ADD M4095 
NONNEG AOD TEMP3,12 
SACH Y,4 
LAC Y,14 $ compute and save y>>2 
SACH YOVER4 
RET 3; ret from SIGDIF 
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0003 
cooo! 
co002 
C0003 
C0004 
cooos 
co006 
co007 
co008 
co009 
coo10 
cool! 
cool2 
co013 
cool14 
cools 
cool6 
Ccoo!7 
cools 
cool9 
co020 
co021 
C0022 
C0023 
C0024 
co025s 
C0026 
C0027 
c0028 
co0029 
C0030 
C0031 
C0032 
C0033 
C0034 
co035 
C0036 
C0037 
C0038 
C0039 
co040 
C0041 
C0042 
C0043 
C0044 
C0045 
C0046 
C0047 
C0048 
C0049 
co050 
C0051 
co0s2 
C0053 
C0054 
cooss 
C0056 


O2AA 2026 


O2AB 
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COPY AQUAN.ASM 


AR RORRHHRHHEH HEHEHE HE EEE HHH HEHE TEER H HEHE 
DIFFERENCE SIGNAL COMPUTATION 

INPUT: LINEAR PCM SAMPLE -- SL (SAMPLE) 
SIGNAL ESTIMATE -- SE 


OUTPUT: DIFFERENCE SIGNAL -- D (accumulator) 


NOTATION: SL -- !14b TC (QO) 
SE -- 155 TC (QO) 
D -- 16b TC (QO) 


{sign extended} 
{sign extended] 


(tee eee eee eee eRe RR ER REE RR RRR REE RRR RRR RRR ERR RRR RES R EEE 


LAC SAMPLE ; 
SUB SE 


compute difference sig 


9 Ot 3b aE Tt 5b TE TE SF HF OE TF Ot FE OF SE HE OE OE HP HE OOF OF OD tt OF HE OE OE OF OF OF OF OE OE Ot Oe UP Oe ae OF OE Oe te Oe ee Oe te te ot 


ADAPTIVE QUANTIZER 


Implements the following modules (per CCITT spec): 
LOG -- 
SUBTB -- 
QUAN ~~ 


computes log of difference signal 
scales log by subtracting Y 
computes 4b output 


(Pee eee ee Pee PERE EERE REE ERR ERR RRR RRR RR RRR RRR ARERR RRR) 


INPUT: OIFFERENCEO PCM SAMPLE -- D (accumulator) 


QUANTIZER SCALE FACTOR -- Y (YOVER4) 
OUTPUT: ADPCM OUTPUT SAMPLE ==." 
NOTATION: 0D cis 


YOVER4 -- 
I aes 


[6b TC (QO) 
11b SM (Q7) POSITIVE VALUE ONLY 
4b SM (QO) 


ot Ot Mh 0G St Ob We Ot He OG St TE OE He UE OE 1b SE Ob OE te 8 Oe te TE Ob HE OE OE Ot TE Ot TE Ot OF Wt He OD tb HE Oot te OD tt Ot tt ee te ee ee 


Cr ee 


mee we we we ee te we ae we ee we ee We ee We we ee we we we we we we we we we we we ee we we 
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C0057 
co0s8 
C0059 
co060 
C0061 
c0062 
C0063 
C0064 
C0065 
C0066 
C0067 
C0068 
C0069 
C0070 


C0071 
co072 


C0073 
C0074 


co075 
C0076 


C0077 
C0078 
C0079 


C0080 
C008! 
c0082 


C0083 
co084 


C0085 
Cco086 
C0087 


coo88 
C0089 
C0090 


coogi 
c0092 


C0093 
C0094 


C0095 
C0096 
C0097 


coo98 
co03s9 
C0100 


O2AC 
O02AD 
O2AE 
O2ZAF 
02B0 
02B! 
02B2 
02B3 
02B4 
0285 
0286 
0287 
0288 
0289 
02BA 
028B 
02BC 
02BD 
O2BE 
02BF 
02CO 
02Ci 
02C2 
02C3 
02C4 
02C5 
02C6 
02C7 
02c8 
02C9 
O2CA 
02CB 
02cc 
o2CD 
O2CE 
O2CF 
02D0 
0201 
0202 
0203 
0204 
02D5 
0206 
0207 
0208 
0209 
020A 
02DB 
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5824 
TF BE 
5021 
184C 
FOO00 
O2E7 
0460 
FDOO 
O2CE 
0270 
FOOO 
02C3 
014C 
FDOO 
O2BF 
7000 
2721 
F900 
0321 
7001 
2621 
F900 
0321 
124C 
FDOO 
O2CA 
7002 
2521 
F900 
0321 
7003 
2421 
F900 
032! 
1470 
FOOO 
020C 
Q54C 
FDOO 
0208 
7004 
2321 
F900 
0321 
7005 
2221 
F900 
0321 
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i Y 


PRR PRS RRSZR EERE SERRE S ERE RESP RR EERE RE RE RE RE RE SESSA ERR ERE REE SS 


first get log of difference signal -- express 
as unsigned !t!lb number (4b exp/7b mantissa) 


First order log approximation: log2 (!+x) = x. 


see e eB Oe we ES 
a0 we we we we we we we wes 


SACH TEMP4 s; -l if neg; 0 if positive (0S) 
ABS 
SACL TEMP] 
GETEXP SUB ONE ,8 3; binary search to get exponent 
BGEZ c8T014 
COTO7 ADD M15,4 3 TEMP1-16 exp = 0-7 
BGEZ C4TO7 
COoTO3 ADD THREE,2 ; TEMPI-4 exp = 0-3 
BGEZ C2TO3 
coTo!t aAodDdD ONE, | 3 TEMP1-2 exp = 0-1 
BGEZ EXP 1 
EXPO LARK 6,0 ; exp = 0 
LAC TEMP1,7 
B GETMAN ; save exponent and get mantissa 
EXP 1 LARK 0,1 3 exp = 1 
LAC TEMP1,6 
B GETMAN 
C2TO3 SUB ONE, 2 ; TEMP!-8 exp = 2-3 
BGEZ EXP3 
EXP2 LARK 0,2 3; exp = 2 
LAC TEMPI1,5 
B GE TMAN 
EXP3 LARK 0,3 ; exp = 3 
LAC TEMP1,4 
B GETMAN 
C4TO7 SUB THREE ,4 ; TEMP1-64 exp = 4-7 
BGEZ C6TO7 
C4TOS ADD ONE ,5 3 TEMP1-32 exp = 4-5 
BGEZ EXP5 
EXP4 LARK 0,4 3 exp = 4 
LAC TEMP!1,3 
B GETMAN 
EXP5 LARK 0.5 3; exp = 5 
LAC TEMP1,2 
B GE TMAN 


cls 
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PAGE 0025 PAGE 0026 
CO101 O2DC 164C Cé6TO7 SUB ONE ,6 3 TEMP1-128 exp = 6-7 C0145 0315 F900 B GETMAN 
C0102 0200 FDOO BGEZ EXP7 0316 0321 
O2DE O2E3 C0146 0317 7000 EXP1i3 LARK 0,13 3; exp = 13 
C0103 O2DF 7006 EXPE LARK 0,6 : exp = 6 C0147 0318 2A21 LAC TEMP!,10 
C0104 O2E0 2121 LAC TEMP),1 C0148 0319 5821 SACH TEMP 1 
C0105 02E1 F900 B GETMAN C0149 O31A 2021 LAC TEMP) 
O2Z2E2 032) C0150 0318 F900 B GETMAN 
C0106 O02E3 7007 EXP7 LARK 0,7 ; exp = 7 031C 0321 
C0107 O2E4 2021 LAC TEMP] C0151 0310 7OOE EXP14 LARK 0,14 >; exp = 14 
C0108 O2E5 F900 B GE TMAN C0152 O31E 292} LAC TEMP1,9 
O2E6 032) C0153 O31F 5821 SACH £TEMP1 
C0109 O2E7 1860 C8T014 SUB M15,8 3 TEMP1-4096 exp = 8-14 C0154 0320 202) LAC TEMP 1 
C0110 O02E8 FDOO BGEZ CCTOE C0155 0321 796F GETMAN AND M127 
O2E9 0308 C0156 0322 3021 SAR O.TEMP1 
C0111 O2ZEA OATD C8TO11 ADD THREE,10 ;TEMP1-1024 exp = 8-11 C0157 0323 0721 ADD TEMP1,.7 ; OL 4e...7m (sign=SGN(D)) 
C0112 O2EB FDOO BGEZ CATOB cois8 *; 
O2ZEC O2FC c0159 8 gm rr rr rr rrr enn  chaedeateetinenteatetetete tate 
C0113 O2ED 094C C8TO9 ADD ONE ,9 3 TEMP1-512 exp = 8-9 C0160 ia : 
C0114 O2ZEE FDOO BGEZ EXP9 CoO161 "3; scale LOG D by subtraction (Y>>2 is in YOVER4) 
O2EF O2F6 C0162 93 
Co1t5 O2FO 7008 EXP8 LARK 0,8 3; exp = 8 C0163 Ya a rn rr rr rrr rrr rrr 
CO116 O2Fi 2F2)1 LAC TEMP1,15 C0164 *; 
C0117 O2F2 5821 SACH TEMP! C0165 0324 0B4C SUBTB ADD ONE,!11 3; offset by 2K 
C0118 O2F3 2021 LAC TEMP! C0166 0325 1029 suB YOVER4 
CO119 O2F4 F900 B GETMAN C0167 "3 
O2F5 0321 c0168 Da rr renee 
C0120 O2F6 7009 EXP9 LARK 0,9 3; exp = 9 co169 *; 16 LEVEL quantizer 
C0121 O2F7 2E21 LAC TEMP1,14 co170 *; 
C0122 O02F8 5821 SACH TEMP 1 Co171 *; Table values defined in CCITT spec p67 
C0123 O2F9 2021 LAC TEMP 1 C0172 *; Implemented table is offset by 2048 
C0124 O2FA F900 B GE TMAN C0173 8 a rn rr rece 
O2FB 0321 C0174 *; 
C0125 O2FC 1A4C CATOB SUB ONE ,10 3; TEMP1-2048 exp = 10-11 Cco175 O7F9 ITAB! EQU 2041 3 bottom of level 1 
C0126 O2FD FDOO BGEZ EXP11 C0176 087B ITAB2 £EQU 2171 ; bottom of level 2 
O2FE 0305 C0177 O8CA ITAB3 EQU 2250 3; bottom of level 3 
C0127 O2FF 7OOA EXPIO LARK 0,10 ; exp = 10 coi78s 0905 ITAB4 EQU 2309 3 bottom of level 4 
C0128 0300 2021 LAC TEMP1,13 C0179 0936 ITABS EQU 2358 ; bottom of level 5 
C0129 0301 5821 SACH TEMP 1 co180 0964 ITAB6E EQU 2404 3; bottom of level 6 
C0130 0302 2021 LAC TEMP) co181 0995 ITAB7 €EQU 2453 3 bottom of level 7 
C0131 0303 F900 B GETMAN C0182 “3 
0304 0321 C0183 0326 107C QUAN SuB K2309 3 TEMP2-2309 
C0132 0305 7008 EXPI1 LARK 0,11 : exp = 11 C0184 0327 FDOO BGEZ C14TO7 
C0133 0306 2C21 LAC TEMPi1,12 0328 033E 
C0134 0307 5821 SACH TEMP 1 C0185 0329 0078 CI1O0TO3 ADD K138 3 TEMP2-2171 I = 0-3 
C0135 0308 2021 LAC TEMP} C0186 032A FOOO BGEZ C12TO3 
C0136 0309 F900 B GE TMAN 0328 0335 
030A 032) C0187 032C OO7A CIOTO! ADD K130 ; TEMP2-2041 I = 0-1 
C0137 0308 1C7D CCTOE SsuUB THREE,.12 ; TEMP1-16384 exp = 12-14 C0188 0320 FDOO BGEZ TEQ! 
co138 030C FDOO BGEZ EXP14 032E 0332 
0300 0310 C0189 032F 7E00- IEQO LACK 0 
C0139 O30€ OD4C CCTOD ADD ONE , 13 ; TEMP1-8192 exp = 13-14 C0190 0330 F900 8 GETIM 
C0140 O30F FODOO BGEZ EXP13 0331 0351 
0310 0317 C0191 0332 7E01 IEQI LACK | 
CO141 0311 7OOC EXPI2 LARK 0,12 3; exp = 12 C0192 0333 F900 B GETIM 
C0142 0312 2821 LAC TEMP1,11 0334 0351 
C0143 0313 5821 SACH TEMP 1 C0193 0335 1078 CI2TO3 SUB K79 ; TEMP2-2250 Il = 2-3 
C0144 0314 202) LAC TEMP 1 C0194 0336 FDOO BGEZ 1EQ3 
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C0195 
C0196 


C0197 
C0198 


C0199 
co200 


co0201 
co202 


C0203 
c0204 


C0205 
C0206 


C0207 
C0208 


c0209 
C0210 


co0211 
co21l2 
C0213 
c0214 
co2is5 


0337 
0338 
0339 
033A 
03368 
033C 
0330 
O33€ 
033F 
0340 
0341 
0342 
0343 
0344 
0345 
0346 
0347 
0348 
0349 
O34A 
0348 
034C 
0340 
O34E 
O34F 
0350 
035! 
0352 
0353 
0354 
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0338 
TEO2 
F900 
0351 
7EQ3 
F900 
0351 
1079 
FDOO 
034A 
0064 
FOOO 
0347 
7EO4 
F900 
0351 
TEOS 
F900 
0351 
1065 
FDOO 
0350 
7E06 
F900 
0351 
TEO7 
5002 
7824 
7960 
7F 80 


1EQ2 


1EQ3 


CI14TO7 


CISTO6 


TEQ4 


1EQ5 


C16TO7 


TEQ6 


1EQ7 
GETIM 


QDONE 


2 
GETIM 


3 
GETIM 


K95 
CI6TO7 


K46 
1EQS 


4 
GETIM 


5 
GETIM 


K49 
1EQ7 


6 
GETIM 


7 
IM 
TEMP4 
M15 


Pry 


wn 0 ee ee 
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TEMP2-2404 I = 4-7 

TEMP2~-2358 1 = 5-6 

TEMP2-2453 1 = 6-7 


accumulator = iI} 

add sign bit and flip if necessary 
mask for final four-bit value 
return from AQUAN 
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0004 
poool 
00002 
00003 
DO004 
DOO05 
00006 
00007 
00008 
00009 
00010 
00011 
00012 
00013 
00014 
00015 
DOO!I6 
00017 
00018 
00019 
00020 
00021 
D0022 
00023 
00024 
D0025 
00026 
00027 
00028 
0bO0629 
DO0030 
00031 
00032 
00033 
D0034 
00035 
00036 
00037 
00038 
D0039 
DO0040 
00041 
00042 
D0043 
00044 
00045 
00046 
00047 
00048 
DO0049 
00050 
DOOS51 
00052 
00053 
00054 
00055 
00056 


0355 
0356 
0357 
0358 
0359 
035A 
0358 
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2002 
OO6A 
672} 
034C 
6769 
034C 
6768 
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COPY PROICT.ASM 


Re 
¥ 
HS HHHHHHHREHHH HHH HH HEHEHE HEE HEHE HHH RHE HH EHH HHH HEH EHH 


*; ADAPTATION/PREDICTION 


*; Implements the following modules per CCITT spec: 
Be 
"3 Inverse Adaptive Quantizer 
pen RECONST -- reconstructs D from I 
"3 ADDA ~- adds back scale factor 
Wis ANTILOG -- log to lin conversion to get 0Q 
* FLOAT A -- float DQ 
"3 Scale Factor Adaptation 
es FUNCTW -- map 1 to log scale factor 
wie FILTO -- update fast scale factor 
ar LIMB ~- limit scale factor 
a FILTE -~- update stow scale factor 
kt Adaptation Speed Control 
a FUNCTF -- map I to F function 
3 FILTA ~~ update short term ave of F 
os FILTB -- update long term ave of F 
"3 SUBTC ~- determ speed control] update 
"3 technique 
wy FILTC -~- update speed control 
ight Adaptive Predictor 
"5 ADDB -~- compute reconstructed signal 
*; FLOAT A -- float SR 
"3 ADDC -~~ compute sign of PK 
LA UPA2 -- update A2 coeff of 2nd order pred 
*; LIMC -- limit A2 
ae UPA] ; - =- update Al coeff of 2nd order pred 
"3 LIMD ~~ limit Al ‘ 
*; UPB -- update coeffs of 6th order pred 
beh XOR -- compute sign of DQ*DQn 
es e 
*5 NOTE: DELAY A/B/C implicit in timing of MIX/LIMA 
o3 and computation of SEZ/SE 
Re 
Pe eee er errr rere rrr rrr rrr errr irri irri iii rrr 
Be 
% 
*; First convert quantized difference back to log domain. 
*; This is done by table look-up. Also use ADPCM magnitude 
*; to look-up the scale-factor multiptiers WI and rate-of- 
*; change weighting function FI. 
He 
? 
PRDICT LAC IM 
ADD INQTAB ; reconst table 
TBLR TEMP 1 ; DQLN 
ADD ONE ,3 3; WI! table address and offset 
TBLR | 3 lookup WI 
ADD ONE ,3 ; Fl table address and offset 
TBLR FI ; lookup FI 


He 
il A al aN ad a i a i acl a a ll a led aaa 


*; INVERSE ADAPTIVE QUANTIZER 


"3 INPUT: ADPCM INPUT SAMPLE -~- I (1M->TEMP1) 


vis 


OLOTESILL 9H) YUM WOddV S/iqy-ce “LI 
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00057 
00058 
DO0059 
00060 
00061 
00062 
D0063 
D0064 
00065 
00066 
00067 
00068 
00069 
00070 
00071 
00072 
D0073 
00074 
DO0075 
00076 
00077 
00078 
00079 
DO0080 
00081 
DOo082 
D0083 
00084 
00085 
00086 
00087 
00088 
DOO89 
00090 
00091 

D0092 
00093 
00094 
00095 
00096 
00097 
00098 
DO099 
00100 
00101 

00102 
00103 
00104 
00105 
00106 
DO107 
00108 


00109 
00110 
DO111 
DO112 


035C 
0350 


035E 
O35F 
0360 
0361 

0362 
0363 
0364 
0365 
0366 
0367 
0368 
0369 
036A 
036B 
036C 
0360 
036E 
O036F 
0370 
0371 

0372 
0373 
0374 
0375 
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2521 
0529 


OC4C 
5C15 
7972 
oc4c 
5058 
2015 
0066 
6723 
6A23 
044C 
6723 
605B 
7F8E 
5C10 
2B4C 
5054 
2001 
134C 
FAQO 
0377 
7F89 
1010 
5010 
2B4B 


ee We ee Oe wt we we we we we we we we we we oe we ee we 


add back sc 


eee eee emhUch RDU oF seeseeseseesevBeesve®=ez~awe eve lB 8 


O oe we we we oe we ae we ve 
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QUANTIZER SCALE FACTOR -- Y (YOVER4) 


ale factor 


ADDA LAC TEMP! ,5 
AOD YOVER4,5 
Me 
*; now covert to linear domain 
He 
ALOG ADD ONE, 12 
SACH OQEXP,4 
AND M4095 
ADD ONE, 12 
SACL DQMAN 
LAC OQEXP 
ADD SHIFT 
TBLR TEMP3 
LT TEMP3 
AOD ONE , 4 
TBLR TEMP3 
MPY DQMAN 
PAC 
SACH 0Q,4 
ADDSGN LAC ONE, 11 
SACL SDQ 
LAC I 
SUB ONE ,3 
BLZ FLTDQ 
ZAC 
SUB 0Q 
SACL 0Q 
LAC MINUS, 11 


SHHHHRHHHHHEHHEHHREHEH HEHEHE eee Oe 


we we we oe we 


OUTPUT: QUANTIZED DIFFERENCE SIGNAL -- DQ 

NOTATION: I -- 4b SM (Q0) 
IM -~ 3b magnitude (QO) 
DQLN(TEMP1) -- 12b TC (Q7) [Sign extended] 
YOVER4 -~- tlb SM (Q7) POSITIVE VALUE ONLY 
DQ -~- 15b TC (QO) [sign extended] 
DQMAN*8-- 9b magnitude 
DQEXP -- 4b magnitude 


tt TF AF MO We OE OE OP OF Ot OE oP te ot AO OF OE at OE OE Hh OE MF OP OF Se GE OF OF Ub OE Ot me OE Ee OE OE OE OE OE Ot Oe te et 


0QSs 
ee me ce ee ee ee ee ee ee a ee ee + 
H 
Vv 
~--+ DQLN +-~----- + OQL +-----~--- + 0Q 
ONST } ------ >! ADDA |$------ >TANTILOG {------ > 
--~+ +--~---- + +-------- + 
1 Y 


HeHHHHHRHHHKHEHHHHHRERRHHHR EEE D 


{nc exponent for floated value 
save exponent + sign ext 
isolate mantissa * 2°*5 

Alog x = 1 + x 

OQMAN = 0001 XXXX XXxXO 0000 


add table ptr 
get multiplier 


offset to mask table 
mask for daqman 
adjust mantissa 


+2048 represents +sign 


check sign 


I carried negative sign 


-~2048 represents -sign 
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00113 
00114 
00115 
00116 
00117 
00118 
DO1I9 
00120 
D012) 

DO122 
00123 
00124 
00125 
00126 
00127 
00128 
00129 
00130 
00131 

D0132 
00133 
00134 
00135 
00136 
00137 
DO138 
00139 
DO0140 
00141 

00142 
00143 
00144 
00145 
00146 
00147 
00148 
00149 
DO150 
D015! 

00152 
00153 
00154 
D0155 
00156 
00157 
DO158 
00159 
00160 
00161 

00162 
00163 
00164 
DO165 
DO0166 
00167 
DO0168 
D0169 


0376 


0377 
0378 
0379 
O37A 
0378 
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5054 SACL $0Q 
J 
PTI TITIITITITITITTI TTT TT TTTTiTiTrti rer iretrrree 
*; FLOAT DQ -- convert 2’s comp number to floating 
” 
*; INPUT: DQ 
* 
sl OUTPUT: 4b exponent in OQEXP (saved from log value) 
wes 6b mantissa*8 in DQMAN (adjusted from log) 
*; sign preserved in 0Q 
Be 
PTET ET OOIITT TTT Terrrrr irre rere rrr rr rirr irri rrr rr rrr ee 
* 
’ 
255B FLTOQ LAC DQMAN,5 ; 00000000 O0000001X XXXXxXxX00 00000000 
5C58 SACH DQMAN,4 ; DQMAN = 0000 0000 O01X XXXX 
2358 LAC DQMAN,3 ; DQMAN * 2**3 
7923 AND TEMP3 
5058 SACL DQMAN 


Ho HH SH HM OF OE Ot Oe TE OE OE Ot OF Oe OE OE OO OF OE OE OE Ot aE te Oe ee ee ee ee 


3 QUANTIZER SCALE FACTOR ADAPTATION 


INPUT: ADPCM SAMPLE -- I 
OUTPUT: FAST QUANTIZER SCALE FACTOR -- YU 
SLOW QUANTIZER SCALE FACTOR -- YL (YLL/YLH) 
NOTATION: I -- 4b SM (QO) 
YU -- 13bd unsigned (Q9) 
YL -- 19b unsigned (Q15) 


stored as: ‘* 
Tow 156 -- YLL 
hi 4b -- YLH 


HHKKRKHHAHKSEHHRHHHHEEHHHHHHHHEHSEHHHRHHHRHHHHEKHHHHHHHHHHHRHH ERE Do 


WI +------- +YUT +------— +YUP +-------- + YU 
--->} FILTD {--->;} LIMB {-+->{} DELAYB } ------------~---- > 
a H 
iy {0 teen +YLP +---~--~--- + YL 
’ +->! FILTE |--->} DELAYG }-+-> 
+------- + +-------- + } 
* H 
Ho nn + 


(Ae eee eee EERE RRR ERE R RE RR EERE ERE RRR SERRE EERE RRR RE 


Update fast adaptation scale factor 


YUCK) = (1-2°%*-5)*Y(k) + (2%*-5) *W( 1 (Kk) ) 


INPUT: QUANTIZER SCALE FACTOR -- Y 


SCALE FACTOR MULTIPLIER -- WI 


OUTPUT: FAST QUANTIZER SCALE FACTOR -- YU 


, 
* 
, 
Re 
? 
Re 
9 
Be 
. 
Be 
’ 
Me 
’ 
Be 
, 
He 
? 
He 
, 
He 
; 
He 
; 
Be 
? 
Re 
3 
Be 
; 
Be 
’ 
Be 
$ 
He 
. 
He 
’ 
*; +------- + +------ 4+ | t-------- + 
Be 
% 
He 
% 
He 
, 
He 
Hy 
Be 
’ 
* 
. 
Be 
° 
He 
, 
Re 
; 
Be 
? 
Me 
, 
He 
° 
He 
. 
Be 
, 
Mae 
$ 
Me 
? 
Re 
1 
Re 
. 


NOTATION: WI 


12b TC (Q4) [sign extended] 
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Bo170 
DO171 
DO172 
DO173 
DOI74 
00175 
DO176 
00177 
00178 
00179 
00180 
D018! 
DO182 
00183 


DO184 
DO185 


00186 
00187 


00188 
00189 
00190 
00191 
00192 
00193 
00194 
00195 
00196 
00197 
00198 
00199 
00200 
00201 
00202 
00203 
00204 
00205 
00206 
DO207 
00208 
00209 
00210 
D0211 
00212 
00213 
00214 
Doe215 
00216 
00217 
D0218 
DO219 
00220 
D022} 
DO222 
00223 


037C 
037D 
O37E 
O37F 


0380 
038) 

0382 
0383 
0384 
0385 
0386 
0387 
0388 
0389 
038A 


0388 
038C 
0380 
O38E 
O38F 
0390 
0391 
0392 
0393 
0394 
0395 
0396 
0397 
0398 
0399 
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"3 Y -- 13b unsigned (Q9) 
*; YU -- 13b unsigned (Q9) 
* 
Perri reir rrit rrr 
Re 
2C09 FILTO LAC Y,12 ; Y (Q21) 
1709 SUB Y,7 + Y/32 (Q21) 
0cé69 ADD W1,12 3 WI/32 (Q21) 
5C4E SACH YU,4 ; YU (Q9) 
me 
*; limit quant scale factor 1.06 <= YU <= 10.0 
1C6B LIMB SUB K544,12 3 check lo threshold 
FOOO BGEZ CHKHI! 
0386 
2068 LAC K544 
F900 B STRLIM 3 go store limited value 
038A 
1C61 CHKHI SUB K4576,12 3; check hi threshold 
FBOO BLEZ FILTE 3s; within lLimits--continue 
0388 
206C LAC K5120 
504E STRLIM SACL YU 
[ay 
Pere e eT errr Terre rr irri 
*; Update slow adaptation scale factor 
Me 
, 
ws YL(K) = (1-2*%*-6)"*YL(k-1) + 2%"*-6 * YU(k) 
He 
"3 INPUT: SLOW QUANTIZER SCALE FACTOR -- YL (YLL/YLH) 
“3 FAST QUANTIZER SCALE FACTOR ~~ YU 
Re 
y 
"3 OUTPUT: SLOW QUANTIZER SCALE FACTOR -- YL (YLL/YLH) 
Re 
"5 NOTATION: YU -- 13b unsigned (Q9) 
"3 YL -- 19b unsigned (Q15) 
"3 stored as: 
"3 tow 15b -- YLL 
*; hi = 4b -- ~YLH 
Me 
A ONO Ree ORR A RRR ORR ERE R ORE OHNO ER RE EHHEHEH REESE EHS EE 
fe 
2649 FILTE LAC YLH,6 3; shift yl left by 6 
502} SACL TEMP] 
2F2i LAC TEMP1,15 : YL (Q21) 
064A ADD YLL,6 
1F 49 SUB YLH,15 ; YL/64 (Q21) 
104A SUB YUL 
O64E ADD YU,6 3 YU/64 (Q21) 
5921 SACH TEMP1,1! 
7974 AND M32767 
5022 SACL TEMP2 3; result = yl (shifted left by 6) 
2A22 LAC TEMP2,10 3: shift result right 6 --> 4.Q15 
5822 SACH TEMP2 
2921 LAC TEMP! ,9 s YL (QI5) 
0022 ADD TEMP2 
5949 SACH YLH,1 


CCciTT 


DO224 
00225 
00226 
00227 
00228 
D0229 
00230 
00231 

00232 
00233 
00234 
00235 
00236 
00237 
00238 
00239 
00240 
00241 
00242 
00243 
00244 
00245 
00246 
00247 
00248 
00249 
00250 
00251 
00252 
00253 
00254 
D0255 
00256 
00257 
00258 
00259 
DO0260 
D026} 

00262 
00263 
00264 
00265 
00266 
00267 
00268 
00269 
00270 
00271 

00272 
00273 
00274 
00275 
D0276 
00277 
00278 
00279 
00280 
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O39A 7974 AND M32767 
039B 504A SACL as 
Me 
Po EIPSOOTTTITrrrrrirrrrrrrrerrrrrrrrrrrrrrrirrrirr rrr rete 
*; ADAPTATION SPEED CONTROL 
He 
*;5 INPUT: ADPCM SAMPLE. -- | 
Me 
“3 OUTPUT: UNLIMITED SPEED CONTROL -- AP (APP) 
Re 
*; NOTATION: I -- 4b SM (QO) 
*3 APP -- 10b unsigned (Q8) 
He 
OTT TISIITITITTIIITrrrrrrrrrrrririrrirrirrrrrrritrri rr tTi ry 
“3 
"5 iY 
Ws Vv 
*; Fl +------- +DMSP+-—------ + AX +------- +APP +-------- + AP 
“; -+->!} FILTA {-+->! SUBTC !--->! FILTC $--->! DELAYA !-+4+-> 
as {00 terre + bo tee ener + +------- + Steet + ! 
"sf ‘ v . 7 H 
*s; § OMS{ +-------- + } achat tertenetastetetenteteeteetaaete + 
Sigh +-} DELAYA ; |} 
“ef sca recta + 4 
"3 4 H 
3 fo tence + DMLP ; 
3; +->) FILTB |} --4+--~---- + 
bat $ecocm=-— + | 
we “ Vv 
*; OML | +----~---~ + 
*3 +-| DELAYA | 
bet $-------- + 
“3 
HHH HHHHEHHHHHHHHHESHHHHHHHHHHHHHTHRHHREHHH HHH RHO HAE 
“; update short term average of FI 
We 
’ 
*; DMS(k) = (1-2"*-5)*DMS(k-1) + 2°°-5 * FI(k) 
Be 
"3 INPUT: SHORT TERM AVERAGE ~~ OMS 
"3 RATE-OF-CHANGE FUNCTION -- FI 
@e 
“3 OUTPUT: SHORT TERM AVERAGE -- OMS 
Ms. 
“3; NOTATION: DMS -- I2b unsigned (Q9) 
°; FI -- 7b unsigned (Q4) 
Be 
PPT ITITITITITITITTITITrrrrrrrrre errr rirrrirrriri tri Trrr tree 
He 
039C 2F68 FILTA LAC FI,15 + FE/32 (Q24) 
0390 OFO7 ADD DMS,15 3; OMS (Q24) 
O39E 1!A07 SUB OMS,10 + DMS/32 (Q24) 
039F 5907 SACH DMS, 1 
Me 
OTITTIITITTITITTT ITT TTTTTTT Titi TiTirTiT irri ite 
"; update long term average of Fi 
Be 
9 
"5 DML (k) = (1-2°*-7)*OML(k-1) + 2"°*-7 * FI(k) 
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00281 
D0282 
00283 
00284 
00285 
00286 
DO0287 
00288 
00289 
00290 
00291 
00292 
00293 
00294 
00295 
00296 
00297 
00298 
00299 
00300 
00301 
00302 
00303 
00304 
00305 
00306 
D0307 
00308 
00309 
00310 
DO311 
00312 
00313 
00314 
00315 
00316 
00317 
00318 
00319 
00320 
DO0321 
00322 
00323 
00324 
00325 
00326 
00327 
00328 


00329 
00330 
00331 
DO0332 
00333 
00334 
00335 


03A0 
03A1 
03A2 
03A3 


03A4 
03A5 
03A6 
O3A7 
03A8 
03A9 
O3AA 
03AB 
03AC 
03AD 
O3AE 
O3AF 
0360 
03B1 
03B2 
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*y 
bea INPUT: LONG TERM AVERAGE -- DML 
©; RATE-OF-CHANGE FUNCTION -- FI 
Me * 
bag: OUTPUT: LONG TERM AVERAGE -- OML 
Me 
as NOTATION: OML -- 14b unsigned (Q11) 
"5 FI -- 7b unsigned (Q4) 
We 
MOTT TTIITITIITITTITITITTI TTI TTT TTI TTT Ti TrrriTerTiieeeeiee 
Me 
2F6é8 FILTB LAC FI,15 > FI/128 (Q26) 
OFO8 ADD DML ,15 > OML (Q26) 
1808 Sus OML .8 3 DML/!128 (Q26) 
§908 SACH DML, ) 
a. 
ae en oF gs Sr ee 
"; Compute mag of diff of short and long term functions of 
*; quantizer output sequence and perform threshold 
*; comparison to compute speed control parameter--low-pass 
*; result. 
*; 
sadir APP(k) = (1-2%*-4)*APP(k-1) + 2"*-3 , if Y < 3 or 
®; ff {;DMS-DML! > 2**-3 * DML 
"3 else 
*; 
—_ APP(k) = (1-2**-4)*APP(k-1) 
Me 
"3 INPUT: SHORT TERM AVERAGE -- DMS 
"3 LONG TERM AVERAGE -~ OML 
bo UNLIMITED SPEED CONTROL -- APP 
i.e QUANTIZER SCALE FACTOR -- Y 
Me 
, 
ws OUTPUT: UNLIMITED SPEED CONTROL -- APP 
* 
*; NOTATION: APP -~ 10b unsigned (Q8) 
"; Y -- 13b unsigned (Q9) 
"; OMS -- I2b unsigned (Q9) 
ba OML -- 14b unsigned (QI!) 
me 
OPTI ITTTTTTITTTT Terr TIrrrrerrrririrrrr ir rrr rier rT r riety 
me 
6505 FILTC ZALH APP 3; APP (Q24) 
1C05 SUB APP,12 ; APP/16 (Q24) 
5805 SACH APP + (1-2"°*°-4)*APP (Q8) 
2009 LAC Y 
1970 suB THREE ,9 ; 3 (Q9) 
FAOO BLZ ADDI8 
0383 
2008 LAC OML,13 ; OML/8 (Q27) 
5823 SACH TEMP3 ; DML/8 (Q11) 
2207 LAC OMS ,2 ; OMS (Q11) 
1008 SUB OML 3 OMS-DML 
7F88 ABS 
1023 SuB TEMP3 3 |OMS—-DML }-DML/8 
FAOO BLZ APRED 
0386 
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D0336 
DO0337 
00338 
00339 
00340 
00341 

D0342 
00343 
00344 
00345 
00346 
00347 
00348 
00349 
00350 
0035)! 

00352 
00353 
00354 
00355 
00356 
00357 
00358 
00359 
00360 
00361 
00362 
D0363 
00364 
DO365 
00366 
00367 
00368 
00369 
00370 
D037! 

00372 
00373 
00374 
00375 
00376 
00377 
00378 
00379 
00380 
00381 

00382 
00383 
00384 
00385 
00386 
00387 
00388 
00389 
00390 
00391 

00392 


03B3 
0384 
0385 


0386 
0387 
0388 
0389 
33BA 
03BB 
03BC 
0380 
O3BE 
O3BF 
03c0 
03C1 
03C2 
03C3 
03C4 
03G5 
03C6 
03C7 
03C8 
03c9 
O3CA 
03CB 
o3CC 
03CD 
O3CE 
O3CF 
0300 
0301 
0302 
03D3 
0304 
0305 
0306 
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2005 AODI8 LAC APP 3; APP (Q8) 
054C ADD ONE ,5 
5005 SACL APP i + 1/8 (Q8) 
Be 
OPPOSITE Irrrrrrrrrr rrr Terre errr rir reer rrr 
*; ADAPTIVE PREDICTOR 
Me 
POOUeOEe SET T Perrier re rrrr errr rrrr rrr ir rrr rere errr rrr rrr ry 
Re 
03B6 APREOD EQU $ 
Me 
Pere Ieee rrr rrrr rrr ree errr rrr rrr rire rer rrr rrr rrr rir 
“3; compute coeff of 6th order predictor 
ae 
"3; Bi(k) = (1-2"*-8)*Bi(k-1) + 2*°*-7*SGN(0Q(k) ]*SGN(OQ(k-f)] 
"3 for { = 1...6 
ms and Bi is Implicitly limited to +/- 2 
a. 
sa NOTATION: Bn -- 16b TC (Q14) 
“3 S0Qn -- +2048 if sign positive 
“3 -2048 if sign negative 
He 
ree rerre rere rrrrrrrre rrr rrr irri itiri rir rrr 
* 
6A5A GETBE LT . S0Q6 
280F LAC 86,8 ; BE * 2**-8 TRUNCATED 
5821 SACH TEMP 1 
2F OF LAC B6,15 + Q29 
1F2) SUB TEMP1,15 
6054 MPY soQ + SGN(SDQ)*SGN(SDQ6) * 2**-7 = (Q29) 
6859 LTD S0Qs 
590F SACH Bé,1 3 Qi4 
280E GETB5 LAC BS,8 ; BS * 2°*-8 TRUNCATED 
5821 SACH TEMP 1 
2F0E LAC B5,15 + Q29 
1F2} SUB TEMP1,15 
6054 MPY SDQ 
6B58 LTD SDQ4 
590E SACH B5,1 3 Qi4 
2800 GETB4 LAC B4,8 ; B4 * 2**-8 TRUNCATED 
5821 SACH TEMP} 
2F 0D LAC B4,15 3 Q29 
1F2) SUB TEMP1,15 
6054 MPY soQ 
6B57 LTO $0Q3 
5900 SACH B4,1 3 Ql4 
280C GETB3 LAC 53,8 ; B63 * 2**-8 TRUNCATED 
5821 SACH TEMP 1 
2F0C LAC B3,15 + Q29 
1F21 SUB TEMP1,15 
6054 MPY s0Q 
6856 LTD S0Q2 
590C SACH B3,1 3+ QI4 
2808 GETB2 LAC 82,8 ; B2 * 2°*-8 TRUNCATED 
5821 SACH TEMP] 
2F 0B LAC B2,15 + Q29 
1F21 SUB TEMP1,15 
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D0393 
00394 
00395 
00396 


- 00397 


D0398 
00399 
D0400 
00401 
00402 
00403 
00404 
00405 
00406 
00407 
00408 
D0409 
00410 
00411 
00412 
00413 
00414 
D0415 
DO416 
00417 
00418 
D0419 
00420 
00421 
00422 
00423 
D0424 
00425 
D0426 
D0427 
D0428 
00429 
00430 
00431 
00432 
00433 


D0434 
D0435 


00436 
00437 


D0438 
00439 
00440 


00441 
00442 
00443 
00444 
00445 


0307 
0308 
0309 
030A 
0308 
030C 
0300 
O30€ 
O30F 
O3€0 


O3E1 
03E2 
03E3 
O3E4 
O3E5 
03E6 
O3E7 
03E8 
O3E9 


Q3EA 
03EB 
O3EC 
Q3ED 
O3EE 
O3EF 
O3F0 
O3F 1 
O3F2 
O3F3 
O3F 4 
O3F5 
O3F6 
O3F7 
03F8 
O03F9 


32010 FAMILY MACRO ASSEMBLER PC2.1 84.107 16:36:03 03-20-85 
PAGE 0035 
6054 MPY SDQ 
6B55 LTO SOQ! 
5908 SACH B2,1 3; Q14 
280A GETB! LAC B1,8 ; BL * 2**-8 TRUNCATED 
5821 SACH TEMP} 
2FOA LAC B1,15 3 Q29 
1F2) SUB TEMP1,15 
6054 MPY SDdQ 
6B54 LTO SDQ 
590A SACH Bl,1 3 Q14 
Re 
eee STEEP eTee TST e rere rr rr rree eer err rrr reer irre rr rrr rr 
*; To update coefficients of 2nd order predictor, 
*; First get sign of sum of SEZ and DQ 
Me 
ie NOTATION: if SEZ+D0Q >= 0 then PKO = 512 
bie else PKO = -512 
He 
POTTS ESET TTT S ITT e Tree ere rere errr rere errr errr rere 
Me 
6950 ADOC DMOV PK1 3; PK1l==>PK2 
694F DMOV PKO 3; PKO==>PK} 
2004 LAC SEZ 
0110 ADD 0Q,1 
5821 SACH TEMP 1 3; FFFF or 0000 
2A21 LAC TEMP1,10 ; FCOO or 0000 
094C ADD ONE ,9 ; FEOO or 0200 ! -512 or +512 
504F SACL PKO 
6A4F SUMGTO LT PKO 
Re 
POTS TITISTETISI TTT TI TTT TT errr rrr Titer r i rer Teer 
bast now calculate 1/2 * FLAI(K-1)]j 
He 
ws = 2*Al if {AL} <= 1/2 
"3 = SGN(A1) if fAIe > 1/2 
He 
eer rere rererrrrrre rete errr er rrr rier ierrrr rrr rrr rir ity 
es / 
2111 4GETF LAC Al,1 3 2"°Al 
5023 SACL TEMP3 
FAQO BLZ GETF2 
O3F 4 
1E4C GETF1 SUB ONE ,14 3; is tAl! ¢< 1/2 
FAOO BLZ GETA] 
O3FA 
2062 LAC K16382 $ approx 1 
F900 B DONEF 
O3F9 
7F88 GETF2 ABS 
1E4C SUB ONE ,14 ; is TAL! ¢€ 1/2 
FAOO BLZ GETA1 
O3FA 
2063 LAC M16382 $ approx -} 
§023 DONEF SACL TEMP3 


"3 
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*; Compute Al coeff of 2nd order predictor 
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00446 
00447 
00448 
D0449 
D0450 
00451 
D0452 
D0453 
00454 
D0455 
D0456 
00457 
00458 
00459 
00460 
00461 
D0462 
D0463 
D0464 
00465 
DO466 
00467 
00468 
00469 
00470 
00471 
00472 
00473 
D0474 
D0475 
00476 
DO477 
00478 
D0479 
DO480 
D048 1 


D0482 
00483 
D0484 
D0485 
00486 
00487 
00488 
D0489 
00490 
00491 
00492 
D0493 
00494 
DO0495 
00496 
00497 
00498 
D0499 
00500 
DOS501 


O3FA 
O3FB 
O3FC 
O3FD 
O3FE 
O3FF 
0400 
0401 
0402 
0403 


0404 
0405 
0406 
0407 
0408 


0409 
O40A 
0408 
040C 
0400 
O40E 
O40F 
0410 
0411 
0412 
0413 


0414 
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“3 AL(K) = (1-2"*-8)*A1(K-1) 
bee + (3*2"**-8)*SGN[p(k) J*SGN[p(k~-1) J] 
*: NOTATION: Al -- 16b TC (Q14) 
tit PKn -- +512 if SGN[p(k)] = 1 
eat -Sl2 if SGN{p(k)j} = -1 
He 
TUS SCE TTTSE SECT eeeee eee eee rer rere ree ceri er Seer eer 
He 
2811 GETAIL LAC Al,8 3; Al*2**-8 TRUNCATED 
5822 SACH TEMP2 
2Cl} LAC Al,12 3; Q26 
1C22 SUB TEMP2,12 
6050 MPY PK1 ; SGN(p(k-1)*SGN[ p(k) ] 
TF BF APAC 
TF 8F APAC 
TF 8F APAC ; +3"SGN(p(k-1)*SGN{p(k) ] 
5C11 SACH Al,4 ; store as Q14 
TF BE PAC 3; save sign 
OUT TETISTTTSSLISES STEP TTT TST T eT TTT errr rr rrr rere eee eee rere 
*; Compute A2 coeff of 2nd order predictor 
* 
#5 A2(k) = (1-2**-7)*A2(k-~-1) 
#s + (2**-7)*{SGN(pP(k) ]*SGNE (p(k-2) j 
ie - F[AI(k-1) ]*SGN[ p(k) ]*SGN(p(k-1) ]} 
a NOTATION: A2 -- 16b TC (Q14) 
": F(),TEMP3 ~~ 16b TC (Q14) 
we PKn. -- +512 if SGN{p(k)J] = 1 
*; -512 if SGN{p(k)j) = -!1 
* 
POV ET TS ISTTeSe eer errre TTT eT TTT er eee e rere eer errr errr errr Ty 
FDOO GETA2 BGEZ SUBF ; if sign + --> subtract F 
0409 
7F89 ZAC ; else negate F and subtract 
1023 SUB TEMP3 
5023 SACL TEMP3 
2912 SUBF LAC A2,9 +: A2"*2**-7 TRUNCATED 
581E SACH SUM4 
6051 MPY PK2 3; SGN(p(k-2)]*SGN[ p(k) ] 
7FBE PAC 
TF 8F APAC 3; 2*2**-8* above (Q26) 
1623 SUB TEMP3,6 3; 2° TEMP3*2""*-7 (Q26) 
5C23 SACH TEMP3,4 3; Q14 
2012 LAC A2 ‘ 
1OIE SUB SUM4 3; leak factor 
0023 ADD TEMP3 
5012 SACL A2 3 Q14 
* 
5821 SACH TEMP1 3; save sign to make +/- .75 
*s Limit A2 to +/- .75 and prevent overflow 
. ad 
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PAGE 0037 PAGE 0038 

00502 0415 7F88 LIMC ABS DO556 0433 FDOO BGEZ D4TO7 

00503 0416 IC70 SUB THREE, 12 3 tvalue! must be < .75 0434 044A 

00504 0417 FBOO BLEZ LIMO 00557 0435 0:70 DOTO3 ADD THREE,1 ; TEMPI-2 -- exp = Q-3 
0418 0410 DO558 0436 FDOO BGEZ D2TO3 

DO0S505 0419 2C7D LAC THREE ,12 3; -75 (Q14) 0437 0430 

DOSO6 041A 7821 XOR TEMP 1 3; 17s complement if negative DO0559 0438 2052 DOTOL LAC SRMAN 3; exp = 0-1 

D0507 0418 1021 SUB TEMP} 3 2°s complement if negative DO560 0439 50!1C EXxX01 SACL SREXP 

00508 041C 5012 DONEC SACL A2 3; Q14 60561 043A 284C LAC ONE ,8 

00509 iat 00562 043B 5052 SACL SRMAN 

DO510 "3; limit Al(k) to +/- [1-2"*-4 - A2(k)] 00563 043C 7F8D RET 

00511 *; 00564 043D 114C D2TO3 SUB ONE, 1 ; TEMPI-4 -- exp = 2-3 

00512 0410 2A6D LIMD LAC M15,10 3; 1-2"**-4 (Q14) 00565 043E FDOO BGEZ EXX3 

DO0513 O41E 1012 SUB A2 043F 0445 

00514 O41F 5021 SACL TEMP 1 3 1-2"**-4-A2P (Q14) DO0566 0440 2752 EXxX2 LAC SRMAN,7 3; exp=2 

DO515 0420 2011 LAC Al DO567 0441 5052 SACL SRMAN 

DO0516 0421 5824 SACH TEMP4 + save sign to make +/- LIMIT 00568 0442 7E02 LACK 2 

DO517 0422 7F88 ABS DO569 0443 SOIC SACL SREXP 

D05!8 0423 1021 SUB TEMP! DO0570 0444 7F80 RET 

DO519 0424 FBOO BLEZ FLTSR > Al <= LIMIT DO571 0445 2652 EXX3 LAC SRMAN,6 ; exps3 
0425 042A 00572 0446 5052 SACL SRMAN 

00520 0426 2021 AILIM LAC TEMP 1 ; ABS value of LIMIT 00573 0447 7E03 LACK 3 

D052! 0427 7824 XOR TEMP4 3 1’s complement if negative 00574 0448 SOIC SACL SRE XP 

00522 0428 1024 SUB TEMP4 3 2°S complement if negative DOS75 0449 7F8D RET 

00523 0429 5011 SACL Al 3; Q14 DO576 044A 1370 ODATO7 SUB THREE,3 ; TEMP!1-32 -- exp = 4-7 

00524 ke DO577 0448 FOOO BGEZ D6TO7 

00525 HSH HHH HHH EERE HHH HHH N HEHEHE RHEE RHEE HEHEHE EEE 044C O45A 

DO0S26 *; COMPUTE RECONSTRUCTED SIGNAL 00578 044D 044C OD4TOS ADD ONE ,4 ; TEMP1-16 -- exp = 4-5 

00527 ae DO579 044— FOOO BGEZ EXxX5 

00528 ak INPUT: QUANTIZED OIFFERENCE SIGNAL -- 0Q O44F 0455 

00529 ms SIGNAL ESTIMATE ~- SE 00580 0450 2552 EXxX4 LAC SRMAN,S ; exp=4 

D0530 3 0058! 0451 5052 SACL SRMAN 

00531 dee OUTPUT: RECONSTRUCTED SIGNAL ~-- SR 00582 0452 7E04 LACK 4 

00532 “3 00583 0453 501C SACL SREXP 

00533 ie NOTATION: OQ -- 15b TC (QO) [sign extended] 00584 0454 7F8D RET 

00534 "5 SE -- 15b TC (QO) [sign extended} 00585 0455 2452 EXxX5 LAC SRMAN,4 ; exp=5 

DO535 "5 SR -- 16b TC (QO) 00586 0456 5052 SACL SRMAN 

00536 ws DO587 0457 7E05 LACK 5 

00537 HHH HHH HH HHHH HHH HHH HHH HRN HOHE PREP RHH 00588 0458 SOIC SACL SRE XP 

00538 "3; FLOAT SR -- convert 2’s comp number to floating DO0589 0459 7F8D RET 

00539 #5 00590 045A 154C D6TO7 SUB ONE ,5 ; TEMP1-64 -~ exp = 6-7 

00540 *3 INPUT: accumulator D0591 045B FODOO BGEZ EXX7 

00541 * 045C 0462 

00542 #3 OUTPUT: --4b exponent left in SREXP 00592 0450 2352 EXX6 LAC SRMAN,3 ; exp=6 

00543 "5 --6b mantissa*"8 left in SRMAN DO593 O45E 5052 SACL SRMAN 

D0544 baler ~-sign preserved in SR 00594 O45F 7E06 LACK 6 

DO545 fs DO595 0460 501C SACL SRE XP 

00546 HNN RENEE E HHH REHEHNHNH HHH HHH HERE HHH HNN EHR 00596 0461 7F8D RET 

00547 * DO597 0462 2F52° EXX7 LAC SRMAN, 15 

00548 042A 2010 FLTSR LAC 0Q ; compute reconstructed signal 00598 0463 5852 SACH SRMAN 

DO0549 0428 0003 ADD SE 00599 0464 2352 LAC SRMAN, 3 

DO0550 042C 5013 SACL SR DO0600 0465 5052 SACL SRMAN 

D0551 0420 7F88 ABS + convert to floating point notation 00601 0466 7E07 LACK 7 

DO5S2 042E 5052 SACL SRMAN DO602 0467 S5O0IC SACL SRE XP 

00553 042F 174C SUB ONE ,7 + binary search to get exponent DO603 0468 7F80 RET 

DO554 0430 FDOO BGEZ D8TOF 00604 0469 1177 O8TOF SUB K960,1 >; TEMP1-2048 -- exp = 8-15 
0431 0469 DO605 046A FOOO BGEZ DCTOF 

DO555 0432 0360 O0OTO7 ADD M15,3 ; TEMP!1-8 -- exp = 0-7 0468 0491 
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D0606 
00607 


00608 
DOo609 


00610 
00611 
00612 
00613 
00614 
00615 
00616 
D0617 
D0618 
00619 
00620 
00621 
00622 
00623 
D0624 
DO625 


00626 
00627 
00628 
D0629 
00630 
00631 
00632 
00633 
00634 
00635 
00636 
00637 
00638 
00639 
00640 
00641 


00642 
00643 


D0644 
00645 
D0646 
D0647 
00648 
00649 
00650 
DO0651 
00652 
00653 
00654 
00655 
00656 
DO0657 


046C 
0460 
046E 
O46F 
0470 
0471 
0472 
0473 
0474 
0475 
0476 
0477 
0478 
0479 
O47A 
0478 
047C 
047D 
O47E 
O47F 
0480 
0481 

0482 
0483 
0484 
0485 
0486 
0487 
0488 
0489 
048A 
0488 
048C 
0480 
048E 
O48F 
0490 
0491 

0492 
0493 
0494 
0495 
0496 
0497 
0498 
0499 
049A 
049B 
049C 
049D 
049E 
O49F 
04A0 
04A1 

04A2 
04A3 
04A4 
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0970 
FDOO 
0480 
084C 
FDOO 
0479 
2E52 
5852 
2352 
5052 
7TEO8 
501C 
7F 8D 
2052 
5852 
2352 
5052 
TEOS 
501C 
7F8D 
194C 
FDOO 
048A 
2C82 
5852 
2352 
5052 
TEOA 
501C 
7F80 
2B52 
5852 
2352 
5052 
7EOB 
50I1IC 


TF 8D 


187D 
FDOO 
04A5 
oc4c 
FOOO 
049E 
2A52 
5852 
2352 
5052 
TEOC 
501C 
7F80 
2952 
5852 
2352 
5052 
7EOO 
501C 
7F8D 


08TOB 
D8TO9 


EXX8 


EXX9 


DATOB 


EXX10 


EXX11 


OCTOF 
DCTOD 


EXX12 


EXX13 


ADD 
BGEZ 


ADD 
BGEZ 


LAC 
SACH 
LAC 
SACL 
LACK 


SACL . 


RET 
LAC 
SACH 
LAC 
SACL 
LACK 
SACL 
RET 
SUB 
BGEZ 


LAC 
SACH 
LAC 
SACL 
LACK 
SACL 
RET 
LAC 
SACH 
LAC 
SACL 
LACK 
SACL 
RET 
SUB 
BGEZ 


ADD 
BGEZ 


LAC 
SACH 
LAC 
SACL 
LACK 
SACL 
RET 
LAC 
SACH 
LAC 
SACL 
LACK 
SACL 
RET 


THREE ,9 
DATOB 


ONE ,8 
EXx9 


SRMAN,14 
SRMAN 
SRMAN, 3 
SRMAN 

8 

SREXP 


SRMAN, 13 
SRMAN 
SRMAN, 3 
SRMAN 

9 

SREXP 


ONE ,9 
EXX11 


SRMAN, 12 
SRMAN 


SRMAN, 11 
SRMAN 
SRMAN, 3 
SRMAN 

11 

SREXP 


THREE, 11 
DE TOF 


ONE, 12 
EXX13 


SRMAN, 10 


SRMAN,9 
SRMAN 
SRMAN, 3 
SRMAN 
13 
SREXP 


. 
% 
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TEMP1-512 -- exp = 8-11 


TEMP1-256 -- exp = 8-9 


exp=8 


exp=9 


TEMP1-1024 -- exp=10-11 


exp=10 


exp=11 


TEMP1-8192 -~ exp=12-15 
TEMP1-4096 -- exp=12-13 


exp=12 


exp=13 


03-20-85 


CCITT 


00658 
00659 


00660 
00661 
D0662 
DO0663 
00664 
00665 
00666 
00667 
D0668 
00669 
00670 
00671 
00672 
D0673 


04A5 
O4A6 
O4A7 
04A8 
O4A9 
O4AA 
04AB 
O4AC 
O4AD 
O4AE 
O4AF 
04B0 
04B1 
04B2 
04B3 
04B4 
04B5 
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1D4C 
FDOO 
O4AF 
2852 
5852 
2352 
5052 
TEOE 
501C 
7F80 
2752 
5852 
2352 
5052 
TEOF 
501C 
7F8D 


DE TOF 


EXX14 


EXX15 


SUB 
BGEZ 


LAC 
SACH 
LAC 
SACL 
LACK 
SACL 
RET 
LAC 
SACH 
LAC 
SACL 
LACK 
SACL 
RET 


ONE, 13 
EXX15 


SRMAN,8& 
SRMAN 
SRMAN,3 
SRMAN 
14 
SREXP 


SRMAN,7 
SRMAN 
SRMAN, 3 
SRMAN 
15 
SREXP 


PC2.1 84.107 16:36:03 
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TEMP1-16384 -- exp=14-15 
exp=14 
exp=15 


03-20-85 


07S 


OIOZESWL 9 WWM WOddV S/Q4-ZE “LI 


CCITT 


0005 
£0001 
E0002 
E0003 
E0004 
E0005 
E0006 
E0007 


E0008 
E0009 
E0010 
E0011 


E0012 
E0013 
E0014 
E0015 


E0016 
E0017 
£0018 
E0019 


E0020 
E0021 
E0022 
E0023 


E0024 
£0025 
E0026 
£0027 


£0028 
E0029 
E0030 
E0031 


E0032 
E0033 
E0034 
E0035 


04B6 
04B7 
0488 
04B9 
O4BA 
04BB 
04BC 
0480 
O4BE 
O4BF 
04CO0 
04C1 
04C2 
04C3 
04C4 
04C5 
04C6 
04C7 
04C8 
04c9 
O4CA 
04CB 
o4cc 
04CD 
O4CE 
O4CF 
0400 
0401 

0402 
04D3 
04D4 
04D5 
0406 
0407 
0408 
0409 
O4DA 
04DB 
040C 
04DD 
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7F88 
7974 
5022 
F900 
01C4 
7F88 
7974 
5023 
F900 
01DE 
TF88 
7974 
5025 
F900 
O1F8 
7F88 
71974 
SOIE 
F900 
0212 
7F88 
7974 
SOIF 
F900 
022C 
7F88 
1974 
5020 
F900 
0246 
7F88 
7974 
5027 
F900 
O25F 
7F88 
7974 
5028 
F900 
0279 


RS2 


RS3 


RS4 


RSS 


RS6 


RS11 


RS2! 
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COPY UTILITY.ASM 


code to do left shifts for SEZ/SE calculations 


ABS ; make positive before mask 
ANDO M32767 ; keep lower 15 bits 
SACL SUM 1 3; Save result 
8 CHK} ; return 
ABS 

AND M32767 

SACL SUM2 

B CHK2 

ABS 

AND M32767 

SACL SUM3 

8 CHK3 

ABS 

AND M32767 

SACL SUM4 

B CHK4 

ABS 

AND M32767 

SACL SUM5 

B CHK5 

ABS 

AND M32767 

SACL SUM6 

B CHK6 

ABS 

“AND M32767 

SACL SUM7 

B CHKI 1 

ABS 

AND 432767 

SACL SUM8 

B CHK21 
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E0037 
£0038 
£0039 
£0040 
£0041 
£0042 
£0043 
£0044 
E0045 
E0046 
£0047 
E0048 
£0049 
E0050 
£0051 
£0052 
E0053 
£0054 
E0055 


E0056 
£0057 


E0058 
E0059 


E0060 
E0061 


E0062 
E0063 
E0064 
E0065 
E0066 
E0067 
E0068 
E0069 
£0070 
E0071 


E0072 
E0073 
E0074 
E0075 
E0076 
E0077 
E0078 
E0079 
E0080 
E0081 


E0082 
E0083 
E0084 
E0085 
E0086 
E0087 


Q40E 
O40F 
O4E0 
O4E1 
O4E2 
04E3 
O4E4 
Q4E5 
O4E6 
O4E7 
04E8 
O4E9 
O4EA 
O4EB 
O4EC 
O4ED 
O4EE 
O4EF 
O4F0 
O4F | 
O4F2 
O4F3 
O4F 4 
O4F5 
O4F6 
O4F7 
04F8 
O4F9 
O4FA 
O4FB 
O4FC 
O4FD 
O4FE 
O4FF 
0500 
0501 
0502 
0503 
0504 
0505 
0506 
os07 
0508 
0509 
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002A 


5821 
202} 
7F88 
5021 
164C 
FOOO 
0511 
0076 
FDOO 
0501 
0170 
FDOO 
O4F6 
2021 
FEOO 
O4F2 
254C 
5021 
TE2A 
7F 8D 
2521 
5021 
TE2B 
7F8D 
114C 
FpOoO 
O4FD 
2421 
5021 
TE2C 
7F80 
2321 
5021 
TE2D 
7F8D 
134C 
FDOO 
0508 
2221 
5021 
TE2E 
7F80 
144 


FDOO 


2 
Pr 
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PC2.1 84.107 


FLOAT SUBROUTINE~-convert 2’s comp number to floating 


fe ENPUT: accumulator 
ea5 OUTPUT: 
baa --~4b exponent left in accum 
*s --6b mantissa left in TEMPI 
*3 --sign preserved in original number 
Me 
Bee pre as ceca ohh) oad dn pian es ea 
FLTSFT EQU 42 ; address of shift multipliers 
Re 
=| 
FLOAT SACH TEMP} 
LAC TEMP 1 
ABS 
SACL TEMP} 
SuB ONE ,6 ; binary search to get expONEnt 
BGEZ E7TOD 
EO0TO7 ADD K56 ; TEMP1-8 -- exp = 0-6 
BGEZ E4TO6 
EO0TO3 ADD THREE,! ; TEMP1-2 -- exp = 0-3 
BGEZ E2TO3 
EOTO!] LAC TEMP} 3; exp = 0-1 
BNZ E1 
EO LAC ONE ,5 >; exp=0 
SACL TEMP 1 
LACK FLTSFT+0 
RET 
El LAC TEMP1.5 3; exp=1 
SACL TEMP 1 
LACK FLTSFT+1 
RET 
E2TO3 SUB ONE, 1 3; TEMPI-4 -- exp = 2-3 
BGEZ E3 
E2 LAC TEMP1,4 3; exp=2 
SACL TEMP! 
LACK FLTSFT+2 
RET 
£3 LAC TEMP!,3 ; exp=3 
SACL TEMP 1 
LACK FLTSFT+3 
RET 
E4TO6 SUB ONE ,3 >; TEMP1-16 -- exp = 4-6 
BGEZ E5TO6 
E4 LAC TEMP!,2 ; exp=4 
SACL TEMP 1 
LACK FLTSFT+4 
RET 
E5TO6 SUB ONE, 4 ; TEMP1-32 -- exp = 5-6 
BGEZ E6 


OLOZESWNL 9D BM WOddv S/Q4-Z€ “LI 


17S 


CCITT 


£0088 
E0089 
£0090 
£0091 
E0092 
E0093 
£0094 
E0095 


E0096 
£0097 


E0098 
E0099 


E0100 
E0101 
E0102 
E0103 
E0104 
E0105 
E0106 
E0107 
£0108 
E0109 


E0110 
E0111 

£0112 
E0113 
E0114 
E0115 
E0116 
E0117 
E0118 
E0119 


E0120 
E0121 


E0122 
£0123 
E0124 
E0125 
E0126 
E0!27 
E0128 
E0129 
£0130 
E0131 


E0132 
E0133 
£0134 
EQI135 


O50A 
0508 
050C 
0500 
OSOE 
OSOF 
0510 
0511 
0512 
0513 
0514 
O515 
0516 
0517 
0518 
0519 
OSI1A 
0518 
O51C 
0510 
OSIE 
OSIF 
0520 
0521 
0522 
0523 
0524 
0525 
0526 
0527 
0528 
0529 
O52A 
052B 
052C 
0520 
052E 
052F 
0530 
053) 
0532 
0533 
0534 
0535 
0536 
0537 
0538 
0539 
053A 
0538 
053C 
0530 
053€ 
O53F 
0540 
0541 
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O50F 
2121 
5021 
TE2F 
7F8D 
7E30 
7F 80 
1077 
FDOO 
0520 
0870 
FOOO 
0522 
074C 
FDOO 
O51E 
2F21 
5821 
7E31 
7F80 
2E21 
582) 
TE32 
7F8D 
184C 
FDOO 
0529 
2021 
5821 
7TE33 
7F80 
2C21 
5821 
TE34 
7F80 
1A70 
FDOO 
0538 
OB4C 
FDOO 
0537 
2821 
5821 
7E35 
7F8D 
2A21 
5821 
TE36 
7F 80 
1C4C 
FOOO 
O4EE 
2921 
5821 
7E37 
7F80 


E5 


E6 


E7TOD 


E7TOA 


—7TO8 


E7 


E8 


ESTOA 


E9 


E10 


El2 


EDTOE 


E13 


LAC 
SACL 
LACK 
RET 
LACK 
RET 
SUB 
BGEZ 


ADD 
BGEZ 


ADD 
BGEZ 


LAC 
SACH 
LACK 
RET 
LAC 
SACH 
LACK 
RET 
SUB 
BGEZ 


LAC 
SACH 
LACK 
RET 
LAC 
SACH 
LACK 
RET 
SUB 
BGEZ 


ADD 
BGEZ 


LAC 
SACH 
LACK 
RET 
LAC 
SACH 
LACK 
RET 
SUB 
BGEZ 


LAC 
SACH 
LACK 
RET 
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PC2.1 


TEMP1,1 ; 
TEMP} 
FLTSFT+5 


exp=5 


FLTSFT+6 3; exp=6 


i] 
~ 
t 
Ww 


K960 ; TEMP1-1024 -- 


EBTOO 


exp 


TEMP1-256 -- 


It 
| 
| 
oO 


THREE .8& ; 
ESTOA 


exp 


i 
~~ 
| 
@ 


ONE ,7 3 TEMPI-128) -- 


E8 


exp 


TEMP1,15 3 
TEMP 1 
FLTSFT+7 


exp=7 


TEMP1,14 ; 
TEMP 1 
FLTSFT+8 


exp=8 


ONE ,8 ; TEMP1-512 9-10 


E10 


-- exp = 


TEMP!,13 ; 
TEMP 1 
FLTSFT+9 


exp=9 


TEMP1,12 ; exp=i10 
TEMP! 

FLTSFT+10 
THREE,.10 3; TEMP1-4096 -- exp=1!-13 
EDTOE 
ONE,11 3; TEMP1-2048 -- exp=11-i12 
E12 


TEMPI,11 ; 
TEMP] 
FLTSFT+11 


exp=11 


TEMP1,10 3 
TEMP} 
FLTSFT+12 


exp=12 


ONE,12 ; TEMP1-8192 -- exp=0 


EO 


TEMP!,9 ; 
TEMP 1 
FLTSFT+13 


exp=13 


CCITT 


0006 
FOOO! 
Foo0o2 
F0003 
FO00O04 
FOO0OS 
FOO006 
FOO07 
FO008 
FO0O009 
FOO10 
FOOI! 
FOOI12 
FOO13 
FOO14 


FOOI15 
FOOL6 
FOO17 
FOOI8 
FOO19 
FOQ20 
FQ021 
FO0022 
FO023 
FO024 
Fo025 
FOO26 


F0027 
F0028 
FO0029 
F0030 
F0031 
F0032 


F0033 
F0034 


F0035 


F0036 


0542 
0543 


0544 
0545 
0546 
0547 
0548 
0549 


O54A 
054B 
054C 
0540 
O54E 
O54F 
0550 
0551 
0552 
0553 
0554 
0555 


0556 
0557 
0558 
0559 
OSSA 
0558 
055C 
0550 
O55E 
OSSF 
0560 
0561 

0562 
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OO04A 
0036 


7F81 
6E00 


7035 
6880 
7F89 
5080 
F400 
0547 


7EO1 
504C 
6A4C 
859B 
TF8E 
7136 
7049 
688 | 
67A0 
004C 
F400 
0551 


4021 
2021 
FAOO 
OSSE 
FFOO 
0020 
F900 
OOFS 
71974 
FFOO 
O04E 
F900 
0184 


COPY 


INIT.ASM 


PC2.1 84.107 


eR EEA EERE RENE ERR SEE OE EE SAR ee Ee ee ee te 


*; SYSTEM INITIALIZATION 


Mg HO ee ee Oe ee 


NOCONS EQU 
PTCONS EQU 
He 

RESET DINT 
LOPK 


SETPAC LARK 
LARP 
ZAC 

SACL 


BANZ 


ZRAMA 


LACK 
SACL 
LT 
MPYK 
PAC 
LARK 
LARK 
LARP 
TBLR 
ADD 
BANZ 


NXCONS 


IN 
LAC 
BLZ 


MULAW 8Z 


ALAW 


74 
54 


ONE 
ONE 
CONS 


1,PTCONS 
0,NOCONS- 1 
l 

*+.0 

ONE 
NXCONS 


TEMP! .CTL 
TEMP 1 
ALAW 

MUL AWX 
MULAWR 


M32767 
ALAWX 


ALAWR 


“- 
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Disable interrupts 


Initialize data page 


Zero iram 


ROM ADDR 
RAM ADDR 


ces 


a WWM WOddv S/Iq4-Ze “LI 


, 
i) 


O1OCESIN 


CCITT 


0007 
Go001 
G0002 
G0003 
G0004 
Go005 
G0006 
GO007 
Go008s 
Gooo09 
GOOIO 
G0011 
G0012 
G0013 
GO0014 
GO015 
G0016 
G0017 
G0018 
GO019 
G0020 
Go021 
G0022 
G0023 
G0024 
Goo025 
G0026 
G0027 
G0028 
G0029 
G0030 
G0031 
G0032 
G0033 
G0034 
G0035 
G0036 
G0037 
G0038 
G0039 
G0040 
G0041 
G0042 
G0043 
G0044 
G0045 
G0046 
G0047 
G0048 
G0049 
G0050 
GO051 
G0052 
G0053 
G0054 
G0055 
G0056 


0563 


0563 
0563 
0564 
0565 
0566 
0567 
0568 
0569 
056A 
056B 
056C 
0560 
OS6E 
OS6F 
0570 
0571 
0572 


0573 
0S74 
0575 
0576 
0577 
0578 
0579 
OSTA 
0578 
0s7C 
057) 
OS7E 
OS7F 
0580 
0581 
0582 


0583 
0583 
0584 
0585 
0586 
0587 
0588 
0589 
OS8A 


0588 
058B 
058C 
0580 
O58E 
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0000 
0001 
0002 
0004 
0008 
0010 
0020 
0040 
0080 
0100 
0200 
0400 
0800 
1000 
2000 
4000 


FEOO 
FFOO 
FF8O 
FFCO 
FFEO 
FFFO 
FFFB 
FFFS 
FFF8 
FFF8 
FFFB 
FFF8 
FFF8 
FFF8 
FFF8 


FFFS 


FF79 
0044 
OOAS 
OOE8 
0110 
014C 
0179 
O1AC 


FFF 4 
0004 
001B 
0032 
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ROMRAM. ASM 


HM NHHHHE RH HHRNHHHHHHHHHH HEHEHE HHH KHHRHREKHHRHRHNRHHEHHRRH HED 


HSH HHHHHHHHEHHHHHHHHEHHEHNEE HHH HEHHHHHNRHERHHKEHRHHHRHRERREO EHO 


log dq to fit dq exponent adjustment 


COPY 

ms 

o 5 ROM 

ROMLOC BSS 0 

*; SHIFT MULT TABLE 

SHF T BSS {8} 
DATA 0 
DATA 1 
DATA 2 
DATA 4 
DATA 8 
DATA 16 
DATA 32 
DATA 64 
DATA 128 
DATA 256 
DATA 512 
DATA 1024 
DATA 2048 
DATA 4096 
DATA 8192 
DATA 16384 

*; DQMAN MASK TABLE 
DATA 65024 
DATA 65280 
DATA 65408 
DATA 65472 
DATA 65504 
DATA 65520 
DATA 65528 
DATA 65528 
DATA 65528 
DATA 65528 
DATA 65528 
DATA 65528 
DATA 65528 
DATA 65528 
DATA 65528 
DATA 65528 

*; INVERSE QUANTIZING TABLE 

IQTAB 8SS Oo 
DATA 65401 
DATA 68 
DATA 165 
DATA 232 
DATA 285 
DATA 332 
DATA 377 
DATA 428 

“; WI TABLE 

WTABLE BSS (a) 
DATA 65524 
DATA 4 
DATA 27 
DATA 50 


CCITT 


G0057 
G0058 
G0059 
G0060 
G0061 
G0062 
G0063 
G0064 
G0065 
G0066 
G0067 
Go06s 
G0069 
G0070 
G0071 
G0072 
G0073 
G0074 
G0075 
G0076 
G0077 
G0078 
G0079 
Gooso 
G0081 
G0082 
G0083 
Go084 
G0085 
G0086 
G0087 
G0088 
G0089 
G0090 
GO09S1 
G0092 
G0093 
G0094 
GO095 
Go096 
G0097 
Go098 
G0099 
GO100 
GO101 
G0102 
GO103 
GO104 
GO105 
G0106 
GO107 
G0 108 
G0109 
GO110 
GO111, 
GO112 
GO113 


OS8F 
0590 
0591 
0592 


0593 
0593 
0593 
0594 
0595 
0596 
0597 
0598 
0599 
O59A 


0598 
0598 
059C 
0590 
O59E 
O59F 
O5A0 
OSA 

O5A2 
O5A3 
O5A4 
O5A5 
O5A6 
OSA7 
O5A8 
Q5A9 
OS5SAA 
OSAB 
O5AC 
O5A0 
OSAE 
OSAF 
05B0 
0581 

05B2 
0583 
0584 
0585 
0586 
0587 
05B8 
0589 
O5SBA 
05B8 
o58C 
0580 
OSBE 
OSBF 
05C0 
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0062 
0088 
0154 
0454 


0000 
0000 
0000 
0010 
0010 
0010 
0030 
0070 


0002 
0004 
0008 
0010 
0020 
0040 
0080 
0100 
0200 
0400 
0800 
1000 
2000 
4000 
FFFF 
FFFE 
FFFC 
OOFF 
8000 
0001 

0800 
FFFF 
0001 
0021 

0220 
0200 
0200 
0200 
0100 
0100 
0800 
0800 
0800 
0800 
0800 
0800 
0800 
0100 


*; Fi 


DATA 
DATA 
DATA 
DATA 
TABLE 


FITABL BSS 


Be 
, 


*; misc constants to be 


CONS 


BSS 

DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


BSS 

DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


PAGE 0046 

98 
184 
340 
1108 
0 
0 
0 
0 
0 
16 
16 
16 
48 
112 

initialized 
0 
2 : lath entry of shift table (Ist f1 a1] 0) 
4 
8 
16 
32 
64 
128 
256 
512 
1024 
2048 
4096 
8192 
16384 
-1 
-2 
-4 
255 3M™255 
32768 3K32768 
1 ;YLH 
2048 ;YLL 
coal | 
1 
33 ; BIAS 
544 sYU 
512 3;PKO 
512 3PK1 
512 ;PK2 
256 3 SRMAN 
256 3;SRIMAN 
2048 3S0Q 
2048 ;SDQ1 
2048 +S0Q2 
2048 3$0Q3 
2048 ;SDQ4 
2048 3S0Q5 
2048 3 S0Q6 
256 ; DQMAN 
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€zs 
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GO114 
G0115 
G0116 
GO117 
G0118 
G0119 
GO120 
GO121 
GO122 
G0123 
G0124 
G0125 
G0O126 
G0127 
G0128 
G0129 
GO0130 
GO131 
G0132 
G0133 
GO0134 
G0135 
G0136 
G0O137 
G0138 
G0139 
GO140 
G0141 
GO142 
G0143 
GO144 
GO145 
G0146 
G0147 
GO148 
G0149 


Oscl 
05C2 
05C3 
05C4 
05C5 
05C6 
05C7 
05C¢8 
05¢c9 
OSCA 
05CB 
o5cc 
OSCD 
OSCE 
OSCF 
0500 
0sD1 

0502 
0503 
0504 
0505 
OS06 
0507 
0508 
0509 
O5DA 
0508 
o5D0C 
0500 
O5DE 
OSOF 
O5€0 
O5E1 

OSE2 
O5E3 
OSE4 
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0100 
0100 
0100 
0100 
0100 
11E0 
3FFE 
co02 
002E 
0031 
0563 
003F 
0000 
0000 
0583 
0220 
1400 
OOOF 
FF80 
007F 
FFCO 
1080 
OF FF 
0000 
7FFF 
FFOO 
0038 
03C0 
004F 
OO5F 
0082 
008A 
0905 
0003 
0000 
0080 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


256 
256 
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;DQ1MAN 
3 DQ2MAN 
; DQ3MAN 
3 DQ4MAN 
; OQSMAN 
3K4576 
3K16382 
3M16382 
3K46 
;K49 

; SHIFT 
;K63 
;F i 

7WI 

; INQTAB 
3K544 
3K5120 
3M15 
;KFF80 
3K127 
3MFFCO 
¢; BIAS*2**7 
344095 
;spare 
3432767 
3KFFOO 
3K56 
3K960 
3K79 
3K95 
3K130 
7K138 
3K2309 
3 THREE 
sspare 
;M0080 


03-20-85 


CCITT 


GOI51 
G0152 
GO153 
G0154 
G0155 
GO156 
GO157 
G0158 
G0159 
G0160 
GO161 
G0162 
G0163 
G0164 
G0165 
G0166 
GO167 
G0168 
G0O169 
GO170 
GO171 
GO172 
G0173 
GO174 
GO175 
G01 76 
GO177 
G0178 
G0179 
G0180 
G0181 
G0182 
G0183 
G0184 
GO0185 
GO 186 
G0187 
G0 188 
G0189 
G0190 
G0191 
G0192 
G0193 
G0194 
GO0195 
G0196 
GO197 
G0 198 
GO199 
GO200 
GO020! 
GO0202 
G0203 
G0204 
G0205 
GO0206 
G0207 


OS5SE5 
0000 


0000 


0001 


0002 


0003 


0004 


0005 


0006 


0007 


0008 


0009 


QOOA 


0008 


oooc 


000D 


OOOE 


OOOF 


0010 
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0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 


0000 
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Twa RRR RE RRR E ERE RERERE RE RRR RRR RRR RRR RRR RR RRR ERR RRR RRS 
, 


TPR CRORE REAR EC ER ERE REREERERRSRRE RSE ERER SERRE RRR ERR ERR RRR ERE ES) 
’ 


Bis RAM 
RAMLOC BSS 0 
DORG 0 


N 


on 


wee ee Ze ee Eee we Fm we we TY we wo FT we we fT we we Ewe we 
C vu 


SO he ee On ee a eee ee ea ee eS eS 
wee ee Li) wo we A we we me we we 


ae Ove oe Dias oe ae ae 


RAM Location # 


DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA ie) 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


000 


001 


002 


003 


004 


005 


006 


007 


008 


009 


010 


O12 


013 


015 


016 


- 


spare 
32Kb output 

8-level version of | 
estimate 


signal 


partial signal estimate 


unlimited speed contro! 


parm 


limited speed control 


parm 


short term average of F 


long term average of F 


quantizer scale factor 


6th order predictor coefficient 


6th order predictor coefficient 


6th order predictor coefficient 


6th order predictor coefficient 


6th order predictor coefficftent 


6th order predictor coefficient 


quantized diff signal 


7s 


OLOTESWLL 2 YM WOdAV S/QH-ZE “LI 


CCITT 


GO0208, 
G0209 
G0210 
G0211 
GO212 
G0213 
G0214 
G0215 
G0216 
G0217 
G0218 
G0219 
G0220 
GO022} 
G0222 


'G0223 


G0224 
G0225 
G0226 
G0227 
G0228 
G0229 
G0230 
G023} 
G0232 
G0233 
G0234 
G0235 
G0236 
G0237 
G0238 
GO0239 
G0240 
G0241 
G0242 
G0243 
G0244 
GO0245 
G0246 
G0247 
G0248 
G0249 
GO0250 
G0251 
GO252 
G0253 
GO0254 
G0255 
G0256 
GO0257 
G0258. 
G0259 
GO0260 
GO0261 
GO0262 
G0263 
GO264 


0011 


0012 


0013 


0014 


0015 


0016 


0017 


0018 


0019 


COIA 


001B 


OoICc 


0010 


OOIE 


OOIF 


0020 


0021 


0022 
0022 


0023 


32010 FAMILY MACRO ASSEMBLER 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


00006 


0000 


0000 


0000 


0000 


0000 


0000 


*; RAM Location # 017 


A DATA 0 
cd 
*; RAM Location 
A2 DATA 0 
* 

? 
*; RAM Location 
SR DATA 0 
Re 
*; RAM Location 
SR1 DATA 0 


*; RAM Location 
DQEXP DATA 0 
Re 


*; RAM Location 
DQIEXP DATA 0 
Re 


*; RAM Location 
DQZ2EXP DATA 0 
Re 

*; RAM Location 
DQ3EXP DATA 0 
Re 

*; RAM Location 
DQ4EXP DATA 0 
es 

*; RAM Location 
DQ5EXP DATA 0 
He : 


*; RAM Location 
SCRACH DATA 0 
He 
*; RAM Location 
SREXP DATA {8} 
Re 
*; RAM Location 
SRIEXP DATA (8) 
Re 

* 
*; RAM Location 
SUM4 DATA is) 
He 
*; RAM Location 
SUM5 DATA 0 


baie RAM Location 
SUM6 DATA 0 


*; RAM Location 
TEMP1 DATA 0 


He 
*"; RAM Location 
SUM1 BSS ie} 
TEMP2 DATA 0 
Me 

*; RAM Location 
SUM2 BSS 0 


018 


019 


020 


021 


022 


023 


024 


025 


026 


027 


028 


029 


030 


031 


032 


033 


034 


035 


we 


~ 


Po 


Pry 


we 


“ 


ary 
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coefficients of 2nd order predictor 
coefficients of 2nd order predictor 
reconstructed signal frame k 
reconstructed signal frame k 
exponent of 0Q 

exponent of DQ! 

exp of DQ2 
exp of DQ3 
exp of DQ4 
exp of DQS5 
scrach variable 
exp of SR 

exp of SRI 
temp 

temp 

temp 


temp 


temp 
temp 


temp 


CCITT 


GO265 
G0266 
G0267 
G0268 
G0269 
G0270 
G0271 
GO0272 
G0273 
GO274 
G0275 
G0276 
G0277 
G0278 
G0279 
GO0280 
G0281 
G0282 
G0283 
G0284 
G0285 
GO0286 
G0287 
G0288 
G0289 
G0290 
G0291 
G0292 
G0293 
G0294 
GO295 
G0296 
G0297 
G0298 
G0299 
G0300 
GO301 
GO0302 
G0303 
GO0304 
G0305 
G0306 
G0307 
G0308 
G0309 
GO310 
GO311 
GO312 
GO313 
G0314 
GO315 
G0316 
G0317 
G0318 
GO319 
G0320 
G0321 


0023 


0024 


0025 


0026 


0027 


0028 


0029 


002A 


0028 


002C 


0020 


002E 


002F 


0030 


0031 


0032 


0033 


0034 


0035 


32010 FAMILY MACRO ASSEMBLER 


0000 


0000 


0000 


leleteze) 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


TEMP3 DATA 0 


te 
’ 


ws RAM Location 
TEMP4 DATA 0 
Re 

wie RAM Location 
SUM3 DATA 0 


me 
*; RAM Location 
SAMPLE DATA 0 


me 
, 


*; RAM Location 
SUM7 DATA i¢) 
Me 


*; RAM Location 
SUM8 DATA 0 


He 

"3; RAM Location 

YOVER4 DATA 0 

Me 

"3; RAM Location 
DATA 0 


RAM Location 
DATA 0 


ms RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


a RAM Location 
DATA 0 


*% RAM Location 
DATA 0 


we RAM Location 
DATA 0 


Wis RAM Location 
DATA 0 


at RAM Location 
DATA 0 


zs 


3; RAM Location 
DATA 0 


=z 8 
we we 


RAM Location 


036 


037 


038 


039 


040 


041 


042 


043 


044 


045 


046 


047 


048 


049 


050 


051 


052 


053 


054 


Pt 


PY 


Po 


oo 


oe 


- 
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temp 

temp 

temp 


Linear sample 


tap 


temp storage of SR1!*A1 


temp storage of SR2"A2 tap 


Y>>2 


first location of shift table 


OLOTESWL OU) DIM WOddv S/NQ4-Ze “LT 


Ses 


CCITT 


GO0322 
G0323 
G0324 
GO325 
G0326 
G0327 
G0328 
G0329 
G0330 
G0331 
G0332 
G0333 
G0334 
G0335 
G0336 
G0337 
G0338 
G0339 
G0340 
G0341 
G0342 
G0343 
G0344 
G0345 
GO0346 
G0347 
G0348 
G0349 
G0350 
G0351 
G0352 
G0353 
G0354 
GO355 
G0356 
G0357 
G0358 
G0359 
G0360 
G0361 
G0362 
G0363 
GO0364 
G0365 
G0366 
G0367 
G0368 
G0369 
G0370 
G0371 
G0372 
G0373 
G0374 
G0375 
GO0376 
G0377 
G0378 


0036 


0037 


0038 


0039 


003A 


0038 


003C 


003D 


0O3E 


003F 


0040 


0041 


0042 


0043 


0044 


0045 


0046 


0047 


0048 


32010 FAMILY MACRO ASSEMBLER 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


@. 
9 


DATA 0 


RAM Location 
DATA 0 


RAM Location 


EIGHT OATA 0 


He 
’ 


M255 


Be 
’ 
ee 

, 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA .¢) 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 6) 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 
DATA 0 


RAM Location 


K32768 DATA 0 


He 
, 


* 


RAM Location 


055 


056 


057 


058 


059 


060 


061 


062 


063 


064 


065 


066 


067 


068 


069 


070 


O71 


072 


073 


16:36:03 03-20-85 
PAGE 0051 


PC2.1 84.107 


last loc of table (42-70) 


sign bit 


CCITT 


G0379 
G0380 
G0381 
G0382 
G0383 
G0384 
GO385 
G0386 
G0387 
G0388 
G0389 
G0390 
G0391 
G0392 
G0393 
G0394 
G0395 
G0396 
G0397 
G0398 
G0399 
G0400 
G0401 
G0402 
G0403 
G0404 
G0405 
GO0406 
G0407 
G0408 
Go409 
G0410 
G0411 
GO0412 
G0413 
G0414 
GO415 
G0416 
GO0417 
G0418 
G0419 
G0420 
GO0421 
G0422 
G0423 
G0424 
G0425 
G0426 
G0427 
G0428 
G0429 
G0430 
G0431 
G0432 
G0433 
G0434 
G0435 


0049 
004A 
004B 
004Cc 
0040 
OO4E 
O0O4F 
0050 
0051 
0052 
0053 
0054 
0055 
0056 
0057 
0058 
0059 
OO5A 


0058 


32010 FAMILY MACRO ASSEMBLER 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


6000 


0000 


0000 


0000 


0000 


0000 


0000 


o0co 


0000 


0000 


0000 


0000 


YLH 
*3 

*; RAM Location 
YELL OATA 0 


*; RAM Location 
MINUS DATA 0 


DATA 0 


; 
*; RAM Location 
ONE DATA 0 
@e 
*; RAM Location 
BIAS DATA 0 


Be 
1d 


*; RAM Location 
YU DATA 0 


ee 
° 


*; RAM Location 
PKO DATA 0 


ee 
, 


*; RAM Location 
PK1 DATA 0 


He 
*; RAM Location 
PK2 DATA 0 


Be 
° 


*; RAM Location 
SRMAN DATA 0 


*; RAM Location 
SR1IMAN DATA 0 


Be 
Rs RAM Location 
SDQ DATA 0 
Be 

*; RAM Location 
SDQ} DATA 0 
Be 

*; RAM Location 
SDQ2 DATA 0 
Me 

as RAM Location 
SDQ3 DATA 0 
He 

*; RAM Location 
$0Q4 DATA 0 
He 

*; RAM Location 
$0Q5 DATA 0 


He 
, 


*; RAM Location 
S0Q6 DATA ie) 
Me 

*; RAM Location 
DQMAN DATA ie) 


Re 
, 


*; RAM Location 


074 


075 


076 


077 


078 


079 


080 


081 


082 


083 


084 


085 


086 


087 


088 


089 


030 


091 


092 


7. 


aT 


Pu 
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fast quant scale factor (hi word) 


slow quant scale factor (lo word) 


constant for mulaw conversions 


fast quant scate factor 


sign of p(k) 


sign of p(k-1) 


sign of p(k-2) 


mantissa of SR 


mantissa of SR! 


sign DQ(k) 


sign DQ(k-1) 


sign DQ(k-2) 


sign DQ(k-3) 


sign DQ(k-4) 


sign DQ(k-5) 


sign 0Q(k-6) 


mantissa of DQ 


97S 


OLOCESWL 94) UM WOddv S/Qy-ce “LI 


CCITT 


GO0436 
G0437 
G0438 
G0439 
G0440 
G044! 
G0442 
G0443 
G0444 
GO445 
G0446 
G0447 
G0448 
G0449 
G0450 
G0451 
G0452 
G0453 
G0454 
GO0455 
G0456 
GO457 
G0458 
G0459 
GO0460 
G0461 
G0462 
G0463 
G0464 
G0465 
G0 466 
G0 467 
G0468 
G0469 
G0470 
G0471 
G0472 
G0473 
G0474 
GO0475 
G0476 
G0477 
G0478 
G0479 
G0480 
G048 1 
G0482 
G0483 
G0484 
G0485 
G0 486 
G0487 
G0488 
G0489 
G0490 
G0491 
G0492 


oosc 


0050 


OO5E 


OO5F 


0060 


0061 


0062 


0063 


0064 


0065 


0066 


0067 


0068 


0069 


OO6A 


006B 


006C 


0060 


OO6E 


32010 FAMILY MACRO ASSEMBLER 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


DQIMAN DATA 0 
He 
*; RAM Location 
DQ2MAN DATA 0 
Be 

? 
*; RAM Location 
DQ3MAN DATA 0 
Me 
*; RAM Location 
DQ4MAN DATA 0 


He 
? 


*; RAM Location 
OQ5MAN DATA 0 
Me 


*; RAM Location 
K4576 DATA 0 


*; RAM Location 
K16382 DATA 0 


Me 
, 


*; RAM Location 
M16382 DATA 0 
He 

*; RAM Location 
K46 DATA ie} 


ww. 
*; RAM Location 
K49 OATA 0 


He 
’ 


*; RAM Location 
SHIFT DATA 0 


He 
’ 


*; RAM Location 
K63 DATA 0 


me. 
*; RAM Location 
FI DATA 0 


we 
, 


*; RAM Location 
Wl DATA 0 
*s 

*; RAM Location 
INQTAB DATA 0 


*; RAM Location 
K544 DATA 0 


. 
*; RAM Location 
K5120 DATA 9] 


*; RAM Location 
M15 DATA 0 
*; 

*: RAM Location 
KFF80 DATA 0 


Me 
’ 


*; RAM Location 


093 


094 


095 


096 


097 


098 


099 


100 


101 


102 


103 


104 


105 


106 


107 


108 


109 


110 


1t1 


~~ 


ar 


Po 


Po 


« 


“ 


Pry 
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mantissa of DQ! 


mantissa of DQ2 


mantissa of DQ3 


mantissa of DQ4 


mantissa of DQS 


4576 


+16382 


- 16382 


46 


49 


SHIFT table address 


63 
F!l value 
WI value 


Inverse quan table address 


544 


5120 


15 


>FF8O 


CCITT 


G0493 
G0494 
G0495 
G0496 
G0497 
G0498 
G0499 0071 
G0500 
GO501 
GO0502 
G0503 
G0504 
GO505 
GO0506 
G0507 
G0508 
GO0509 
G0510 
GO511 
G0512 
G0513 
GO514 
GO515 
GO516 
GO517 
G0518 
G0519 
G0520 
G0521 
G0522 
G0523 
G0524 
G0525 
G0S26 
G0S27 
G0528 
GO0529 
G0530 
G0531 
GO0532 
G0533 
G0534 
GO0535 
G0536 
G0537 
G0538 
G0539 
G0540 
G0541 
G0542 
NO ERRORS, 


OO6F 


0070 


0072 


0073 


0074 


0075 


0076 


0077 


0078 


0079 


OO7A 


0078 


007C 


0070 


OO7E 


OO7F 


32010 FAMILY MACRO ASSEMBLER 


0000 M127 DATA 
Me 
*; RAM Location 
0000 MFFCO DATA 
Me 
*; RAM Location 
0000 BIASA DATA 
He 
*; RAM Location 
0000 M4095 DATA 
He 
*; RAM Location 
0000 DATA 
*; RAM Location 
0000 M32767 DATA 
Re 
*; RAM Location 
0000 KFFOO DATA 
Me 
*; RAM Location 
0000 K56 DATA 
He 
ee RAM Location 
0000 K960 DATA 
He 
*; RAM Location 
0000 K79 DATA 
Me 
’ 
*; RAM Location 
0000 K95 DATA 
Re 
, 
*; RAM Location 
0000 KI30 DATA 
He 
, 
*; RAM Location 
0000 K138 DATA 
te 
’ 
*; RAM Location 
0000 K2309 DATA 
Re 
*3; RAM Location 
0000 THREE DATA 
Re 
23 RAM Location 
0000 DATA 
Re 
*; RAM Location 
0000 MmOO8O0 DATA 


. 
’ 


NO WARNINGS 


120 


121 


122 


123 


124 


125 


126 


127 


Pry 


Pry 


7 


we 


Pry 


“ 


o 


~ 


Po 


16:36:03 03-20-85 
PAGE 0054 


PC2.1 84.107 


127 


~64 


33*128 


4095 


spore 


32767 


>FFOO 


56 


960 


constants used for quantizing table 


constants used for quantizing table 


constants used for quantizing table 


constants used for quantizing table 


constants used for quantizing table 


OLOZESWL 943 YUM WOddV S/Qq-Z€E “LI 


Les 


32010 FAMILY MACRO ASSEMBLER 
REFERENCES 


VALUE 


Ooi 
0426 
0012 
0001 
0383 
035C 
0361 
O1A8 
036C 
0006 
O55E 
0184 
004E 
035E 
O19F 
OLAF 
0005 
03B6 
O2AA 
OOOA 
0008 
oooc 
0000 
OOOE 
OOOF 
004D 


0071 
0288 
0285 
02B2 
02C3 
0201 
O2CE 
02DC 
O2EA 
O2E7 
O2ED 
O2FC 
0002 
O30E 
030B° 
01C4 
O25F 
O1DE 
0279 
O1F8 
0212 
022C 
0246 
0386 
032C 
0329 
0335 
033E 
0341 


DEFN 


G0209 
00520 
GO0212 
A0031 
00336 
00085 
DO413 
A0566 
00104 
GO176 
F0033 
A051} 
A0172 
00090 
A0559 
A0572 
GO173 
00345 
ca024 
G0 188 
GO191 
G0194 
G0197 
G0200 
G0203 
G0391 


G0499 
C0075 
C0073 
coo7}1 
C0083 
C0093 
coo9t 
co10l 
Co111 
c0109 
C0113 
co125 
A0033 
C0139 
C0137 
BOo068 
BO0245 
B0091 
B0268 
B0114 
BO137 
BO160 
BO183 
00186 
C0187 
C0185 
C0193 
C0199 
co0201 


B0252 


B0230 
A0056 
00328 


A0562 


B0321 
FO0030 
A0512 
A0173 


A0557 
A0564 
BO0322 
00335 
A0099 
BO167 
BO144 
BO121 
B0098 
BOO075 
80053 
A0082 
A0320 
A0488 
A0180 


C0074 


co072 
c0092 


c0070 


c0112 
A0101 


co110 
E0007 
E0031 
E00}! 
E0035 
E0015 
E0019 
£0023 
E0027 
00183 


C0186 
C0184 


BO270 


80245 
A0126 


BO0327 


FO0035 
F0034 


B0325 


A0168 
BO0183 
BO160 
B0137 
BO114 
BO0091 
BO0068 
A0089 
A0334 


A0184 
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0043! 00456 00458 D0464 00515 00523 


00486 


BO0361 


00323 


A035! 
00396 
00389 
D0382 
00375 
00368 
0036! 
A0253 
A0346 


A0188 


00493 00496 00508 00513 


00324 


A0504 
D0398 
00391 
D0384 
00377 
00370 
00363 
A0264 
A0426 


AQ192 


A0170 A0210 A0364 


DO0325 


DO0402 
DO0395 
00388 
00381 
00374 
00367 
AG272 
A0436 


A0Q196 


00336 00338 


A0282 AO0290 A0302 AN310 
A0444 A0456 A0464 A0474 


A0200 A0204 


CCITT 
LABEL 


C16TO7 
CLNUP 
CLNUPA 
CMPRSA 
CMPRSU 
CONS 
CTL 
DOTO!L 
DOTO3 
00TO7 
02T03 
04T05 
04T07 
D6TO7 
D8TO9 
D8TOB 
O8TOF 
DAC 
DATOB 
OCTOD 
OCTOF 
DE TOF 
DML 
DMS 
DO32KA 
BO32KU 
DONEC 
DONEF 
DQ 
DQIEXP 
DQ1MAN 
DQ2EXP 
DQ2MAN 
DQ3EXP 
DQ3MAN 
DQ4EXP 
DQ4MAN 
DQS5EXP 
DQSMAN 
DQEXP 
DQMAN 
£0 
EOTO! 
EO0TO3 
EOTO? 
El 


32010 FAMILY MACRO ASSEMBLER PC2.1 84.107 16:36:03 


VALUE 


034A 
OOEA 
0176 
0108 
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PREFACE 


A full-duplex 16-kbit/s subband coder can be implemented using a single TMS32010 
(without external program memory) with a 1K-word off-chip RAM data buffer. The 
compression algorithm achieves toll-quality speech with a DRT (Diagnostic Rhyme Test) 
score of 94 percent. Because of the compactness of the subband coding, it only requires 
80 percent of the TMS32010 CPU utilization, thus allowing the processor to perform 
other tasks in addition to speech coding. A simplified block diagram of the coder is 
shown in the following figure: 


INTERFACE SPEECH 


INTERFACE 


Simplified Block Diagram of the 16-kbit/s Subband Coder 


T™TMS32010 


The subband coder can be used in many applications, such as telecommunications and 
computers. In a transcoder design for digital telephony, the subband coder is valuable 
because of the following features: relative insensitivity to transmission errors, small 
bandwidth requirement, toll-quality speech, and simple, economical hardware 
configuration. In a voice-mail system, speech data rate directly corresponds to the storage 
required in the memory. The 16-kbit/s subband coder provides an efficient tradeoff 
between speech quality and data rate. In addition, the use of a single TMS32010 coder 
allows a cost-effective system implementation. 


The following reprinted article details the theory and realization of the subband coder. 
The TMS32010 source code and documentation of the 16-kbit/s coder is available for 
licensing from Atlanta Signal Processors Incorporated. For more information, contact: 


Atlanta Signal Processors Incorporated 
770 Spring Street, Suite 208 

Atlanta, Georgia 30308 

(404) 892-7265 
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INTRODUCTION 


Over the past several years, the Subband Coder has taken 
its place as one of the highest quality and most cost effective 
approaches to the coding of speech at medium bit rates. Table 
| illustrates the relationship of the subband coder to a number 
of other popular approaches to speech coding. 
Fundamentally, of course, any coding operation must distort 
the coded signal in some way. The goal of a good coding 
technique is to constrain the distortion so as to have minimum 
impact on the overall system performance. In most speech 
communications systems and voice response systems, this 
translates into minimizing the perceived differences between 
the coded and uncoded speech signals. 

Modern speech coding systems typically use three classes 
of features in order to minimize the perceived coding 
distortion: the characteristics of the auditory system; the 
characteristics of the vocal tract; and the characteristics of 
language and individual talkers. Of these, only the first two 
can be effectively used in medium bit rate systems such as 
subband coders. Typically, three separate characteristics of 
the auditory system can be used to advantage. These are (1) 
the aural noise masking effect, (2) the frequency variant 
sensitivity of auditory perception (critical bands), and (3) the 
relative phase insensitivity of the ear (within critical bands). 
Likewise, a vocal tract model can either be used in a long 
term statistical sense, or in a short-time stationary time- 
varying model. If the short-time stationary model is used, 
then it, in turn, may be applied in three ways: (1) by using 
syllabic energy variations; (2) by using an explicit slowly 
time-varying vocal tract model; or (3) by using the pseudo- 
periodicity (pitch) in voiced sounds. 

The simplest common speech coding technique, linear 
Pulse Code Modulation (PCM), makes essentially no 
assumptions (except as to dynamic range) about the 
characteristics of the signal being coded or the eventual use 
of the coded signal. As a result, linear PCM systems require 
the greatest bit rate to generate toll quality speech. Such 
systems have the advantage, however, that since they show 
no preference toward any particular class of signals, they 
may be used to code signals which are neither spoken nor 
heard (such as data signals). Companded Pulse Code 
Modulation systems, such as the 64 Kbps mu-law companded 
system used in telephone switching networks, make direct 
use of the ear’s noise masking characteristics in its most basic 
form. In the resulting systems, the noise is correlated with 
the signal, which is good, but noise also spreads throughout 
the frequency range with no regard to the presence of signal 
energy, which is bad. Companded PCM systems are 
extremely simple and inexpensive to implement, but the rate 
at which they must operate to achieve toll quality speech is 
still relatively high. 

Systems such as Differential Pulse Code Modulation 
(DPCM) and Delta Modulation (DM) make direct use of a 


18. A Real Time Speech Subband Coder Using the TMS32010 


long-term stationary statistical model for speech production. 
Adaptive Differential Pulse Code Modulation (ADPCM) and 
Adaptive Delta Modulation (ADM) also make use of the 
slowly varying nature of the short-time energy, causing the 
noise to be heavily correlated with the speech signal, and 
causing a dramatic drop in the idle channel noise. Various 
forms of both the Adaptive Transform Coder (ATC) and the 
Adaptive Predictive Coder (sometimes called the residual 
excited vocoder) make use of all of the auditory and vocal 
tract features, and such systems are capable of generating 
excellent quality speech at medium to low bit rates. However, 
such systems are also generally both very complex and quite 
sensitive to both background noise and transmission errors. 
Pitch-excited vocoders in general, and LPC vocoders in 
particular, are capable of operating at low to very low bit 
rates, but they generally can never achieve toll quality and 
do not perform well on either noisy speech or non-speech 
signals. 

With this perspective, it is easy to see the advantages 
and disadvantages of the subband coder as compared to other 
speech coding systems. Since the subband coder uses only 
the characteristics of the auditory system in reducing the 
perceived distortion, it is capable of coding both speech and 
non-speech signals with good perceived fidelity. Similarly, 
subband coders can handle multiple simultaneous voices with 
no increase in degradation. In addition, since subband coders 
do not include the use of a time-varying vocal tract model, 
they are far less complex and far less sensitive to transmission 
bit errors than APC, ATC, or pitch-excited vocoders. This 
makes the subband coder a very good choice for real 
communications and telephony systems where the 
background noise is not controlled and where high 
transmission error rates might occasionally be expected. 
Likewise, the low complexity of subband coder realizations 
make them well matched to implementations on such signal 
processing micro-computers as the Texas Instruments 
TMS320 and the NEC 7720. The obvious applications for 
subband coders for speech include mobile radio, mobile 
telephony, and voice switching networks. 


SUBBAND CODER THEORY 


The concept of subband coding for speech was first 
introduced by Crochiere et al. [1-2]. In a subband coding 
system, the speech signal is divided into bands by a filter 
bank (Fig. 1), and then the outputs of the individual bands 
are first decimated, typically at their nominal Nyquist rate, 
and then coded for transmission. At the receiver, the coded 
signals from the individual bands are first decoded, and then 
padded with zeros and passed through a set of interpolation 
filters before being summed to form an estimate of the input 
signal. Such systems can be thought of as exhibiting two 
classes of distortion: analysis-reconstruction distortion, which 
is defined as the distortion which. would be present if the 


535 


subband signals were not coded; and coding distortion, which 
is the distortion due directly to the coding operations 
themselves. The analysis-reconstruction system alone can 
exhibit three separate types of distortion: interband aliasing 
distortion; frequency domain magnitude distortion (which 
will be called ‘‘frequency distortion’’); and phase distortion. 
In the original subband coding systems, infinite impulse 
response (IIR) filters were used, and the resulting coding 
systems exhibited all four types of distortion: interband 
aliasing; frequency distortion; phase distortion; and coding 
distortion. In later work, Esteban et al. [3] introduced the 
concept of quadrature mirror filters, which can be used to 
realize two-band analysis-reconstruction systems which 
exhibit no interband aliasing. Further, Esteban showed that 
if equal length linear phase filters are used for the band 
splitting, then the overall analysis-reconstruction transfer 
function is a linear phase. Hence, quadrature mirror 
filters allow for analysis-reconstruction systems which exhibit 
only frequency distortion. 

Based on these results, Esteban et al. [4] and Crochiere 
et al. [5-6] developed subband speech coding systems based 
on octave band tree structures of quadrature mirror filters. 
The octave band structure has the advantage that it can 
simultaneously approximate the critical band structure of the 
ear while also utilizing quadrature mirror filters. Such 
systems exhibit an overall system response which is linear 
phase, and as such they can be used to code data signals as 
well as speech signals. The filters used by Crochiere et al. 
[5] were designed by Johnston [7] using an iterative approach 
which sought to minimize the frequency distortion in the 
analysis-reconstruction system. Another characteristic of 
these later subband coders is that the band-splitting filters 
were realized using a two-band polyphase filter bank [8] to 
improve the computational efficiency. In very recent work, 
Smith and Barnwell [9-10] have presented a technique for 
designing tree-structured analysis-reconstruction filter banks 
which allow for exact reconstruction. Such systems, which 
allow for the design of filter banks of arbitrary quality, are 
very attractive since they exhibit only coding distortions. 
However, they sometimes require slightly more computation 
since they cannot always use the polyphase structure directly 
in implementing the filter banks. 

The octave band subband coders used in speech coding 
applications are typically formed as a tree structure of 
analysis-reconstruction pairs based on quadrature mirror or 
exact reconstruction filters. Fig. 2(a) shows the block 
diagram for such a two-band system. For the lower channel, 
the input signal, x(n), is first passed through a half-band low- 
pass filter with impulse response ho(n), and then decimated 
at a 2-to-1 rate. For the upper channel, the signal is passed 
through a half-band high-pass filter with the impulse response 
hy(n), followed by another 2-to-1 decimator. The output of 
these two channels, yo(n) and y(n), are the two signals 
which would be coded in a two-band subband coding system. 
For the reconstruction, these two signals are padded with 
zeros at a 1-to-2 rate, passed through the two interpolation 
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filters with impulse responses goQ(n) and g4(n), respectively, 
and the outputs are summed to form the estimation of the 
original input, x(n). 

Let us define the transfer functions of the two analysis 
filters as 


co 
Ho(z) = 2% ho(n)z—" 
n= — © (la) 
[oe] 
Hj(z) = 2% hy(njz—9 
n= — 0 (1b) 


and the Fourier transform of these two transfer functions by 


foe) 
Holw] = Hoe!) = £ ho(nje J 
n= — oo (2a) 
° oad . 
Hi[w] = Hy(e!®) = © hy(nje J” 
n= — oo (2b) 


Then, for quadrature mirror filters, the high-pass filter 
transfer function can be defined in terms of the low-pass filter 
by 

H(z) = Ho(—z) (3) 
or equivalently by 


Hj[w] = Holw—-7] (4) 


In comparison, the Smith-Barnwell exact reconstruction 
filters are related by 


Hi(z) = Ho(- 1/2) (5) 

or equivalently by 
Hj{w} = Ho[-—w-7z] (6) 
For quadrature mirror filters, this leads to the relationship 
hy(n) = (— 1)Pho(n) (7) 
and for the Smith-Barnwell filters, the relationship becomes 
hy(n) = (—1)8ho(—n) (8) 
Fig. 2(b) gives the equivalent polyphase filter structure 
[8] for the two-band analysis-reconstruction system of Fig. 
2(a). For the quadrature mirror filters, the transfer function 


for the low-pass filter can be expressed as 


Ho(z) = Po(z2) +P (z2)z—! (9) 
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and the polyphase filters can be defined as 


Poin) = ho(2n) (10a) 


Pp, (n) ho(2n+ 1) (10b) 
This is the well known polyphase result that the impulse 
response of the polyphase filters for a two-band system is 
formed by taking every second sample of the impulse 
response of the prototype low-pass filter. The polyphase filter 
structure reduces the required number of multiplies by a 
factor of two over the filter structure of Fig. 2(a). The 
polyphase filter structure can also be used with Smith- 
Barnwell exact reconstruction filters under the condition that 


ho(n) = ho(—n) (11) 


or, equivalently, hg(n) must be a zero (or equivalently, a 
linear) phase filter. Although this condition can sometimes 
‘be met for Smith-Barnwell filters, the filter design problem 
for this special case is not well understood, and no 
realizations of this type have yet been demonstrated [9-10]. 
For those cases which are well understood (including the ones 
discussed below), this condition is not met and polyphase 
realizations are not possible. 

In the traditional quadrature mirror reconstruction, the 
reconstruction filters can be defined in terms of their 
polyphase filters as 


a(n) = p,(n) (12a) 


II 


qj) = Poin) (12b) 


From Fig. 2(c), it is obvious that this leads to an overall 
system transfer function given by 


X(z) = C(z)X(z) = Po(z2)P}(z2)X(z) (13) 


What this shows is that for the case of quadrature mirror - 


filters, the entire analysis-reconstruction system behaves like 
a linear filter. 

For the Smith-Barnwell exact reconstruction filters, the 
equivalent characterization of the reconstruction system is 
given by 


Go(z) = Ho(l/z) = H1(—z) (14a) 


Gi(z) = Hj(1/z) = Ho(—z) (14b) 
This, of course, leads to exact reconstruction, or 
equivalently, a linear system C(z) which is distortionless. 
In particular, if the analysis and reconstruction filters chosen 
are both length L FIR filters, then C(z) is an ideal delay given 
by 


C(z) = z—2Lb (15) 
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This condition is also approximately true for the Johnston 
quadrature mirror filters. The primary point here in terms 
of the realization of subband coding algorithms is that, even 
under the best of conditions, a two-band analysis- 
reconstruction system still behaves as an ideal delay. This 
implies the need for additional storage for delay memory in 
tree structured realizations. 


TREE STRUCTURED SUBBAND CODERS 


Two-band subband coders based on the two-band 
analysis-reconstruction systems of the type we have been 
discussing are of relatively little interest for speech coding 
themselves, since they offer little gain over full-band 
ADPCM. However, such two-band systems can be combined 
in tree structures to produce effective coding structures for 
speech. This tree-structuring procedure has the advantage 
that it can take direct advantage of the properties of half- 
band quadrature mirror filters and Smith-Barnwell filters in 
filter structures more appropriate to the critical band nature 
of aural perception. 

The basic structure of an octave band subband coder for 
speech is illustrated in Fig. 3 where the band-splitters and 
band-mergers are typically implemented using the polyphase 
forms shown in Fig. 4. A key issue in the octave band 
structure is the compensation filters for bands which have 
not been split. It has been shown [11] that for both the 
quadrature mirror filters and for Smith-Barnwell filters, the 
compensation filters can be adequately realized using ideal 
delays. The basic problem is that in a tree structured 
environment, these delays ‘‘stack’’, and the required delay 
memory size can become quite large. In particular, if there 
are N band-splitters in an octave band subband coder, then 
the compensation delay required by the qth band is given by 


Cq(z) = z-4@) (16) 


where d(q) is given by 


N-~q 
d(q) = © L(p+q)2?P 
p=1 (17) 


and where L(p) is the length of the FIR analysis and 
reconstruction filters in the pth band-splitting operation. This 
delay storage requirement can be quite large if a large number 
of octave bands is used. This translates into a total delay 
storage requirement for the entire system of 


N 


D = © L(p)2P 
p=1 (18) 
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TMS320 REALIZATIONS 


In designing subband coder implementations for a fast 
signal processing microprocessor such as the TMS320, it is 
important to determine exactly which of the microcomputer 
resources (data memory, program memory, or real-time) 
constrains the form of the implementations. Since the 
TMS320 has a relatively large program address space (4K 
sixteen bit words) and since the subband coder algorithms 
are quite compactly programmed, it is clear that program 
address space is not an issue of concern. However, this is 
certainly not true of data memory, which is in rather short 
supply (144 words total), and real-time might also be an 
issue. A program for a subband coder can be functionally 
divided into five parts: input-output operations; analysis and 
reconstruction filter implementations; delay compensation 
implementation; channel quantization implementation; and 
control. 

The main loop in a subband coder program must operate 
on frames of input samples where the frame length is such 
as to result in an integer number of data points being coded 
by the deepest branch coder in the tree structure. In order 
to compute the number of cycles per sample required by the 
subband coder main loop, it is hence correct to compute the 
number of cycles required by the entire main loop and then 
to divide by the frame length. To compute the actual number 
of samples required by the entire realization, the number of 
cycles per sample which are required by the interrupt based 
I/O buffering routines must also be included. 

The minimum number of sample points which is required 
for a data frame in order to meet the above constraint is given 
by 


M = 2N (19) 


where, as before, N is the number of band-splitters in the 
coder structure. In an interrupt driven implementation, at 
least M data locations must be available for I/O buffering. 
In addition, the total number of data locations necessary to 
support the filter and delay structure is given by 


N 
K = © L(p)2P 
p=1 (20) 


This includes both the delay storage needed in the filter 
implementations and the delay storage needed for the 
compensation delays. In addition to this storage, our current 
implementations use two storage locations per APCM coder, 
two storage locations for the interrupt routine, and four 
temporary storage locations for various utility operations. 
Hence, the total number of storage locations required can 
be approximated as 


NN = M+K+2N+6 (21) 


538 


Table 2 gives a tabulation of NN versus N for a ‘‘typical’’ 
FIR filter structure given by  L(1)=32, 
L(2) = L(3) = LA) = 16, and L(5) = 8 for both full duplex and 
half duplex realizations. Since the TMS320 has only 144 data 
storage locations, clearly all implementations which use more 
than 144 locations must be realized through the addition of 
a read and write capable off-chip random access memory. 
This, of course, adds considerably to the cost of the 
implementation. In addition, this ‘‘external program’” RAM 
must be accessed using the Table Read and Table Write 
TMS320 operations, which require three cycles per access 
and which also require the use of the arithmetic accumulator. 
Hence, this memory can only be accessed at a comparatively 
low rate. Table 2 shows that a three band (two band-splitters) 
system is the largest subband coder which can be realized 
in full duplex without external program RAM, and a four 
band system (three band-splitters) is the largest subband coder 
which can be realized in half duplex without external program 
RAM. It is clear from this analysis that data memory is a 
major constraining factor in subband coder realizations. 

The real-time constraints in any subband coder 
realization, of course, are highly dependent on the sampling 
rate at which the coder is implemented. For toll quality 
speech, a sampling rate of eight kilohertz is appropriate, 
whereas for lower bit rate systems, sampling rates as low 
as 6.4 kilohertz might well be used. Since access to external 
program RAM, if used, must be performed by means of 
Table Reads and Table Writes, the programs must run much 
slower if more than 144 data storage locations are required. 
For the purposes of this paper, we will first compute the 
approximate number of cycles per sample point required for 
all of the operations in the subband coder excluding the data 
exchanges needed to support external program RAM. This 
will show approximately how much real-time remains for 
the implementation of the external program RAM data 
transfers. From this analysis and an analysis of the data 
transfer rates required by the Table Read and the Table Write 
operations, it will be obvious that essentially any reasonable 
subband coder for speech can be implemented in real-time 
on a single TMS320 equipped with adequate external 
program RAM. 

For a subband coder environment, the total number of 
cycles per sample point necessary to realize the arithmetic 
operations of the analysis and reconstruction filters is simply 
computed as 


N 
MM = © L(p)2!—P 
p=1 (22) 


current realization requires 24 cycles, while the control 
overhead per sample requires less than 50 cycles per sample 
per band-splitter. In addition, the APCM coders require 
approximately 60 cycles for both the coder and the decoder 
(120 cycles total for full duplex operation), while the delay 
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functions require less than 10 cycles per sample. Hence, the 
approximate number of cycles per sample can be written as 


N 
LL = MM+120N+24+460 © 2!~-p 
p=! (23) 


Table 3 gives a tabulation for the approximate percent 
of real-time required as a function of N excluding the time 
required to access external program RAM for a sampling 
rate of eight kilohertz. As would be expected, the real-time 
requirement is nowhere near linear with N, and there is still 
over 25% of real-time available even at N=5. Now let us 
consider the additional requirement imposed by the slow 
access of the external program RAM. The first point to note 
is that the vast majority of the required delay memory is not 
used in the filter computations, but is used to implement the 
compensation delays. The number of memory locations 
required by the filters is given by 


N 
MA = © 2L(p) 
p=l (24) 


Since, for the example shown, MA has a maximum value 
of 176, 84 for the receiver and 84 for the transmitter, then 
at most two data context switches must be computed for a 
full duplex realization and no data context switches must be 
performed for half duplex realizations. Since the data delay 
operations can all be performed using only one data memory 
location and paired Table Read and Table Write operations 
and since only block accesses are required, then no additional 
data memory is required and the total number of memory 
locations accessed per frame is given by 


N 
MB = 5 2P7! 
p=l (25) 


Since these operations must be done once per frame, and 
they can be performed in six cycles per data point, the total 
number of additional cycles is given by 


NA = [2MA+2MB]6/2N (26) 


As can be seen in Table 4, all of the example subband coders 
can still be implemented in real-time. 


CONCLUSIONS 
Based on the above analyses and the associated 


realizations, it is clear that a TMS320 is well matched to the 
subband coder implementation problem for voice 
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applications. For the subband coder realization, it was found 
that the delay compensation memory requirements dominated 
the realization, and reasonable implementations always 
required external program RAM. However, when this is 
made available, virtually any subband coder of interest can 
be implemented using the same hardware environment. 
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Fig. 2. (a) Two-band analysis-reconstruction band splitter 
(b) Equivalent polyphase structure for a two-band analysis-reconstruction 
band-splitter 
(c) Equivalent polyphase structure for analyzing the reconstruction 
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TABLE 1 
Summary of Speech Coding Techniques 


Coding Effective Aural Aural 
Technique Bit Noise Frequency 
Rates Masking 
Linear 120 Kbps- No No 
PCM 80 Kbps 
Companded 100 Kbps- Yes No 
PCM 50 Kbps 
Delta 80 Kbps- Yes No 
Modulation 50 Kbps 
DPCM 80 Kbps- Yes No 
40 Kbps 
ADM 40 Kbps- Yes No 
16 Kbps 
ADPCM 40 Kbps- Yes No 
16 Kbps 
Subband 32 Kbps- Yes Yes 
Coder 10 Kbps 
ATC 32 Kbps- Yes Yes 
7 Kbps 
APC 32 Kbps- Yes No 
4 Kbps 
LPC 2.4 Kbps- Yes No 
Vocoder .6 Kbps 
TABLE 2 
Storage Requirements For Typical Subband Coders 
Number Locations Locations 1/0 Delay 
of (Full (Half Buffer Memory 
Band-Splits Duplex) Duplex) (M) (K) 
1 76 52 2 64 
2 144 77 4 128 
3 278 144 8 256 
4 536 275 16 512 
5 818 364 32 768 
TABLE 4 


Total Cycles of TMS320 and Percent Real-Time 
Including External Program RAM Access Time at 
Eight Kilohertz Sampling Rate 


Number Cycles Percent 
of Per Real- 
Bands Samples Time 
1 268 43 
2 374 59 
3 617 98 
4 584 93 
5 540 87 
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Aural 
Phase 


No 


No 


No 


No 


No 


No 


Maybe 


Maybe 


No 


Yes 


Syllabic Vocal Short Pitch 
Energy Tract Time 
Model Stationarity 
No No No No 
No No No No 
No Yes No No 
No Yes No No 
Yes Yes No Maybe 
Yes Yes No Maybe 
Yes No No Maybe 
Yes Yes Yes Yes 
Yes Yes Yes Maybe 
Yes Yes Yes Yes 
TABLE 3 


Total Cycles of TMS320 and Percent Real-Time 
Excluding External Program RAM Access Time at 
Eight Kilohertz Sampling Rate 


Number Cycles Percent 
of Per Reail- 
Bands Samples Time 
1 268 43 
2 374 59 
3 427 68 
4 454 72 
5 467 74 
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Fig. 3. Octave band subband coding system 


Fig. 4. (a) Band-splitting module for octave band subband coder 
(b) Band merging module for octave band subband coder 
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TELECOMM/OATACOMM 
SPECIAL ISSUE 


Add DTME generation 


and dec 


g to 


DSP-P designs 


In a computer system that employs a 
digital-signal-processing .P and that’s 
equipped for phone-line communications, 
the DSP .P can generate and decode 
DTME dialing signals as well as handle 
typical DSP functions. Therefore, the 
system can both dial out to establish 
communications links and accept Touchtone 
inputs for remote control of tts functions. 


Patrick Mock, Texas Instruments 


A digital-signal-processing (DSP) pP can handle Touch- 
tone (DTMF) dialing and decoding over telephone lines 
in addition to its customary signal-processing chores. 
As a consequence, if a computer system already has a 
DSP pP and A/D and D/A converters in place, then the 
system can decode DTMF signals, and any Touchtone 
telephone can serve as a data-entry terminal or a 
remote-control console. The only cost for these DTMF 
enhancements is additional program space in the 
wP’s ROM. 

This article outlines a DTMF generating scheme and 
describes in detail the implemention of DTMF decoding 
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in a specific DSP »P, the TMS32010. Although the 
DTMF decoder functions as intended, it fails to meet 
AT&T specs exactly because it’s designed to detect 
DTMF tones in the presence of speech and because it 
suits computer applications like voice-mail and electron- 
ic-mail systems, which are not pure telephone applica- 
tions. DTMF tone decoders that do meet AT&T specs 
usually stop decoding tones if they detect speech. With 
a more exacting program, the TMS32010 could meet 
AT&T specs to the letter. One of the goals of this 
project, however, was to make the DTMF code as 
compact as possible to allow the DSP pP to do other 
jobs. Some performance was sacrificed as a conse- 
quence. 


Tone generation is easy 

A DTMF tone generator (Ref 1) can consist of a pair 
of programmable, second-order harmonic oscillators 
(Fig 1). The sample-generation rate of the oscillators 
determines the total harmonic distortion of the output. 
The higher the sampling rate, the more nearly exact 
the signal will be. In all cases, you must choose a 
sample-generation rate greater than approximately 7k 
samples/sec to achieve an acceptable signal. (Fig 2 
explains the DTMF tone-coding scheme.) 

Because the telephone company’s official digitizing 
rate is 8k samples/sec, most generating circuits run at 
this rate. According to the Nyquist criterion, which 
specifies that the sampling rate must be at least twice 


545 


If a computer can decode DIME signals, 
then any Touchtone telephone can serve as 
a data-entry terminal. 


the frequency of the highest-frequency signal being 
sampled, 8k samples/sec is more than adequate for 
generating any valid pair of tones using the TMS32010; 
the highest frequency involved is 1633 Hz. Because of a 
limitation in the system used to develop the chip’s 
tone-generating and -decoding programs, the decoding- 
program version listed in Fig 3 runs at 9766 samples/ 
sec, and all testing was done using this version. Howev- 
er, Table 1 presents coefficients for running at 8k 
samples/sec; Fig 4’s listing shows the portion of the code 
that must be amended for 8k-sample/sec operation. 

Fig 5 shows the flow chart for the DTMF tone- 
generating algorithm. (The DTMF tone-generating 
routine described in Ref 1 takes up 160 words in the 
program ROM.) The first step of the algorithm initial- 
izes the processor and the interfaces and performs all 
other required initialization. The next step retrieves 
the digit that’s to be dialed (0 through 9, A through D, 
or “#” or “*”) from a specified location in memory. The 
digit serves as a pointer within a table that contains the 
values required to initialize the resonators. 

Because this design uses two oscillators for eight 
possible frequencies rather than eight oscillators, to 
provide the correct frequencies you must load the 

Text continues on pg 212 


START 
IMPULSE 


HIGH-FREQUENCY GROUP 


1209 Hz 
l 


1633 Hz 


ie Hz Ou Hz 


CH} 


LOW- 
FREQUENCY 
GROUP 


852 Hz— — 


Fig 2—Pressing a bution on a Touchtone telephone’s 4x4 keypad 
generates DTMF signaling tones in pairs. For. example, pressing “6” 
generates a 770-Hz tone from the low-frequency group and a 1477-Hz 
tone from the high-frequency group. Note that the keypad has four 
keys (A through D) that are not normally seen on most phones. 
They’re available with some special instruments. 


Fig 1—A pair of programmable, second-order harmonic oscillators make up the DTMF tone generator represented by this directed graph. 
The delay boxes temporarily hold samples for one iteration. The delayed samples are multiplied by coefficients B and D and summed to 
generate a tone sample. The tones, in turn, are summed and sent to a D/A converter. 
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vi 1.00 


NOONE UAWNHeH OS 


10 


DTMF TONE DECODER 


0000 
0001 


0002 
0003 
0004 
0005 
0006 
0007 
0008 
0009 


000a 
000b 
000c 
000d 
000e 
OOOF 
0010 
Ooo11 


0012 
0013 


0014 
oo15 


f900 
0017 


0007 
0008 
000c 
OO0Ff 
0010 
0011 
0010 
0011 
0012 
0013 
0014 
0015 
0016 
0017 
0018 
0019 
OOla 
oO1b 
O01c 
0021 
0022 
0024 
0028 
0029 
002Za 
002b 
002d 
0035 
003b 
003c 


738b 
704e 
écbS 
68cb 
Sb23 
5355 
4af3 
3ef8 


4ef f 
442b 
37a0 
2cS8 
Q1d0 
ec23 
d712 
cQ00 


0200 
000a 


7t tf 
8000 


xxx kK kK & 


ROWMAX 
COLMAX 
DAT11 
DAT23 
DAT14 
DAT15 
DAT17 
DAT27 
DAT18 
DATZS 
DAT29 
DAT213 
DAT216 
DATIN 
* 

* BEGIN 
* 


* 


T™TMS320 Assembler vers 1.3 


DTMF TONE DECODER 
c Copyright Texas Instruments, 1984 
by Patrick C. Mock 
Northeast Systems Engineering 


TITL “ DTMF TONE DECODER ” 


IDT “vs 1.007 

B START Go To The Beginning 
EQu 7 Define Variables 
EQU 8 

EQU 12 

EQU 13 

EQu 16 

EQU 17 

EQU 16 10 Contains decoded row 
EQU 17 11 " de column 
EQU 18 

EQU 193 

EQU 20 

EQU 21 16 Contains last decode 
EQU 22 

EQU 23 

EQU 24 

EQU 25 

EQU 26 

EQU 27 

EQU 23 

EQU 33 

EQU 34 

EQU 36 

EQU 40 

EGU 4i 

EQU 42 

EGU 43 

EQuU 45 

EQU S33 

EQU Ss? 

EQU 60 

DATA TABLES 

DATA 29579 Real Coeff N=226 
DATA 28750 

DATA 27832 

DATA 263827 

DATA 23331 

BATA 21333 

DATA 191&7 

DATA 16120 

DATA 20223 Real Coeff N=222 
BATA 17963 2nd harmonic 

DATA 14752 

DATA 11352 

DATA 464 

BATA —-5020 

DATA -10478 

DATA -16384 

DATA S12 CLOCK = Sample Frequency 
DATA >O000A MODE 

DATA >7FFEF POSMAX Mask for data in 


Mask for data out 
Program continues on pg 208 


BATA >8000 NEGMAX 


Fig 3—This tone-decoding program for the TMS82010 runs at 9766 samples/sec. However, the official digitizing rate specified by the phone 
company 18 8k samples/sec; Fig 4 shows the section of code that adapts this program to 8k-sample/sec operation. 
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7O * 


71 0016 0001 TABLE BATA 1 ONE 
72 + Start of Program SESE FE IE HE EE FE EEE EE EEE EEE HE 
73 * 
74 0017 7f8b START SOVM 
75 0018 6e00 LDPK oO 
76 0019 6820 LARP ce) <Break > 
77 ooOla 7014 LARK 0, ONE 
78 OO1b 7el& LACK TABLE 
7? OO1c 6788 NEXT TBLR + Initialize Coefficients 
18) oo1id 1014 SUB ONE 
1 O01e f400 BANZ NEXT 
oOoif oO1c 
Bz 00z0 4311 OUT MODE , 9 Set AIB Mode 
33 0021 4710 OUT CLCK,1 Set AIB Clock 
S4- * 
85 * 
S46 #% Load not recognized symbol 
S7 * 
Ss 0022 7eff NOT LACK >FF 
Bg 0023 6915 DMOV LAST 
9O 0024 S015 SACL LAST 
91 # 
V2 0025 7F89 AGAIN ZAC Zero DFT Loop Variables 
93 0026 7O1LF LARK 0.31 
94 0027 7ilc LARK 1,DAT11 
95 oo02g8 fS500 BV ZERO 
0029 002a 
9G o02a 6381 ZERQ LARP 1 
D7 O002b 50aQ SACL #+,0,90 
98 O02Cc f400 BANZ ZERO 
002d o002Za 
99 * ‘ 
100 * Take data and calculate DFT loop 
101 * 
102 ag2ze 7ee2 LACK 226 SET DFT LOOP VARIABLE 
103 * 
104 O02f 5017 LOOP SACL COUNT 
105 0030 7OOF LARK 0,CS14 
106 0031 713b LARK 1,DAT214 
107 0032 1214 SUB ONE, 2 
198 0033 fcOO BGZ WAIT 
0034 0037 
109 0035 7007 LARK o,cSs 
110 0036 712b LARK 1,DAT28 
111 % 
112 0037 F400 WAIT BIOZ CALC Wait for A/D 
0038 OO0S3b 
113 00397 F900 B WAIT 
O03a 0037 
114 * 
115 oo3b 423¢ CALC IN DATIN, 2 
116 oOo3c 2012 LAC POSMAX 
117 oOo3d 733c XOR DATIN Convert data toe 320 format 
115° OOZe 5O3c SACL DATIN 
1:9 * 
120 * BEGIN DFT LOOFS 
121 % 
t22 * 
123 OOSF 4a8l1 FRFT LT #1 
124 0040 2c 3c LAC DATIN, 12 X¢re) 
12 0041 4298 SUBH #— X(nd-YCn-2) 
126 Q042 6d583 MFY * cos (S#CO)#YCn-1) 
127 0043 6b88 LTO * Y(n-1)-PY(n-2) and 
1235 0044 7tSF APAC 
129 QOo4s 7F8F AF AC 
1:30 Q046 ‘“7f3Ff AFA X(nd+2c0s CHC) HY Cn 1LdI-YCn-2) 
131 0047 5890 SACH #- 0,0 --> Y(n-1) 
132 0048 F500 BY CHECK 
0049 0050 
1:33 004a £400 BANZ FRPT 
0046 OO03f 
134 * 
135 004c 2017 LAC COUNT 
134 004d 1014 SUB ONE 
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1:37 004e 
oo4f 
138 
139 
140 
141 
142 0050 
143 0051 
144 
145 0052 
0053 
146 0054 
147 00ss 
00354 
148 
149 
150 
151 0057 
152 0058 
153 0059 
154 
155 
156 
157 o05a 
158 005b 
159 005c 
160 905d 
141 005e 
142 o0sF 
163 0060 
164 0061 
165 0062 
0043 
166 0044 
167 0063 
168 0066 
169 0067 
170 0068 
00469 
171 
172 
173 
174 004a 
175 006b 
174 006c 
177 006d 
178 006e 
179 O0Séf 
180 0070 
131 0071 
182 0072 
0073 
183 0074 
184 0075 
185 0076 
186 0077 
1837 0078 
0079 
188 
189 
190 
191 007a 
192 007b 
193 007¢c 
007d 
194 007e 
195 OO7Ff 
196 0080 
0081 
197 0082 
198 
199 0083 
200 
201 0084 
202 0083 


fe00 
o02f 


7007 
712b 


f800 
00eb 
5990 
F400 
0032 


7eO3 
S010 
5011 


7102 
7O21 
2022 
SoO0lia 
68380 
4898 
201la 
1088 
fd0o 
0067 
3110 
2088S 
Soia 
6891 
F400 
a0Se 


7102 
7029 
202a 
5S01b 
6880 
4898 
201b 
1088 
*d00 
0077 
3111 
2088 
SO1lb 
6891 
f400 
006e 


201b 
10la 
fd0o 
008a 
201b 
1414 
fad0 
ON22 
6alb 


800c 


7ft8e 
10la 


BNZ 


** £ * 


CHECK LARK 
LARK 

# 

MAGLP CALL 


SACH 
BANZ 


* 


LACK 
SACL 


SACL 
+ 


* Find Row Pea 
+ 


ROWS LARK 


ROWL LARP 


ROWBR MAR 


+ 


* Find Column 
* 
COLUMN LARK 


COLL LARP 


COLBR MAR 


* 


LOOP 


0,css 
1,DATZ2S 


ENERGY 


#-,1,0 
MAGLP 


3 
ROWMX 
COLMX 


k 


COLBR 


1, COLMX 


Calculate Energy at each frequency 


Compare Energies And Determine Decode Value 


* Check For Valid Signal Strength 


* 


ROWBIG LAC 
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COLMAX 
ROWMAX 
COLBIG 
COLMAX 
ONE , 4 
NOT 

COLMAX 


12 


ROWMAX 


Reverse Twist 


Ideal Sdb = 6 


549 


550 


203 
204 


205 
206 
207 


208 
209 
210 
211 
212 
213 
214 


215 
216 
217 
218 
219 
220 
221 
222 
223 
224 
225 
226 
227 
222 
229 
230 


231 
232 
233 
234 


235 
236 


237 
238 
239 
240 
241 
242 
243 
244 
245 
246 
247 
248 
249 
250 
251 
252 


253 
254 
255 
256 


237 
258 


259 
260 
261 
262 
263 
264 
265 
266 
267 


0086 
0087 
0oss 
0089 
008a 
008b 
008c 
008d 
008e 


Ooosf 


0090 
0091 
0092 
0093 


0094 
0095 


0096 
0097 
0098 
0099 
00%a 
009b 
009c 
009d 
009e 
oo9f 
00a0 
00al 
00a2 
00a3 
00a4 
00a5 
00aé 
00a7 


00a8 
00a? 


O00aa 
00ab 
O0ac 
00ad 
o0ae 
OOaf 
00b0 
00b1 
O00b2 
o0Oobs 
0064 
00bS 
O00bS6 
00b7 
00bs 
0O0b? 
OOba 
OObb 


0Obc 
00Obd 
OObe 
OObf 
00cd 
00c1 


fa00 
0022 
F900 
0094 
20ta 
1414 
fa00 
0022 
éala 


8003 


7fSe 
101b 
fa00 
0022 


éala 
82ab 


7003 
7ile 
2014 
5017 
6881 
z2caS 
7fF90 
68a0 
fbOoO 
00a3 
2017 
1014 
5017 
f400 
009a 
2017 
fed00 
0022 


6alb 
82ab 


7903 
7124 
2014 
5017 
6881 
2ca8 
7f90 
43a0 
fb0O 
00b7 
2017 
1014 
5017 
f400 
00ae 
2017 
fed0d 
0022 


7e2d 
0110 
5017 
3917 
7e038 
0010 


COLBIG 


* 


NOT 

VROW 

ROWMAX Twist 

ONE , 4 

NOT 

ROWMAX 

3 Ideal Adb = 3 


COOLMAX 
NOT 


* Check for valid row tone 


* 


VROW 
* 


* 


RVL 


RCNT 


+ 


ROWMAX 


683 683 = 1/6 = -8dB 


* Check for valid column tone 


* 


VCOL LT COLMAX 
* 
MPYK 683 683 = 1/6 = -8dB 
+ 
LARK 0,3 
LARK 1,DAT15 
LAC ONE 
SACL COUNT 
CVL. LARP 1 
LAC #+,12 
SPAC 
MAR *#+ 0 
BLEZ CCNT 
LAC COUNT 
SUB ONE 
SACL COUNT 
CCNT BANZ CVL 
LAC COUNT 
BNZ NOT 
* 
* Check 2ND Harmonic Energy Levels 
* 
LACK DAT29 Calculate address of 
ADD ROWMX . 1 row data locations 
SACL COUNT 
LAR 1, COUNT 
LACK cs? 
ADD ROWMX 
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COUNT 
0, COUNT 
ENERGY Calculate eneray level 


ROWMAX 


4095 ROWMAX /8 


ROWMAX/4 > 2nd Har 
NOT 


DAT213 Calculate address of 
COLMX,1 col data locations 
COUNT 

1, COUNT 

cs1i3 

COLMX 

COUNT 

0, COUNT 

.¢) 

ENERGY Calculate energy level 


LT COLMAX TEST CODE 
MPYK 2048 


SPAC 
BGEZ NOT -12dB = 1/16 


Load recognized number and check that it is new 


DBMOV LAST2 

DMOV LAST 

LAC ROWMX , 2 

ADD COLMX 

SACL LAST 

SUB COUNT Return if same number 
BZ NOT 


LAC LAST 
SUB LAST2 
AGAIN 2 Passes to recognize 


LAST,2 
AGAIN <break> 


* 

* Energy Calculation Subroutine 

# 

ENERGY LAC NEGMAX , 15 NEGMAX = >8000 
*#.15,1 
COUNT -1/2 + CSn/2 
e— 
COUNT 


COUNT, 1 D2(CSn—-1)/2 
e+ 


COUNT 


COUNT, 1 Di#xD2(CSn-1)/2 
#-,15 
*,15 


od abs (D2~-D1)/2 
* 
* 
((D2-D1)/2)*2 
COUNT ,15 ((D2-Di )*2) /4-D1i*D2(CSn-1)/2 
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DTMF decoding doesn’t necessarily require 
elaborate DSP routines; the routine 
presented here leaves room for several other 
DSP routines in the DSP wP. 


oscillators’ coefficients (B1, D1, B2, and D2 in Fig 1) 
prior to the start of signaling. (The use of eight 
oscillators would decrease execution time but would 
require four times as much memory as two oscillators.) 
After initializing the resonators, the program loops 
repeatedly through the resonator code and generates 
samples of the appropriate high- and low-frequency 
tones. Then the program sums the pairs of tone sam- 
ples. The DSP wP then feeds this sum to an external 
D/A converter, and the resulting analog output is the 
DTMF signal. 

Frequency specs aren't the only ones DTMF tones 
have to meet; duration specs apply also. According to 
AT&T specs, 10 digits/sec (or 100 msec/digit) is the 
maximum data rate for Touchtone signals. AT&T speci- 
fications state that within its allotted 100-msec interval, 
a tone must be present for at least 45 msec and no more 
than 55 msec. During the remainder of the 100-msec 
interval, the tone generator must be quiet to allow the 
receiver's DTMF decoder to settle. Therefore, a count- 
er makes sure that the generated tone’s duration meets 
the minimum time—approximately 45 msec—to mini- 
mize computing time. After the tone’s been on for a 
sufficiently long time, the D/A converter is zeroed and 
maintained at the zero-output level so that the total on 
time and off time equals 100 msec. 

Although DTMF tone-decoding schemes require con- 


TABLE 1—RECOMMENDED DFT LENGTHS 
AT AN 8-kHz SAMPLING RATE 


1ST HARMONIC 
N = 205 DUR = 25.6 mSEC = (18 20 22 24 31 34 38 42) 


2ND HARMONIC 
N = 201 OUR = 25.1 mSEC = (35 39 43 47 61 67 74 B2) 


COEFFICIENTS N = 205 N= 201 
1ST HARMONIC 2ND HARMONIC 
27906 15036 


TABLE 2—DFT PROGRAM SPECIFICATIONS 


N = 226 
K = (16, 18, 20, 22, 28, 31, 34, 38) 
N = 222 
K = (32, 35, 39, 43, 55, 61, 67, 74) 
255 WORDS 
60 WORDS 
9766 SAMPLES/SEC 
102.4 »SEC 


45 p 
23.2 mSEC 


FIRST HARMONIC 
SECOND HARMONIC 


DFT SIZE: 


PROGRAM WORDS 
DATA MEMORY WORDS 
SAMPLING FREQUENCY 
SAMPLING INTERVAL 
DFT LOOP TIME 

TOTAL DFT TIME 


TIME REQUIRED BY 
THE DECISION LOGIC 


150 w¢SEC (ONE SAMPLE 
MISSED BETWEEN DFTs) 
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Glossary 


Center frequency offset—the offset of the center 
of the recognition bandwidth from the nominal 
DTMF frequencies. 

DFT—discrete Fourier transform. 
DTMF—dual-tone multifrequency signaling sys- 
tem used by the telephone company for dialing. 
Guard time—the duration of the shortest DTMF 
tone a detector will recognize. 

II1R—infinite impulse response (a type of digital 
filter). 

Log-Linear—transformation of logarithmically 
compressed data from a codec back to linear 
form. 

Recognition bandwidth—the percent change 
from the nominal frequencies that a detector will 
tolerate. 

Reverse twist—the condition that exists when a 
DTMF signal’s row amplitude is greater than the 
column amplitude. 

Standard twist—the condition that exists when a 
DTMF signal’s column amplitude is greater than 
the row amplitude. 

Talk-off—a measure of the detector’s ability to 
ignore:speech signals that look like DTMF signals. 
Twist—the difference, in decibels, between the 
loudest row tone’s amplitude and the loudest col- 
umn tone’s amplitude. 


siderably more code than do generation schemes, the 
decoding program in Fig 3 takes less than twice as 
much code as the simpler generating program. Fur- 
thermore, both programs are much smaller than the 
total capacity of the DSP uP. In this case, rather than 
being called as a result of a keystroke (as the generating 
algorithm is), the decoding algorithm continually pro- 
cesses signal samples and so must be interleaved with 
other DSP functions. The algorithm must run continu- 
ally because, after all, it doesn’t know whether or not 
DTMF tones are present until after it processes the 
input. 

The discrete Fourier transform (DFT) algorithm 
employed in the program listing is known as Goertzel’s 
algorithm (Ref 2). This algorithm is compact and needs 
only one real coefficient per frequency to determine 
magnitude (Fig 6); although extracting magnitude and 
phase requires complex coefficients and hence more 
complex programming, you can decode DTMF signals 
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& 
*BEGIN DATA TABLES 
* Real Coeff N=205 
DATA 27906 697 
DATA 26802 770 
DATA 25597 851 
DATA 24295 941 
DATA 19087 1209 
DATA 16527 1336 
DATA 12945 1477 
DATA 9166 1633 
harmonic Real Coeff N=201 
DATA 15036 1394 
DATA 11287 1840 
DATA 7363 1702 
DATA 3323 1882 
DATA -10805 2418 
DATA -16384 2672 
DATA -22153 2954 
DATA ~27440 3266 


DATA 419 CLCK = Sample Frequency 
DATA >OO0A MODE 


DATA >7FEFF POSMAX = Mask for data in 
DATA >8000 NEGMAX = Mask for data out 


* 
* 
TABLE DATA ONE 
* Start of de tee he she te the the she te te te the Se te te de te tate te te 
* 
START SOVM 
LDPK 
LARP <Break> 
LARK 0, ONE 
LACK TABLE 
TBLR * Initialize Coefficients 
SUB ONE 
BANZ NEXT 
OUT MODE ,0O Set AIB Mode 
OUT CLCK,1 Set AIB Clock 


Load not recognized symbol 


LACK >FF 
DMOV LAST 
SACL LAST 


ZAC Zero DFT Loop Variables 
LARK 0,31 

LARK 1,DAT11 

BV ZERO 

LARP 1 

SACL *+,0,0 

BANZ ZERO 


Take data and calculate DFT loop 
LACK 205 SET DFT LOOP VARIABLE 


SACL COUNT 
LARK 0,CS16 
LARK 1,DAT216 


SUB ONE, 2 
BGZ WAIT 
LARK 0,CS8 
LARK 1,DAT28 


Fig 4—This amendment of Fig 3’s listing adapts the tone-generating routine to 8k-sample/sec operation. It’s a substitute routine for lines 
48: to 122. 
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DTME decoders that meet AT@T specs 
usually stop decoding tones if they detect the 
presence of speech. 


simply by extracting the magnitude of a tone’s frequen- 
cy components and ignoring their phase. In addition, 
instead of waiting for a complete sample set to begin 
processing, Goertzel’s algorithm processes each sample 
as it arrives. 

Goertzel’s algorithm takes the form of a series of 
second-order IIR (infinite-impulse-response) filters. 
Notice that, in Fig 6, you can divide the directed graph 
into two parts: a left-hand part that includes the two 
feedback elements (boxes marked “delay”) and a right- 
hand portion leading to the output that has no feedback 
elements. For DTMF decoding, you are interested only 
in the last iteration (N—1) of the algorithm. Conse- 
quently, because the right-hand branches don’t involve 
feedback, there’s no need to execute these branches of 


INITIALIZE 
GET DIGIT 
INITIALIZE 
DUAL 
RESONATORS 
GENERATE 
TONE SAMPLE 
OUTPUT 
TONE SAMPLE 


TONE ON LONG 
ENOUGH? 
NO 
GENERATE 
QUIET 


Fig 5—The directed graph in Fig 1 translates into a program that 


follows this flowchart. Note the step that produces a quiet period. 
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TABLE 3—TONE DECODER TEST RESULTS 


BANDWIDTH TEST RESULTS: 


TONE %HIGH %LOW TONE %HIGH %LOW 


1209 
1336 
1477 
1633 


SPECIFICATIONS REQUIRE: MIN = 1.5% AND MAX = 3.5% 


AMPLITUDE RATIO TEST RESULTS 
TWIST REVERSE TWIST 
SPECIFICATIONS >4.0 dB >8.0 dB 
DIGIT 1 5.3 8.4 
DIGIT 5 
DIGIT 9 


NOISE TEST: PASSES AT - 24 dBV 


TALK-OFF IMMUNITY: ONE FALSE RECOGNITION 
PER 1000 CALLS (SPEC 1500) 


the algorithm until the last iteration of the algorithm. 

What’s not obvious from the directed graph of the 
algorithm is that the left-hand constant, 2cos(27k/N), is 
the same as the right-hand constant, W*y, for calculat- 
ing the magnitude of DTMF signals. Wy is a complex 
number, and the left-hand constant is not. However, 
the program calculates the magnitude squared of the 
output of the algorithm. Squaring a complex number 
always yields a real number, and in this case, squaring 
the right-hand constant yields a real number that’s the 
same as the left-hand constant. Therefore, Goertzel’s 
algorithm, adapted to DTMF decoding, not only exe- 
cutes quickly because it has few steps, but it also takes 
up little memory space because it uses few constants. 

Given a time-ordered sample set of size N, processing 
each sample means you'll do N iterations of the algo- 
rithm. If k is the frequency you’re solving the trans- 
fprm for, then the values k and N determine the 
coefficients of each IIR filter. The values of k and N and 
the sampling rate also determine how accurately the 
transform discriminates between in-band and out-of- 
band frequencies. 

Specifically, k is a discrete integer corresponding to 
the frequency you're solving for. It’s defined as 


k=N xfrequency/(sample rate). 


(Note that you must round off the frequency of interest 
to an integer.) W*y is a frequently encountered constant 
in digital signal processing. It’s defined as 


W*y=exp(—2njk/N). 
Because the sampling rate and the frequencies you’re 
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The decoding algorithm processes signal 
samples continuously, so the DSP p.P must 
interleave the decoding with other DSP 
functions. 


extracting are fixed (by the phone company), the sam- 
ple-set length (N) is the only parameter you can vary. 
In order to obtain the best performance from the 
transform, the length of the sample set must be opti- 
mized with respect to two conflicting criteria: The 
sample set must be small enough so that the decoder 
can accumulate a complete set in an interval that’s short 
enough to keep up with the DTMF digit-transmission 
rate; conversely, the sample set must be long enough so 
that the transform discriminates between in-band and 
out-of-band signals. An exhaustive and inelegant com- 
puterized search of all possible combinations of k and N 
resulted in the sample lengths listed in Tables 1 and 2 
and used in the Fig 3 and 4 program listings. 


Companding not accounted for 

This design assumes that the analog input is linearly 
encoded. This is often not the case, because many 
phone signals are compressed logarithmically by a 
codec. In such cases, you must first perform a log-to- 


Fig 6—A simplified form of Goertzel’s algorithm, represented by 


linear expansion before submitting the samples to the 
DFT algorithm. (Ref 3 describes how to do companding 
with the TMS32010 if your system doesn’t incorporate 
companding hardware.) 

Fig 7 shows how samples are fed, in effect, into a 


this directed graph, decodes DTMF tones. A program that imple- 
ments it can save processing steps by performing the calculations 
illustrated by the right-hand portion of this graph for just the last 
iteration. Furthermore, for DTMF-decoding purposes, this compact 
algorithm requires only one constant per frequency because both the 
right-hand and left-hand constants have the same value. 


COMPRESSED PCM SIGNAL DIGITIZED 


SIGNAL 


LOG-TO-LINEAR ) LINEAR 

EXPANSION 
DETECTOR FOR 697 Hz 
sc 


1 
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THRESHOLD 
DETECTOR 
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Fig 7—The tone decoder employs 16 of the transforms shown in Fig 6. They extract each of the eight Touchtone frequencies and their second 
harmonics. 
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Goertzel’s algorithm, a discrete Fourier 
transform algorithm, operates as a series of 
second-order IIR filters. 


parallel array of 16 DFT algorithms. There is one DFT 
for each of the eight frequencies and each of the second 
harmonics of the eight frequencies. You need the second 
harmonics as well as the fundamentals to discriminate 
between speech and DTMF tones. Of course, they 
execute serially because they are sections of code—not 
physical devices. 

As Fig 8’s flowchart shows, after initializing the 
processor, the program feeds the first sample to the 
IIR filters. After all the samples have been processed, 
each filter’s current value is squared. This operation 
yields the magnitude of the strength of the signal at 
each of the eight DTMF frequencies and each second 
harmonic of the DTMF frequencies. 

The program next compares the data against several 
thresholds. It performs four principal checks. First, 
after finding the strongest signals in the high- and 
low-frequency groups, it simply determines whether 
any valid DTMF tones are present at all. If the strong- 
est signals are not above a minimal value (2°" in Fig 3’s 
program), the program does no more processing and 
begins collecting another sample set. 

Second, if valid DTMF tones are present, the pro- 
gram checks the strongest signals in the low and high 
(row and column) groups for twist—the ratio of the 
row-tone amplitude to the column-tone amplitude. This 
ratio must be between certain values for the DTMF 
tones to be valid. (Because of the frequency response of 
telephone systems, the high tones are attenuated. 
Consequently, the phone company doesn’t expect the 
high and low groups to have exactly the same amplitude 
at the receiver, even though they were transmitted at 
the same strength.) 

Third, the program compares the amplitude of the 
strongest signal in each group to the amplitudes of the 
rest of the tones in its group. Again, the strongest tone 
must stand out from the other tones in its group by a 
certain ratio. 

Finally, the program checks to see that the strongest 
signals are above one threshold while their correspond- 
ing second harmonics are below another threshold. 
Checking for strong harmonics insures that the DSP 
system won’t confuse speech for DTMF signals. 
(Speech has significant even-order harmonics; DTMF 
signals don’t.) 


Fig 8—The tone-decoder program does far more than simply detect 
the presence of DTMF signals; it also performs an elaborate series of 
checks to ensure that the tones are within specifications and that a 
valid tone is new data that must be acted on. 
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The DI'ME-decoding program checks the 
_ signal pair to establish tone validity, and 
then it determines whether the pair 
constitutes new information. 


If the DTMF signal pair passes all these compari- 

sons, then it’s a valid tone pair that corresponds to a 
digit. Just because it’s valid, however, doesn’t mean 
that the corresponding digit is necessarily new informa- 
tion. The remaining two steps of the program compare 
the current digit to the two most recently derived 
digits. First, the program checks to see if the current 
digit is the same as the second-to-last digit. If they 
match, then the program assumes the tone hasn’t 
changed lately. If they differ, it performs one final 
check to see if the current digit matches the last digit 
received. If these are the same, then the DTMF tone 
has changed recently and remained stable for two 
iterations. This means you finally have a valid new 
digit. If they don’t match, it means the tone has 
changed since the last sample was acquired but hasn’t 
remained stable long enough. Consequently, the pro- 
gram loops back without signaling that a new digit has 
arrived. If the new tone is really valid and stable, the 
next iteration of the algorithm will recognize the digit 
as valid because the new current digit will now match 
the previously received digit. 
_ There are two reasons for checking three successive 
digits at each pass. First, the check eliminates the need 
to generate hits every time a tone is present; acknow]- 
edging it only once is enough. As long as the tone is 
present, it can be ignored until it changes. Second, 
comparing digits improves noise characteristics and 
speech immunity. 

The implementation of the decoder algorithm follows 
the specification listed in Table 2. The TMS32010’s 
Harvard architecture separates data and program 
memory. The data memory is on chip. The program 
keeps the tables required by the decoding algorithm in 
on-chip data memory. These tables take up more than 
half the available data-memory locations. Depending on 
your application, you might have to store the tables in 
program memory and move the tables onto the chip 
every time the decoding algorithm runs. This will free 
the on-chip memory for other uses, but it will obviously 
increase the decoding algorithm’s execution time. 


Checking the decoder’s performance 

Evaluating the performance of a DTMF decoder is 
more difficult than evaluating the performance of a 
DTMF generator. You can check the generator very 
simply with a spectrum analyzer. To test the decoder, 
on the other hand, you have to determine not only that 
it will decode valid tones, but that it will both reject 
invalid signals and operate properly in the presence of 
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noise. Testing the decoder using AT&T’s published test 
method is an all-day affair and requires a specific 
instrumentation suite. Prerecorded tapes of various 
test tones speed things considerably. For example, 
Mitel’s (San Diego, CA) $90 CM7291 test cassette tape 
cuts the evaluation time of DTMF tone receivers to less 
than 90 minutes, according to the company (Ref 4). 
The TMS32010’s DTMF decoder was tested against 
the Mitel test tape. The test results given in Table 3 
indicate that the receiver can detect all tones. And the 
receiver bandwidths conformed almost exactly to all 
AT&T specs. There were only three tones for which the 
decoder was slightly off. In two instances, results were 
0.2% too large and, in one instance, 0.2% too small. The 
other AT&T specs were met perfectly, including the 
twist’s dynamic range at 25 dB, the guard time at 20 
msec, and the white-noise test at 24 dBV. EDN 
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A SINGLE-PROCESSOR LPC VOCODER 


Andrew W. Holck 
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ABSTRACT 


This paper presents the design of a full-duplex 2400-bit/sec 
vocoder which implements an LPC-10 algorithm in real time. 
A single commercially available digital signal processor IC, the 
TMS32010, is used to perform the digital processing. The 
TMS32010 code for this vocoder has been run on a software 
simulator and has been found to be within the real-time con- 
straints of the vocoder design. 


INTRODUCTION 


Tenth-order Linear Predictive Coding (LPC) algorithms have 
been used successfully in digital speech data compression 
systems to achieve low bit rates. A main limitation to the use 
of these systems is the cost of implementation. These costs 
have been steadily declining as advances in semiconductor 
technology are made. Recently, programmable signal process- 
ing devices have been introduced which incorporate high-speed 
multiply/accumulate hardware with RAM, ROM, and program 
control onto a single IC. Compact, low-cost vocoder implemen- 
tations based upon such signal processing devices have been 
demonstrated. 


One of the challenges of designing a system _based upon these 
devices is how to efficiently structure the algorithms which 
are to be implemented. Traditional LPC algorithms perform pro- 
cessing on a frame-by-frame basis which requires the buffer- 
ing of at least one frame of input speech data. The RAM re- 
quired to perform this buffering is larger than the internal RAM 
of currently available programmable signal processing ICs 
(<512 bytes). 


This restriction has been averted in several ways. Many LPC 
algorithms can be restructured so that the processing is done 
in a sample-by-sample or ‘’streamed’’ manner instead of frame- 
by-framel']. This eliminates the need to buffer a frame of 
speech. The processing required for an LPC vocoder can also 
be distributed between several processors. The LPC algorithm 
can be split into the operations of autocorrelation analysis, pitch 
detection, and synthesis. With a distributed processing ap- 
proach, a separate processor can be used to implement each 
of these functions/2). 
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An alternative approach is to provide additional external RAM. 
The TMS32010 digital signal processor has the capability of 
accessing external RAM. With added hardware, a relatively 
large external RAM space can be accessed efficiently with a 
modest overhead penalty. This feature makes it possible to 
implement a full-duplex 2400-bit/sec frame-oriented LPC-10 
vocoder using a single TMS32010 to perform the digital 
processing. 


VOCODER HARDWARE 


The vocoder block diagram is shown in Figure 1. Analog speech 
input is digitized by the anaiog conversion system, coded into 
a 2400-bit/sec data stream, and transmitted to an external 
system through the coded data !/O block. Simultaneously, 
2400-bit/sec coded speech data is received from an external 
system through the coded data I/O block. This data is used 
to synthesize speech samples which are then converted to a 
synthesized analog speech signal by the analog conversion 
system. The extended RAM system provides a larger RAM 
space than that available internally on the TMS32010. 


TMS32010 Digital Signal Processor 


The TMS32010 is a general-purpose digital signal processor 
IC which is used in this vocoder to perform the number crun- 
ching required to implement the LPC-10 algorithm. The vocoder 
design requires a processor with fast execution time, process- 
ing power, and general-purpose features for stand-alone opera- 
tion. Fast execution time is achieved through a 200-ns instruc- 
tion cycle time and a pipelined architecture which enables most 
instructions to execute in a single cycle. A two’s complement 
16 x 16-bit single-cycle hardware multiplier with 32-bit result 
and a 32-bit wide accumulator provide processing power. The 
multiplier and the accumulator enable the TMS32010 to per- 
form a multiply-accumulate operation in 400 ns. General- 
purpose features which allow the vocoder to be implemented 
with a single processor include: full-speed execution from ex- 
ternal program memory, ability to access external parallel I/O 
ports, both interrupt (INT) and polling input (BIO) pins, and a 
full set of bit manipulation and branch instructions. This com- 
bination of features, along with internal ROM (1.5K x 16) and 
internal RAM (144 x 16) in a single 40-pin package, enables 
this vocoder design to be much more compact than a similar 
bit-slice design. 
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FIGURE 1 - VOCODER BLOCK DIAGRAM 


Analog Conversion System 


Codec/filter combochips are commercially available which are 
tailored for the A/D and D/A conversion of telephone quality 
signals. One of these chips is used in the analog conversion 
system to perform full-duplex A/D and D/A conversion, input 
antialiasing filtering, and lowpass filtering of the output. An 
oscillator is used to provide the clock tor the codec’s switched- 
capacitor filters. The oscillator also generates an 8-kHz strobe 
which controls the sampling rate of the codec. This strobe is 
also the interrupt signal for the TMS32010. A shift register 
is used to perform serial-to-parallel conversion of the codec 
A/D data and parallel-to-serial conversion of the data sent to 
the codec D/A by the TMS32010. The TMS32010 uses one 
of its eight parallel !/O ports to access the analog conversion 
system. Nine devices are required to implement this block. 


Program Memory 


The vocoder requires about 3K words (16-bit words) of the 
™MS32010's 4K-word program memory address space. Pro- 
gram memory can be implemented in two ways. All 3K words 
can be implemented off-chip, or the 1.5K words of internal 
masked ROM can be used and the remaining 1.5K words im- 
plemented off-chip. 


Extended RAM System 
The TMS32010 accesses the extended RAM system using two 


of the eight parallel 1/O ports. One of the ports is used to ac- 
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cess the address counter for the RAM and another port is 
used to access the RAM. The counter increments each time 
the 2K-word RAM is accessed. The counter hardware can be 
thought of as an external autoincrementing index register. This 
feature enables the external RAM to be accessed with a 
minimum of overhead. The TMS32010 accesses one of the 
1/O ports in two instruction cycles (400 ns). A buffer is need- 
ed to read the output of the counter so that the interrupt routine 
can restore the counter’s contents. Seven devices are required 
to implement this block. 


Coded Data 1/O 


One of the TMS32010’s eight I/O ports is used to access the 
coded data !/O block. This block handles the transfers of cod- 
ed speech data to and from the external system. The implemen- 
tation of this block will depend on the characteristics of the 
external system. 


VOCODER SOFTWARE 


The software for the system consists of straightforward LPC 
routines, organized to take advantage of the TMS32010’s 
architecture. The software is based on the 2400-bit/sec 
analysis/synthesis modules used in the Speech Command 
System for the Texas Instruments Professional Computer, 
which are used only in half-duplex mode. The switch to full- 
duplex mode was made by interleaving analysis and synthesis 
modules on a frame basis, using the external RAM to buffer 
the sampled data. 
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The analog speech data is processed through a standard codec, 
operating at an 8-kHz sample rate. The 8-kHz strobe which 
sets the sampling rate for the codec is also connected to the 
interrupt pin of the TMS32010. For each interrupt, the 
T™MS32010 reads in an A/D sample and writes out a D/A 
sample. The input samples are decoded from the codec yz-law 
representation to linear representation through a lookup table 
stored in the TMS32010 program memory, and are then buf- 
fered in the external RAM. Synthesized speech samples in linear 
format are read from the external RAM, coded to yz-law 
representation through a tree-structured algorithm, and out- 
put to the codec. 


The analysis is performed by standard autocorrelation analysis, 
using a 30-ms Hamming window with a 20-ms frame interval. 
The autocorrelation is done with blocks of 20 samples. For 
each autocorrelation coefficient, 10 terms are multiplied and 
accumulated in the 32-bit wide accumulator before perform- 
ing the processing for the next coefficient. The reflection coef- 
ficients are calculated from the autocorrelation coefficients by 
the LeRoux-Guegen algorithml3]. Pitch tracking is done with 
the Gold-Rabiner pitch trackerl4]. The energy, pitch, and 10 
reflection coefficients are then quantized to 48 bits per frame, 
providing the basic 2400-bit/sec data rate. The lattice struc- 
ture is used for synthesis, with the parameters being inter- 
polated only at the start of pitch periods. 


With the TMS32010 instruction cycle of 200 nanoseconds, 
there are 625 instruction cycles per sample period, for a total 
of 100,000 instruction cycles per 20-ms frame period. Out of 
this total, 87% of the processing capacity is used by the full- 
duplex processing, exclusive of the coded data I/O. The 
breakdown of the processing is shown in Figure 2. An inter- 
esting result is that the processing time required for analysis 
(37%) is less than the processing time required for synthesis 
(39%). This is due to the efficiency of the TMS32010 in per- 
forming the autocorrelation function in the spectral analysis. 


Spectral Analysis 17% 
Pitch Tracking 20% 
Synthesis 39% 
1/0 Overhead 11% 

Total 87% 


FIGURE 2 - PROCESSOR UTIUZATION (% OF REAL TIME USED) 


These timing numbers were derived using an internal simulator 
on a VAX computer, using previously collected digitized speech 
files. The simulator models the entire system, including the 
"external RAM and the codec companding, through FORTRAN 
subroutines linked to the basic TMS32010 simulator. The 
TMS32010 simulator itself is written in VAX Macro, providing 
detailed simulation of each instruction, and allowing break and 
trace points for following the program flow and timing. 


The code for the full-duplex vocoder, exclusive of the code 
required for the coded data I/O, occupies about 70% of the 
™MS32010’s 4K-word (16-bit words) address space. The 
breakdown of program memory usage is shown in Figure 3. 
The internal RAM of 144 words is not sufficient for all of the 
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full-duplex processing. It is used as 128 words for the current 
process and 16 words associated with the interrupt process- 
ing. The 128 words are swapped between internal and exter- 
nal RAM as the process is swapped between analysis and syn- 
thesis, so that external RAM is holding the memory state of 
the non-current process. The breakdown of the external RAM 
usage is shown in Figure 4. 


Initialization and interrupt Processing 448 
Analysis 1106 
Synthesis 440 
Tables _854 

Total 2848 


FIGURE 3 - PROGRAM -MEMORY USAGE (16-BIT WORDS) 


A/O and D/A Sample Buffers 960 
Coded Speech Data Buffers 256 
Internal RAM Buffers 384 
Hamming Window Constants 240 
Pitch Tracker Variables _ 40 

Total 1880 


FIGURE 4 - EXTERNAL RAM USAGE (16-8IT WORDS) 
SUMMARY AND COMMENTS 


A compact LPC-10 vocoder can be implemented based upon 
this design. Approximately 20 commercially available ICs are 
required to implement this design, excluding the coded data 
1/0 block. Processor loading, ROM usage, and external RAM 
usage are within system constraints with sufficient margin to 
handle the interface to an external system. The vocoder hard- 
ware is not limited to LPC-10 analysis and synthesis. Other 
speech or signal processing algorithms can be implemented 
with similar hardware structures. Since the TMS32010 can 
execute at full speed from external program memory, one 
possibility is to implement the program memory with RAM. 
This feature would give a host system the ability to use the 
same hardware for different signal processing functions by 
downloading different programs to the program RAM. 


A significant portion of the vocoder hardware is the interface 
logic for the codec. Future developments are anticipated which 
will reduce the hardware required for the analog conversion 
system, provide more flexible A/D and D/A conversion, and 
reduce the !/O overhead for the A/D and D/A samples/5I. 
Another significant portion of the vocoder hardware is the ex- 
tended RAM system. The ability of the TMS32010 to efficient- 
ly access external RAM is one of the main features which 
allows a single TMS32010 to perform the full-duplex LPC-10 
processing. Advances in semiconductor technology will allow 
larger internal RAM spaces, but there will still be the need to 
efficiently access external RAM. Features could be incor- 
porated into future digital signal processors which would allow 
efficient access of external RAM with a minimum of external 
hardware. These features, along with increased internal RAM 
spaces and processing speed, will allow more and more com- 
pact implementations of compiex signal processing algorithms. 
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PREFACE 


Speech processing is one of the first applications of digital signal processing. With the 
advancement of commercially available digital signal processors, speech processing 
algorithms can now be implemented in compact, low-power systems. These systems 
are now widely used in both commercial and military applications. Some of the 
applications are voice-mail, voice store-and-forward, secure communications, voice 
response for secure/warning systems, and speaker-dependent/-independent recognition. 


Speech algorithms are typically complex and large in program size. For the algorithm 
to be executed in real time, the speech processor must finish computing all necessary 
steps before the next frame of data is received. A speech processing system also carries 
a heavy I/O load intensity. The Texas Instruments TMS320 processors’ fast instruction 
cycle and parallelism in the architecture allow many speech processing algorithms to 
be performed in real time. The processors’ large address range (4K words of program 
for the TMS32010 and 64K words for the TMS32020) meet large memory requirements 
of many complex speech algorithms. The TMS320 can also access external I/O devices 
efficiently, thus eliminating the need of buffering large arrays of speech data. Finally, 
the TMS320 exhibits some common characteristics of general-purpose microprocessors, 
such as a rich instruction set and hardware/software interrupts, which make them very 
easy to use. Because of these features, the TMS320 digital signal processors have been 
widely adapted as the speech processor for many speech processing systems. Some of 
these systems are: IBM’s Voice Communications Option,! Votan’s Plug-in/Card for 
IBM PC,2 TI-Speech System,3 and DECtalk Text-to-Speech System.4 


The TMS320 can be used to implement speech algorithms at a variety of data rates. 
In the Telecommunications section of the book, two speech algorithms, 32-kbit/s ADPCM 
and 16-kbit/s subband coding, have been included. In the Computers and Peripherals 
section, this report from MIT was chosen to give the designer an appreciation of the 
requirements of a processor for speech processing applications. In the report, a speech 
algorithm at 9.6-kbit/s using Adaptive Predictive Coding (APC) was thoroughly studied. 
The TMS32010 was selected as the most suitable device for implementing a compact 
APC speech system. The design considerations of critical loop, control logic, and memory 
allocation are performed and included. A preliminary hardware design consisting of 
a TMS32010, 2K words of program memory, and 2K words of data memory is included. 
TMS32010 codes for performing major steps of the APC algorithm are listed in the 
appendices of the report. These include Average Magnitude Difference Function (AMDF) 
pitch estimation, pitch prediction coefficient calculation, combined-first residual and 
autocorrelation computation, Leroux-Gueguen recursion for LPC Parcor coefficients 
computation, predictive quantizer loop, receiver loop, and I/O service routine. 


Although the report does not give a cookbook solution to the implementation of APC, 
the material included provides enough ingredients for readers to construct an APC system 
and a wide range of vocoder derivatives. For example, the hardware configuration and 
a subset of the TMS32010 code in the appendices can be used as references for creating 
a TMS32010-based pitch-excited LPC vocoder at 2400 bps. Some of the useful routines 
for the 2400 bps LPC are AMDF pitch estimation, autocorrelation computation, Leroux- 
Gueguen recursion for LPC Parcor coefficients computation, receiver loop, and I/O 
service routine. 
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ABSTRACT 


A speech coding processor architecture design study 
has been performed in which the Texas Instruments 
TMS32010 has been selected from among three 
commercially available digital signal processing integrated 
circuits and evaluated in an implementation study of real- 
time Adaptive Predictive Coding (APC). The TMS32010 has 
been compared with the AT&T Bell Laboratories DSP I and 
Nippon Electric Co. »PD7720 and was found to be most 
suitable for a single chip implementation of APC. A 
preliminary system design based on the TMS32010 has been 
performed, and several of the hardware and software design 
issues are discussed. Particular attention was paid to the 
design of an external memory controller which permits rapid 
sequential access of external RAM. As a result, it has been 
determined that a compact hardware implementation of the 
APC algorithm is feasible based on the TMS32010. 


1. INTRODUCTION 

Recently, several digital signal processing integrated 
circuits (DSPs) have become commercially available. These 
devices possess significant computational capability and 
permit a variety of speech processing algorithms to be 
implemented in compact, low power systems. In this report 
we summarize the results of a processor design study in 
which the Texas Instruments (TI) TMS32010, the Nippon 
Electric Co. (NEC) pPD7720, and the AT&T Bell 
Laboratories DSP I have been evaiuated for the task of 
implementing real-time Adaptive Predictive Coding (APC). 
We have surveyed the architectural features of these three 
DSPs and have compared and contrasted their expected 
performance in implementing real-time APC. 

Digital signal processors are typically benchmarked 
using some of the more common signal processing algorithms 
such as digital filters and FFTs. They are usually compared 
solely on the basis of the execution times of these 
computations. Unfortunately, we have found that in 
evaluating DSPs for real-time speech coding applications, 
these typical signal processing benchmarks do not provide 
us with complete information. For this reason, we have 
chosen to use an actual speech coding algorithm, real-time 
APC, as a benchmark. The decision to use APC was based 
on a number of factors. First, it is an algorithm of moderate 
to high complexity that requires a processor with considerable 
numerical processing capability. In addition, it requires a 
processor which can access an extensive amount of memory. 
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It therefore provides a reasonable indication of the processing 
power of a particular digital signal processor. Secondly, it 
fits within the category of medium- to low-bit rate speech 
coding algorithms which we are currently interested in 
implementing at Lincoln Laboratory. We postulate that a 
DSP’s ability to implement APC is reasonable assurance that 
comparable algorithms could also be implemented on that 
DSP. 

This report shall be organized as follows. In section 
2, we describe pertinent aspects of the APC algorithm as 
they relate to the algorithm’s implementation. In Section 3, 
we briefly review and compare the architectural features of 
the three digital signal processors that we have considered. 
In Section 4, we summarize a_ software/hardware 
implementation of the APC algorithm based on the 
TMS32010. 


2. ADAPTIVE PREDICTIVE CODING 


In the present section, we briefly outline the 
fundamentals of the APC algorithm. This discussion is 
intended to serve as a means of introducing our own 
terminology and notation. We have chosen not to develop 
the theory upon which the algorithm is based. For a more 
complete treatment of APC in terms of its theoretical aspects, 
we refer the reader to a report by Viswanathan, et al., [16], 
which is a comprehensive review of the theory and also 
describes several variations and improvements that have been 
made upon the APC algorithm. For the purpose of this study, 
we have considered the APC algorithm in its most basic form. 
This particular version of the APC algorithm is similar in 
structure to the original proposed by Atal and Schroeder [2]. 
A block diagram is shown in Fig. 1. Figures 1 (a) and (b) 
are the APC analyzer and synthesizer, respectively. In the 
analyzer, two predictors are employed for removing 
presumed redundancy in the input speech signal and are 
arranged in a feedback loop surrounding a one-bit quantizer. 
The predictor A(z) is a spectral predictor and is intended to 
remove the redundancy in the speech signal which is due to 
its quasi-stationary spectral properties. The spectral predictor 
has the polynomial transfer function 
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Figure 1. Block Diagrams of APC (a) Analyzer and (b) Synthesizer 


where the coefficients aj are the spectral predictor 
coefficients and the parameter P is the prediction order. The 
spectral predictor coefficients are obtained from linear 
prediction methods which will be outlined below. The 
prediction order, P, is usually a specification of the particular 
implementation and is typically equal to 4. The second 
predictor, B(z), is the pitch predictor. It removes redundancy 
in the speech signal that is due to the quasi-periodicity of 
voiced sounds. The pitch predictor has the transfer function 
B(z) = az—T (2) 
where a is the pitch prediction coefficient and T is the 
estimated pitch period. 

In the block diagram of Fig. 1, we show pitch 
prediction being followed by spectral prediction. At sample 
n, the predicted speech signal, s[n], is subtracted from the 
incoming speech signal, s[n], and the resulting residual, din], 
is quantized, coded and transmitted to the receiver. The 
quantized residual is also fed back within the analyzer loop. 
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In the receiver, the spectral prediction signal is computed 
first and is added to the received residual before the pitch 
predic tion signal. The pitch prediction signal is then added 
in, and the resultant synthesized speech signal is passed to 
a digital to analog converter for the reconstruction of the 
analog speech signal. 

The one-bit quantized residual fed back within the 
analyzer and the decoded residual din] in the receiver are 
both unit variance signals and are scaled by a multiplicative 
factor q. The factor q is an estimate of the standard deviation 
of d[n] and forces the quantized residual signals in both the 
analyzer and the receiver to be equal to the original residual, 
d{n], with the addition of quantization noise. 

The methods used to compute the APC side parameters 
a, T, ai and q are understood by viewing the prediction 
operations in the time domain. Removal of the pitch 
redundancy in the input speech signal can be written as the 
difference equation 
(3) 


e,[n}] = sin] — as{n—T] 
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in which the signal e;[n] is referred to as the first residual. 
If the signal s[n] were exactly periodic, and if T were 
computed without error, the coefficient a would equal one, 
and the first residual would be identically equal to zero. 
However, since speech is never exactly periodic and since 
the pitch estimation process employed in APC does produce 
errors, the first residual can never be identically zero in any 
practical sense. Thus, once the pitch period has been 
determined, a is estimated in a manner which minimizes the 
mean square energy in the first residual. This results in the 
following expression for a, the normalized correlation 
coefficient 


N-1 
> s{n)s{n—T] 
n=0 
CS SSS (4) 
N-1 
YS s{n—T]s[n—T] 
n=0 


Computing the spectral predictor is actually the linear 
prediction analysis problem. In this context, instead of per- 
forming the analysis on the speech signal, linear predictive 
analysis is performed on the first residual, e,{n]. Linear 
prediction methods have been examined thoroughly in the 
literature and several solutions to the resulting normal equa- 
tions have been proposed (see for example [10], [11], 
and [12] for solution methods). In the implementation of APC 
given in this report, we have used the autocorrelation method 
of solving the linear prediction normal equations. Standard 
algorithms, such as Levinson’s recursion and the LeRoux- 
Gueguen recursion, which allow the predictor coefficients 
to be obtained from the first P+] autocorrelation values, have 
both been considered. 

The parameter q, as we have mentioned above, is an 
estimate of the standard deviation of the residual d[n] and 
is used to scale the standard deviation of the quantized 
residual d[n] in the analyzer and d[n] in the receiver to the 
same level as d[n]. Thus, when the quantized residual is used 
in the feedback loop in the analyzer, the reconstructed first 
residual e;[n] and the reconstructed speech s[n] become equal 
to e;[n] and r[n], the first residual and speech signals in the 
receiver in the absence of transmission errors. Note also that 
all of these signals will have been degraded by the identical 
quantization noise introduced at the analyzer. 

The pitch period, T, can be estimated using any number 
of methods in APC. However, practical considerations 
permit only simple methods to be employed and pitch errors 
will typically be introduced. This is not a major disadvantage, 
however, since pitch errors do not severely degrade the 
quality of the resynthesized speech in APC. In most APC 
implementations, pitch period estimates are obtained using 
either autocorrelation analysis or the average magnitude 
difference function (AMDF) [14]. In the auto correlation 
method, the autocorrelation function is computed for each 
frame of speech. The distance in lags between its peaks is 
taken as the pitch estimate. In the AMDF method, the method 
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most often employed in real-time APC, the average 
magnitude difference function is substituted for the 
autocorrelation function and is computed in a manner very 
similar to the autocorrelation function for each frame of 
speech. The distance between nulls in the AMDF is taken 
as the pitch estimate. In this study we have used the AMDF 
method and have employed the following technique. To limit 
the number of computations, only certain values of T, Ti, 
corresponding to fundamental frequency values in the range 
from 50 to 400 Hz are used. These values of Ti are 
precomputed and stored in a table. The AMDF, DI[Ti, 
rewritten as 


N-1 
DITj} = > Isln] — s{n—Tj)| (5) 
n=0 


is computed for each value of Ti. The value of Ti which 

gives the minimum AMDF value is taken as the pitch 
estimate. Another measure often taken to minimize the 
number of computations is to compute the AMDF summation 
for every fourth sample of s[{n] and s[n-T], skipping three 
samples in between. We have also taken this step to minimize 
the computation time. 


2.1 Implementation Issues and Discussion 

In a real-time implementation of APC, the side 
parameters T, a, ai, and q are usually computed as 
background tasks while in coming speech is pre-emphasized 
and buffered by foreground I/O handling routines. This 
method of arranging the computations into background and 
foreground routines immediately imposes several 
requirements on the DSP that is used to implement the 
algorithm. The most obvious requirement is speed. In order 
to process speech data in real time, the DSP must be capable 
of executing a large number of machine instructions within 
the duration of a speech analysis frame. Another important 
issue is the size of the memory that the DSP either possesses 
on chip or can access externally. The memory requirements 
are large because computing much of the APC algorithm in 
the background demands that the speech data be double 
buffered. Multi-tasking foreground and background routines 
also requires a DSP capable of controlling a relatively large 
number of external I/O sources through the use of an interrupt 
mechanism. In the following section we shall see how these 
computational and control requirements of real-time APC 
translate into DSP architectural requirements. 

In this section we have summarized some basic aspects 
of the APC algorithm. For the purpose of brevity, we have 
chosen not to describe several of the measures taken which 
improve APC’s performance. For example, much of the 
research in Adaptive Predictive Coding has been involved 
with improving the performance of the predictive quantizer 
loop. These efforts include modifying the spectral predictor 
filter so that it shapes the quantization noise in ways which 
better match the masking properties of the auditory system 
[3]. Other efforts have included the development of various 
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segmental quantization techniques that require that the 
quantization gain term, q, be computed several times per 
frame (on the order of 10) instead of just once as we have 
described [16]. The effect here, also, is to better control the 
properties of the quantization noise. Although including these 
techniques would strongly affect the execution of APC in 
any of the processors evaluated in this study, we felt that 
including them in the evaluation process would not provide 
further insight in assessing the relative performances of the 
DSPs. 


3. DSP SELECTION 


Before the existence of digital signal processor 
integrated circuits, DSPs employed in real-time signal 
processing architectures served primarily as number- 
crunching peripherals. In these systems, conventional 
microprocessors (e.g., the Intel 8085 and the Motorola 
68000) served as central processing units that controlled the 
flow of data throughout the system and in and out of these 
peripherals. Although these implementations have been 
relatively small in size, smaller configurations have become 
possible by integrating more system control functions inside 
the DSP itself. 

For the remainder of this section we will review the 
architectures of the AT&T Bell Laboratories DSP I, the 
Nippon Electric »PD7720, and the Texas Instruments 
TMS32010. We shall focus on each DSP’s on-chip memory 
size and on the feasibility of supplementing this internal 
memory with external RAM. Secondly, we shall focus on 
each DSP’s system control capabilities and evaluate its ability 
to manipulate the various data paths in a system. 


3.1 AT&T Bell Laboratories DSP I 

The digital signal processing integrated circuit initially 
considered in our study was the Bell Laboratories DSP I. 
The DSP has been successfully employed in other moderate 
complexity mid-rate coders at Bell Laboratories, such as Sub- 
band Coding [6] and ADPCM [4]. It therefore became a 
candidate for implementing real-time APC. 

A complete description of the Bell Labs DSP 
architecture can be found in [5]. It features: 


- a 1024 x 16 bit on-chip ROM for program and coefficient 
storage, 

- a 128 x 20 bit on-chip data RAM, 

- an extensive set of memory address registers and a 
separate address arithmetic unit, 

- an Arithmetic and Logic Unit which features a 16 x 20 
bit multiplier and a 40-bit accumulator. 


The Bell Labs DSP architecture also features a great 
deal of parallelism which permits its relatively slow 800 nsec 
machine cycle time to be effectively reduced to 200 nsec by 
a 4-stage instruction pipelining mechanism. However, 
instruction pipelining is not always possible in many signal 
processing operations and is generally effective only in the 
type of computations required of digital filters and correlators 
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(i.e, register transfers, multiplies and adds). Therefore, the 
Bell Laboratories DSP’s 800 nsec cycle time is prohibitively 
slow for the implementation of many signal processing 
algorithms such as autocorrelation coefficient calculations 
and the other computations required in APC. 

Aside from its relatively slow machine cycle time, we 
perceive that the major problem involved in using the Bell 
Laboratories DSP for real-time APC implementation is the 
necessity for transferring speech and other data between 
external memory and the limited on-chip RAM. The use of 
RAM is essential because the 128 word internal RAM that 
the DSP provides is inadequate for storing the large speech 
buffers required for background computation of the APC side 
parameters. The DSP allows external memory to be 
substituted for the internal 1K program/coefficient ROM 
through a reconfiguration of the device and by using the 
multiplexed address/data bus which is brought off-chip. 
Unfortunately, external RAM cannot be substituted for the 
on-chip ROM because the DSP has no external memory write 
capability that directly utilizes this data bus. 

The architecture of the DSP supports serial I/O with 
external devices through the use of asynchronous serial 
interface lines which could be used for transferring data to 
an off-chip memory. However, there are two problems 
associated with an approach which would utilize the serial 
data ports for memory I/O. The first problem is that the serial 
lines must be multiplexed between the memory and normal 
I/O devices, such as codecs and modems that would 
ordinarily communicate with the DSP. This problem could 
possibly be fixed by using external hardware that would 
arbitrate among these sources of data. The second, more 
critical problem, is data throughput. The following sample 
calculation determines the amount of time that would be 
required for serial I/O in computing an AMDF pitch estimate 
and illustrates the nature of the data throughput problem. At 
the maximum input clock rate, the DSP requires 400 nsec/bit 
to bring data on-chip. Therefore, reading a 16-bit word from 
memory would require 6.4 usec. For each point in the AMDF 
summation, two speech samples, s[n] and s[n-T], must be 
read from external memory. Assuming that a frame consists 
of 160 samples and that 3 samples are skipped between 
summations, computing the AMDF for a single value of T 
would require 2x40x6.4 sec or 0.51 msec. Computing the 
AMDF for 60 values of T would require that 30.7 msec be 
spent in memory I/O alone. This is in excess of common 
speech analysis frame durations and, thus, demonstrates that 
using the Bell Labs DSP to implement APC in this fashion 
is infeasible. 


3.2 The Nippon Electric Co. »PD7720 Signal Processing 
Interface 

The NEC pPD7720 Signal Processing Interface (SPI) 
has been used previously at Lincoln Laboratory in a compact 
linear predictive vocoder implementation [7]. The success 
experienced with the NEC »PD7720 in this project prompted 
us to consider the NEC pPD7720 as a candidate for 
implementing real-time APC. The NEC y»PD7720 
architecture is described in [13] and features: 
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- a 512 x 23-bit program ROM, 

- a510 x 13-bit data coefficient ROM, 

- a 128 x 16-bit data RAM, 

- a250 nsec 16 x 16-bit parallel multiplier which gives a 
31-bit result. 


Although the NEC »PD7720 has several characteristics 
in common with the Bell Labs DSP, significant advantages 
are apparent when the two DSPs are compared. These 
advantages include an enhanced I/O structure, interrupt 
service capabilities, and a 4-level stack which provides for 
up to 4-level nesting of subroutines. The I/O structure 
contains-several features which enable the NEC »PD7720 
to be easily interfaced with a _ conventional 
microprocessor/controller, including an 8/16-bit parallel I/O 
data port which attaches directly to a system data bus. The 
NEC pPD7720 allows this data port to be configured for 
DMA mode data transfers between it and other system 
devices. This DMA capability ideally permits speech and 
other data to be loaded into the NEC »PD7720 in 128 word 
blocks for parameter computation (see discussion below). 

A typical system architecture employing several NEC 
pPD7720 SPIs is shown in Fig. 2. An architecture similar 
to this was implemented in the Lincoln Laboratory compact 
LPC vocoder, and we assert that an APC implementation 
based on the NEC »PD7720 SPI would also resemble the 
architecture shown in Fig. 2. A conventional microprocessor 
is employed as a system controller. In the figure, we have 
indicated that an Intel 8085 could serve in this function; 
however, a number of other commercially available 
microprocessors could be used as well. The primary purpose 
of the system controller is to manipulate the data paths among 
the multiple NEC »PD7720 SPls in the system. 
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Although we have shown an indefinite number of SPIs 
being deployed in the system shown in Fig. 2, we can assume 
that at least two (most likely three) SPIs will be needed to 
implement APC. As was true of the Bell Labs DSP, the SPI 
possesses only 128 words of internal RAM which is 
insufficient for the data buffering requirements of APC. 
However, since the data throughput of the NEC »PD7720 
SPI is considerably faster, it is possible to spread the memory 
requirements among the several SPIs in the system and have 
these devices pass data among themselves under the direction 
of a system wide controller. Another alternative is to deploy 
a separate external RAM with a seperate DMA controller. 
Both of these approaches have been included in the 
architecture shown in Fig. 2. The DMA controller in the 
system architecture is used to handle block (and stream) data 
transfers between the SPIs and the external RAM, and the 
control microcomputer is used to contro] the data flow 
between the SPIs. 

In addition to the large amount of memory required, 
the computational requirements of real-time APC also make 
it necessary that several SPIs be used. We base this 
assumption on the distribution of the computational load in 
the Lincoln Laboratory LPC vocoder design [7]. The real- 
time LPC implementation, although requiring very little 
memory for data buffering, requires three SPIs. The LPC 
and APC algorithms are of comparable complexity and three 
NEC uPD7720 SPls, or possibly more, will most likely be 
necessary for APC. 

When discussing the drawbacks of the Bell Labs DSP, 
we pointed out that most of the processing time required for 
computing the APC critical loops is spent accessing external 
memory. AMDF pitch estimation was the particular example 
cited. The Bell Labs DSP was ruled out because its I/O 
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Figure 2. A Typical System Architecture Employing the NEC »PD7720 
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structure was not conducive to extensive use of external 
memory. A similar situation exists with the NEC ».PD7720 
SPI; however, it is less severe. The NEC »PD7720 possesses 
an equal amount of internal RAM as the Bell Labs DSP, and 
factors alluded to previously dictate that speech and other 
data be stored in external memory. An additional factor 
which makes the use of the NEC »PD7720 internal RAM 
undesirable is the internal memory pointer system that the 
NEC ,»PD7720 provides. It was discovered when 
programming the NEC »PD7720 for LPC [9] that a 
significant amount of overhead was devoted to manipulating 
these internal memory pointers. This programming 
inconvenience makes stream processing of data in the APC 
algorithm preferable over the use of block processing 
methods, which would buffer data needed for parameter 
computation in internal RAM. 

Assuming data is to be processed in a stream fashion, 
we were able to approximate the execution times of some 
of the critical loops of the APC algorithm. Assuming external 
control of the system data paths, as shown in Fig. 2, 
approximately 4 psec per 16-bit word are required to 
exchange data with external memory [8]. If one adds these 
numbers up, the approximated execution time of the AMDF 
pitch estimation algorithm is in excess of an analysis frame 
duration. However, the execution times of the remaining 
APC critical loops are each shorter than the assumed frame 
duration of 20 msec, thereby making a NEC »PD7720 based 
implementation of real-time APC feasible if an alternative 
pitch estimation algorithm to the AMDF method is used. 
These results are particularly encouraging, since it has 
already been demonstrated in the Lincoln Laboratory 
compact LPC vocoder that the more sophisticated, but less 
memory intensive, Gold pitch estimation algorithm can be 
programmed to run in the NEC »PD7720 in real time. 

From these observations it seems that a real-time 
implementation of APC based on the NEC »PD7720 SPI is 
feasible. The architecture of such a system would most likely 
resemble the one shown in Fig. 2. Further determination of 
the specific hardware and software complexity, such as the 
exact number of NEC »PD7720 SPIs that would be required, 
has not been undertaken and, of course, would be the next 
step. Instead, our attention has been directed towards 
determining the feasibility of the Texas Instruments 
TMS32010. 


3.3 Texas Instruments TMS32010 

The Texas Instruments TMS32010 effectively 
combines the high numerical processing power of the NEC 
#PD7720 SPI with the control, data manipulation, and 
storage capabilities previously found only in general purpose 
microprocessors. A full description of the Texas Instruments 
TMS32010 architecture is given in [15]. We have highlighted 
some of its features here. They include: 


- a 1500 x 16-bit internal ROM, an external ROM/RAM 
memory address space of up to 4K words, 
- a 144 x 16-bit internal RAM, eight 16-bit parallel I/O 


ports, 
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- a 200 nsec 16 x 16-bit parallel multiplier with a 32-bit 
ALU/accumulator, 
- a 200 nsec machine cycle time. 


The most important advantage that the TMS32010 
offers over the Bell Laboratories DSP and the NEC »PD7720 
SPI is its 12 bits of external memory address space. A 16-bit 
word can be accessed from external ROM/RAM in two to 
three machine cycle instructions. We have found that based 
on this relatively short memory access time, the sum of the 
execution times of all of the critical loops of the APC 
algorithm is less than a 20 msec frame duration (see Section 
4.2 below). Therefore, the APC algorithm could most likely 
run in real-time in a single chip, TMS32010-based system. 

The TMS32010 provides two modes of accessing off- 
chip memory. In mode I, the TMS32010 generates the 
necessary memory addresses internally, and data is 
transferred over the 16-bit data bus. In mode II, the 
TMS32010 transfers data over its parallel I/O ports. Although 
both modes of external memory access physically involve 
the data bus, the differences between these two modes are 
important. Mode I memory transfers are effected by 
executing a three machine cycle TBLR or TBLW instruction 
in the TMS32010. When executing these instructions, the 
contents of the CPU accumulator are taken and used directly 
as the memory address. In mode II memory transfers, a two 
machine cycle IN or OUT instruction is executed. For these 
instructions the least significant three bits of the address bus 
contain a port address which can be decoded externally to 
select one of eight devices that are to send or receive data 
from the TMS32010 over the data bus. When the I/O ports 
are used for memory access, ROM/RAM addresses must be 
provided externally. The trade-off between using these two 
modes of memory I/O is one made between hardware and 
software efficiencies. Although the TBLR and TBLW 
instructions nominally require three machine cycles to 
execute, extra machine cycles are typically required for 
saving and restoring the contents of the accumulator which 
are involved in the ongoing computation. As a result, instead 
of three machine cycles being required for memory I/O, the 
amount of time often turns out to be on the order of seven 
to eight machine cycles. On the other hand, memory I/O 
involving the data ports is guaranteed to require no more than 
two machine cycles. The disadvantage in the case of mode 
II is that external hardware is needed for generating the 
required memory addresses. 


3.4 Discussion and Summary 

With ample external support hardware, any of these 
DSP integrated circuits could be used to implement real-time 
APC. However, given that the implementation should require 
a minimal amount of hardware, the TMS32010 seems to be 
the best alternative among the three. Using the AMDF pitch 
estimation computation as a comparison task, the Bell Labs 
DSP would require the most extensive amount of external 
support hardware followed by the NEC »PD7720 requiring 
an external memory controller and a control microcomputer, 
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and, lastly, the TMS32010 requiring just an external memory 
controller. 

In this section we have made comparisons of these 
DSPs based primarily on their memory accessing capabilities. 
Another important distinguishing feature which allows these 
DSPs to be compared is their capability for providing 
foreground/background multi-tasking of computations. In this 
respect, we require a DSP to have the ability to handle 
interrupts. As far as satisfying this particular requirement, 
the Bell Labs DSP does not support interrupts while the NEC 
pPD7720 and the TMS32010 do. 

Based on the issues discussed in this section the 
TMS32010 is the processor of choice among the three DSPs 
that we have evaluated for the task of implementing APC. 
Although complete evaluations based on speech coding 
algorithms other than APC have not been carried out, it is 
reasonable to assert that the TMS32010 would be most 
appropriate for a variety of other moderate complexity speech 
coding algorithms as well. 


4, APC PROCESSOR DESIGN 


For the remainder of this report, we shall summarize 
the results of this APC processor design study by briefly 
describing one possible hardware/software implementation 
of real-time Adaptive Predictive Coding using the Texas 
Instruments TMS32010. Because the basic APC structure 
described in Section 2 will support a variety of speech 
sampling and data transmission rates, we begin this summary 
of the algorithm’s implementation by describing, in closer 
detail, the specifications of the APC algorithm that we 
propose to implement. In Section 4.2 we give results of a 
critical loop timing study, and we describe a control strategy 
for fitting together the various software components of the 
APC algorithm. We have concluded from this programming 
exercise, in which the critical loops of the APC algorithm 
were coded in the actual TMS32010 instruction set, that the 
most important consideration in designing the software for 
APC is the fashion in which data is stored in memory. In 
Section 4.3 we describe one possible memory allocation 
scheme. We close our discussion of the APC implementation 
by describing the hardware requirements. We have concluded 
from performing a preliminary hardware design that the 
majority of the hardware design effort must be directed 
towards providing a high speed interface with memory 
external to the TMS32010, and to developing a method of 
communicating with several external input/output devices 
under interrupt control. The details of this preliminary 
hardware design are summarized in Section 4.4. 


4.1 Algorithm Specifications 

In Section 2 we outlined the structure of the APC 
algorithm. The structure described in Section 2 will support 
a range of data transmission and speech sampling rates. The 
version of the APC algorithm chosen for this study is 


intended to operate at a transmission rate of 9.6 Kbps and 
at a sampling rate of 8000 samples/sec. The average frame 
duration is in tended to be 20 msec which corresponds to 
an analysis frame size of 160! samples. The transmission 
frame size for this sampling rate is 192 bits which are 
allocated among quantized residual and side parameters as 
shown in Table I. 


Table 1. Bit Allocation per Frame 


QUANTITY 


Residual 


Pitch 


Pitch Predictor 
Coefficient 


Quantizer 
Level 


Reflection 
Coefficients 


4.2 Critical Loop Timing and Control Strategy 

The first step actually taken in the evaluation of the 
TMS32010 was a critical loop timing study in which the 
various portions of the APC algorithm were coded in the 
TMS32010 instruction set and approximate execution times 
of the code were calculated. This critical loop timing study 
was useful in obtaining two types of information. First, it 
has given us some benchmark timing figures which could 
be used as an objective measure for comparing the 
TMS32010 against the other digital signal processing 
integrated circuits. Secondly, this timing data provided 
information which was later used in decisions affecting the 
hardware design. 

Two versions of several of the critical loops were 
coded. In version I of the code, TBLR and TBLW 
instructions were used to transfer data to and from off-chip 
memory. In version II, the data port I/O instructions, IN and 
OUT, were used to transfer data to external RAM. We have 
summarized the execution times of the software units in Table 
II. All of these execution times are based on a 200 nsec 
machine cycle time. Listings of the code written for these 
critical loops appear in the appendices. From examining the 


‘Note that the frame duration is measured with respect to the transmission and receiver modem clocks which are asynchronous to the sampling rate clock. 
Therefore,the analysis frame may deviate slightly from the 160 sample normal size. 
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Table 2. Summary of Critical Loop Execution Times 


EXECUTION* 
TIME (msec) 


10.80-11.00 


AMDF Pitch Estimation 


ALPHA Calculation 


1st Residual Calculation 
& LPC Autocorrelation 
Analysis 


Reflection Coefficient 
Calculations 
Predictive Quantizer 2.84 


2.00 


Receiver Loop 


1.09-1.28 


ADC-DAC 1/0 


Transmit Modem 1.40 


1/O Handler * * 


Receive Modem 
1/O Handler * * 


*Execution time per frame 


OPERATION VERSION | VERSION Il 
% REAL EXECUTION* 
TIME TIME (msec) 


54.0-55.0 


23.79-24.18 119.0-120.9 17.19-18.14 86.0-90.7 


% REAL 
TIME 


1.40-2.16 


1.60 


1.09-1.28 


1.4 


**These are foreground routines. Execution times were calcultaed by multiplying the per sample execution times 


by the 160 sample frame size. 


execution times of the critical loops in Table II, it is apparent 
that the code which incorporates the TBLR and TBLW mode 
of external memory access could not execute within a 20 
msec frame duration. 

Thus, if only one TMS32010 were used, we would not 
expect the algorithm to execute in real time. One therefore 
would have to partition the APC algorithm among two or 
more TMS32010 DSPs. On the other hand, if the I/O ports 
are used for transferring data to external memory in 
conjunction with external memory address generators, it 
seems possible that a single TMS32010 would be all that is 
needed. 

During our study we briefly examined the trade-offs 
between implementing a single TMS32010-based APC 
processor versus one which incorporates two TMS32010 
DSPs with version I of the APC software partitioned. 
Although a dual TMS32010-based processor possesses 
potentially more processing power, it is difficult to make 
effective use of it due to interprocessor communication 
overhead. In designing a dual TMS32010 architecture, the 
first task is to find a reasonable partition of the APC software 
between the two TMS32010 DSPs. The most straightforward 
partition, a direct split between the analyzer and synthesizer, 
would result in an unbalanced distribution of the 
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computational load. The analyzer requires a significantly 
greater proportion of the computational resources. In fact, 
given the execution times of the analyzer loops, it is 
improbable that a single TMS32010 would be able to execute 
all of the analyzer routines within the 20 msec frame duration. 
Therefore, a more uniform partitioning of the APC 
algorithm, in terms of computational requirements, is needed. 
This alternative has a more subtle drawback in terms of data 
communication overhead. Although the APC analyzer 
software can be segmented into several autonomous units, 
practically all of these units process the same speech data. 
If these units are contained in separate TMS32010s, then 
either entire speech buffers would have to be passed among 
DSPs or each of the DSPs would have to access identical 
copies of the same speech data. The first option entails a 
significant amount of processing time being devoted to I/O 
among the processors. The second option would require that 
either memory be shared or data be copied to both TMS32010 
processors. Both of these memory management schemes are 
unduly complex. 

After recognizing the difficulties involved in using a 
dual TMS32010 system, we decided not to pursue this effort 
and, instead, adopted the single-chip design which uses the 
I/O ports for transferring data to external memory. For the 
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remainder of this section and the next, we describe a software 
control strategy and external memory allocation for this one 
chip design. We use the term, software control strategy, to 
refer to the method used to combine software units. Our 
philosophy in adopting a software control strategy in this 
APC implementation has been to relegate as much of the 
computation to background tasks as possible. This allows the 
foreground routines, which are executed upon interrupt from 
the external I/O sources, to be simple I/O handlers that 
merely control the pointers required for buffering the data. 
In Table II, foreground routines are identified with two 
asterisks. 

The obvious disadvantage of computing the APC 
routines as background routines is the overall increased 
demand for memory. However, as we shall illustrate in the 
following section, the memory requirements of this APC 
implementation fit safely within the confines of commercially 
available RAMs. 


4.3 Memory Allocation 

In Fig. 3 we show how memory is allocated among 
the APC software units. A total of 2048 words of RAM are 
required and have been divided among eight 256-word pages. 
The 256-word page size is used to accommodate the use of 
8-bit external address generators which are used as memory 
pointers as described in the next section. Computing the 
analyzer routines as background tasks normally requires that 
the incoming speech be double buffered. Actually, triple 
buffering is used. The extra buffer, stored on a separate 
page, is provided for storing the previous pitch period of 
speech that is necessary in computing the pitch period 
estimate, T, the pitch predictor coefficient, a, and the first 
residual signal, e;[n]. During these calculations, the speech 
sample s[n-T] is needed and, depending on the value of T, 
could reside on the previous pitch period page. The two 
buffers of input speech used in these background 
computations, the current processing frame and the previous 
pitch period, are arranged contiguously on pages 0 and 1 
so that the same 8-bit memory pointer, with the addition of 
a 9th bit used for page crossing, can be used to access these 
two pages of data as a single 512-word block. We have thus 
eliminated the overhead in software involved in page 
crossing. The 9th bit of the pointer to speech sample s[n-T] 
is set by the hardware when it reaches the end of page 0. 
A separate pointer to the speech sample s[n] is initialized 
to the bottom of page | and never crosses the 0/1 or the 1/2 
page boundaries. 

The reconstructed speech signals in the analyzer and 
synthesizer (i.e., the state space of the recursive pitch 
prediction filters) are stored in pages 2 and 3 which are 
implemented in hardware as circular buffers. The 256-word 
buffer size is more than adequate for storing the state space 
which has a maximum length of 160 points. 

Since we have decided to compute the predictive 
quantizer and receiver loops as background routines, the 
single bit/sample residual data must also be double buffered, 
as do the APC side parameters. However, since this residual 
data stream is serial, we can reduce its storage requirements 
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Figure 3. Ram Allocation for APC Implementation 


by packing it into 16-bit words. This data is stored on page 
7, along with the APC side parameters. 

We have eliminated the need in the analyzer for 
buffering the first residual signal by combining the 
autocorrelation computation with the calculation of the first 
residual signal. Through the use of a first-in first-out buffer 
maintained in internal RAM (see code in the appendix) for 
storing only P+1 first residual values, we are able to compute 
the first residual autocorrelation values that are necessary 
for computing the LPC spectral predictor coefficients directly 
from the speech signal. 

ROM is needed for storing program instructions and 
data constants. Although we have not completely specified 
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the amount of RCM which will be needed, we have assumed 
that no more than 2048 words of ROM will be required. The 
hardware implementation of both ROM and RAM will be 
discussed in the following section. 


4.4 Hardware Design 

There are two principal features of the APC processor 
hardware. The first is a high speed external memory interface 
circuit. This circuitry provides two separate memory address 
generators which are operated under programmed control 
of the TMS32010. The second major feature is an interface 
between the TMS32010 and four external I/O devices (the 
analog to digital and digital to analog converters, the transmit 
modem, and the receive modem). This interface allows the 
external devices to communicate with the TMS32010 CPU 
on an interrupt basis. For the remainder of this section, we 
outline our approach for designing the APC processor 
hardware. 

A functional block diagram of an APC processor 
architecture is shown in Fig. 4. In this figure, we have labeled 
the external memory controller circuit and the external I/O 
interface portions of the architecture explicitly. The 
architecture permits access to external memory from the 
TMS32010 under the two modes described in the previous 
sections, using either the memory address bus in conjunction 
with the TBLR/W instructions or the port address bus for 
faster access. The memory address bus is used primarily for 
fetching program instructions and constants from RCM via 
mode I. Under mode II, the data stored in locations 0-1023 
of RAM are designed to be accessible using the address 
generation logic which is contained in the external memory 
controller circuitry. In order to retain maximum flexibility, 
we have made all 2K of RAM accessible to the TMS32010 
under both modes by multiplexing the address bits input to 
the RAM devices. 

The data ports can be thought of as eight physical ports 
which are directly tied to the TMS32010. In actuality, data 
transfers involving the ports will utilize the data bus as well. 
A 3-bit port address (PAC-3) is decoded and is used to select 
one of eight devices which is to send or receive data from 
the TMS32010 over the bus. In the proposed system shown 
in Fig. 4, three of the eight ports are used to interface the 
TMS32010 to external I/O devices. The I/O devices requiring 
separate ports are the analog to digital converter (ADC), the 
digital to analog converter (DAC), and the parallel to serial 
converter (PSC) which subsequently connects to a serial 
transmit modem. We have been able to input data from the 
serial receive modem without the use of an explicit I/O port. 
An additional port is used to interface the TMS32010 with 
external interrupt control logic. The other four ports are used 
to interface the TMS32010 with external RAM. 


4.4.1 Hardware Design External Memory Controller 
Figure 5 is a more detailed schematic of the External 
Memory Controller circuit. In this schematic we have shown 
explicitly the memory I/O control signals which are generated 
by the TMS32010 CPU, the logic used for decoding these 
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signals, and the port/memory address bus. For flexibility in 
the software design, external memory can be used as either 
a 4K word block, consisting of both ROM and RAM to be 


_ accessed under mode I type memory transfers (i.e., by 


executing TBLR and TBLW instructions), or as separate 
ROM and RAM each consisting of 2K words. In mode II, 
the first 1K words of RAM are accessed via the data I/O 
ports in conjuction with the external address generation logic 
which shall be described below. These two modes of memory 
access are distinquished in the hardware by decoding the 
TMS32010 control signals MEN, DEN, and WE, along with 
the memory address bit Ajj. 

For the most part, mode I memory transfers are used 
primarily for fetching program instructions and data constants 
from ROM. However, we also must access pages 4 through 
7 of RAM under mode I. For mode I, the 4K block of 
memory has been partitioned into a lower 2K section of ROM 
and an upper 2K section of RAM. A simple 2-level hardware 
decoding of address bit Al1 distinguishes a read from ROM 
from a read from RAM. For a read operation from ROM, 
7 of RAM under mode I. For mode I, the 4K block of 
memory has been partitioned into a lower 2K section of ROM 
and an upper 2K section of RAM. A simple 2-level hardware 
decoding of address bit Aj, distinguishes a read from ROM 
from a read from RAM. For a read operation from ROM, 
the TMS32010 signal MEN will become active low, along 
with address bit All. These two signals cause the ROM- 
ENABLE signal to become active low, which is directly tied 
to the chip-select (CS) inputs of the ROM devices. If a read 
from RAM is to take place, MEN will again be active low, 
but A,1 will be high since RAM is contained in the upper 
2K section of the memory address space. The address bit 
Aq] is inverted and combined with MEN to generate chip 
select signals for the RAM devices (see Fig.5.). 

Modes I and II memory transfers are distinguished by 
decoding the address bit, Ajj, along with the TMS32010 
control signals WE and DEN. These signals are both active 
low and are combined with Aj, to generate a PORT- 
ENABLE signal (also active low) which enable a 3-to-8 line 
decoding of the port/address but which is assumed to contain 
a valid 3-bit port address. The decoder circuit signals to one 
of the eight devices tied to its output lines to communicate 
with the TMS32010 CPU over the data bus. Two separate 
DMA controllers are being employed as external memory 
address generators. In the schematic in Fig. 5, Advanced 
Micro Devices (AMD) Am2940s [1] are used. These 
particular devices have been chosen primarily because of 
their speed. The machine cycle time of the TMS32010 is 
nominally 200 nsec, and a reasonable, but fast, access time 
for commercially available RAMs is 50 nsec. According to 
the specifications given for the Am2940, its propagation 
delay, combined with the delays of the other combinational 
logic in the external memory interface, provide adequate time 
for data being accessed from RAM to settle on the data bus 
before the end of the TMS32010 memory read cycle. Similar 
time constraints are met for the memory write cycle. 
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Figure 4. Architecture of the Proposed APC Processor 
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Figure 5. Schematic of the External Memory Interface Circuit 


ROM-ENABLE(L) 


The Am2940s are programmable and receive 
instructions from the TMS32010 over the data bus via the 
port I/O mechanism. One of the eight data ports is dedicated 
entirely to providing initialization and other instructions to 
the Am2940s. When this port is selected, the INIT signal 
becomes active low (see Fig. 5) and the Am2940s receive 
instructions over the data bus. The format of the data 
instructions which are given to the Am2940s is described 
below. 

Although the memory requirements of the APC 
algorithm are extensive, an advantage that the algorithm 
provides is that memory access is primarily sequential within 
a page. In other words, speech samples and other data that 
are used within the same software routine will generally 
reside on the same page and will be arranged sequentially 
within that page. This way, after the DMA controllers have 
been programmed at the beginning of a routine, there is little 
interaction between the TMS32010 CPU and the address 
generators during the remainder of the routine’s execution. 
In addition, most of the computationally intensive signal 
processing routines involve sequential data fetches from two 
memory locations. The Am2940 address generators will 
increment their present addresses after an INC signal is 
generated by the Memory Sequencer circuit shown in Fig. 
5. The Memory Sequencer is a relatively simple Finite State 
Machine (FSM) which ensures that the memory pointers to 
RAM are incremented by the proper amount after each data 
transfer. Accessing the I/O ports using either RAM-PORT-1 
or RAM-PORT-2 causes both addresses to increment while 
an access via RAM-PORT-0 causes no incrementation. Thus, 
a typical autocorrelation computation would require first 
reading [n] through RAM-PORT-O and the reading s[n-1] 
through RAM-PORT-1 or 2. Since address incrementation 


(RAM-PORT-1 + RAM-PORT-2)/0 


RAM-PORT-1 + RAM-PORT-2/1 


occurs following the second read, subsequent data fetches 
will access the proper data. All of the routines which access 
external memory using the I/O ports have the memory 
pointers incremented once after each transfer, the exception 
being the routine which computes the AMDF pitch estimate. 
In this routine, three speech samples are skipped between 
each point in the AMDF summation (see Eq. 5); and the 
memory pointers must, therefore, be incremented three 
times. 

The state transition diagram for the memory sequencer 
FSM is shown in Fig. 6. Shown in the figure are three inputs 
to the FSM, RAM-PORT-1. RAM-PORT-2 and AMDF- 
FLG, in addition to the system clock. The output of the FSM 
is an INC control signal which causes both of the Am2940 
address generators to increment their memory pointers. The 
memory sequencer FSM steps through its sequence while 
the CPU is processing the data it has just read, or, in the 
case of a memory write, while the CPU is preparing to output 
another word to RAM. We have managed to save a 
considerable amount of time by having these operations 
carried out concurrently. The memory sequencer FSM 
normally sits in an idle state. After the CPU has finished 
its read or write cycle, signalled by either the RAM-PORT-1 
or the RAM-PORT-2 signals becoming inactive, the FSM 
will enter the INCREMENT 41 state. During the transition 
it generates the INC control signal which is tied to both 
Am2940s. If the AMDF pitch estimation is not being 
performed, the FSM will return to the IDLE state. If an 
AMDF pitch estimate is being computed, the FSM will 
continue to itt INCREMENT #2 and INCREMENT #3 
states, incrementing the Am2940s three more times in the 


process. 


SI 


AMDF-FLG/1 


Figure 6. Memory Sequencer State Transition Diagram 
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The external memory interface circuit is programmed 
by issuing a sequence of 16-bit instruction words over one 
of the I/O ports. Each instruction word is broken into several 
fields which are labelled in Fig. 7(a). The least significant 
eight bits contain the initialization data for the Am2940s (e.g., 
initial addresses, etc.). Bits 8 through 10 and 11 through 13 
contain the Am2940 instructions, and bits 14 and 15 contain 
the RAM access mode bits which are described in Fig. 7(b). 
A list of the Am2940 program instructions is given in [1]. 
The appropriate setting of the RAM access mode bits 
indicates to the external memory controller which memory 
page is to be accessed and the number of times the memory 
pointers are to be incremented during each memory I/O 
cycle. The access mode bits will control the setting of the 
R/W-FLG-1, R/W-FLG-2 and the AMDF-FLG signals, 
which are output from the RAM Access Control Register 
and are input to another register which selects pages 2 and 
3 of RAM, and are also input to the memory sequencer FSM. 
A table is provided in Fig. 7(b) that summarizes the settings 
of the RAM access mode bits. 


AM2940 
INSTRUCTION 


AM2940 
INSTRUCTION 


RAM ACCESS 
MODE 


ACCESS MODE 
BIT 1 | BIT O 


(b) 


MEMORY PAGE 


AND FUNCTION 


PAGES 0 & 1 
AMDF MODE ACCESS (R) 


PAGES 0 & 1 
NON AMDF MODE (R) 


PAGE 2 (R/W) 
PAGE 3 (R/W) 


4.4.2 Hardware Design-I/O Interface Circuit 

The second major task in the hardware design is to 
interface the four external I/O devices with the TMS32010. 
These external devices are to communicate with the 
TMS32010 on an interrupt basis. Since the TMS32010 has 
only one interrupt line, the control signals output from these 
I/O devices must be multiplexed. Our approach has been to 
design an interrupt control register which will uniquely 
indicate the presence of an interrupting device by one of its 
bits being set. The register is read by the TMS32010 any 
time after the interrupt has occurred. 

The I/O interface circuit is shown in Fig. 8. Buffer 
registers are provided between the TMS32010 and the analog 
to digital converter (ADC) and the digital to analog converter 
(DAC). These registers allow the data transferred between 
the TMS32010 and these devices to be double buffered, 
eliminating the need for the TMS32010 to be directly 
involved in any type of handshaking procedure. A parallel 
to serial converter (PSC) is provided between the transmit 
modem and the TMS32010. The parallel to serial converter 


AM2940 DATA 


Figure 7 (a). External Memory Interface Program Instruction Word 
(b). RAM Access Mode Bits 
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Figure 8. External I/O Device Interface Circuit 


changes the parallel data output from TMS32010 into a serial 
bit stream appropriate for the transmit modem. It also serves 
as a data buffer as well. 

The interrupt control register (ICR) is shown in Fig. 
9. It multiplexes three externally generated I/O control signals 
on to the single interrupt line of the TMS32010 through the 
use of a single logic gate. These three control signals are 
the ADC/DAC sampling clock, the transmit modem clock, 
and the receive modem clock. The ICR is actually a 4-bit 
register, with the fourth bit being used to store the serial input 
data from receive modem. The ICR is implemented as a set 


ADC/DAC 
CLOCK 
e@ 


\/ Qy 


O 
CLOCK 


\/ Q2 


CLOCK 
O 


of four D-latches. A typical interrupt service scenario would 
proceed as follows. When one of the external devices 
interrupts the TMS32010, the interrupt signal is passed on 
directly to the TMS32010 when the corresponding bit is set 
in the ICR. A bit set in the ICR prohibits the interrupting 
device from reinterrupting the TMS32010 until the bit is reset 
through an acknowledgment procedure as follows. The bit 
is reset by the TMS32010 after the contents of the ICR are 
read over one of the I/O ports and a word is written back 
with the corresponding bit set to 1. 
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Figure 9. Interrupt Control Register 
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5. SUMMARY 


In this report we have given the results of an APC 
processor design study. The system that we have proposed 
is based on the Texas Instruments TMS32010. We began by 
outlining the basic features of the algorithm and pointing out 
those aspects of the algorithm which make its implementation 
challenging. The major problem associated with 
implementing APC in real time is memory. We 
acknowledged the necessity of computing the APC side 
parameters as background tasks which inherently requires 
an extensive amount of RAM. 

As a part of this study, we examined two other DSPs 
besides the TMS32010: the AT&T Bell Laboratories DSP 
I and the Nippon Electric »PD7720 Signal Processing 
Interface. We have compared these DSPs against the 
TMS32010. The Texas Instruments TMS32010 was 
determined to be most suitable for a real time implementation 
of APC because of its speed, its ability to perform the 
numerically intensive signal processing operations required 
by APC, and its relatively sophisticated control features 
which enable it to handle the memory addressing and I/O 
requirements of APC. 

The objective of this study has been two-fold. We 
wanted to learn the relative strengths and weaknesses of the 
three DSP ICs, and we wanted to determine whether a 
compact implementation of APC and other moderate bit rate 
speech coders of comparable complexity were feasible using 
the TMS32010. Towards these ends a critical loop timing 
study was performed which allowed us to obtain some 
benchmark timing figures which were used to characterize 
the TMS32010. These timing figures also helped us in 
making some hardware decisions and allowed us to determine 
the approximate hardware requirements. 

There are several steps which can follow from this 
work. The most logical step would be for the architecture 
outlined in Section 4 of this report to be constructed. 
Although a preliminary hardware design was given in this 
report, many of the details still need to be more fully 
developed. 
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APPENDIX I 


AMDF PITCH ESTIMATION - VERSION I 


* Computes a pitch estimate from one out of 60 values which are stored in a 
* table in internal RAM. Assumes speech data to be stored in external RAM 


* accessed using TBLR 


INIT 


LOOP-1 


LOOP-2 


BIG-NUM-L 6 
BIG-NUM~H ,6 
MIN-AMDF-L 6 
MIN-AMDF~H ,6 
#60 
NUM-PITCHS ,6 


ARO , #P-TBL-ADDR 


#S-ADDER 
S-PTR, 6 


* ARO ,6 
ST-PTE,6 


AMDP-L ,6 
AMDF~H,¢6 
ARO, #NUM-SAMPLES 


MIN-AMDF-L ,6 
MIN-AMDF-H ,6 
AMDF-L, 6 
AMDF-H, 15 
SAME 


MIN-AMDF maintains min AMDF 
value, init it to something large 


Num-Pitchs is loop counter 
ARO points to pitch table SAME 


Initialize pointers to speech 
Data s{n) and s[n-T]) 


Compute pointer to s(n-T} by 
subtracting away current pitch 


Initialize Accumulated AMDF 
value 
ARO is loop counter 


Load ACC w/ pointer to s(n} 
Read s{n] 

Update pointer skipping 
Three speech samples 

Do same w/ s[n-T 


Compute /s[n} - s{n-T)/ 


Update AMDF value 


If not finished w/ current 
pitch value, loop 


Compare current AMDF value with 
previous ainisus 


If current is larger, loop 
to next pitch value 


APPENDIX I (continued) 


AMDF-L, 6 
AMDF-H,@ 
MIN-AMDF~L, 6 
MIN-AMDF-H ,6 
*,ARD, 6 
T,6 Save the present pitch value 


If current AMDF is saaller 
make it the new minimus 


4+ , ARS Loop to next pitch value 
NUM~PITCHS ,6 

ONE ,6 

NOM-PITCHS ,6 

LooP-1 
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APPENDIX II 


AMDF PITCH ESTIMATION ~ VERSION II 


x 
* 
* 
* the data ports using the IN instruction. 
* ized by a separate initialization routine. 
INIT ZALS BIG-NUM-L,@ 
ADDH BIG-NUM-N ,6 
SACL MIN-AMDF-L,@ 
SACH MIN-AMDF~H ,6 
LACK #60 
SACL NUM- PITCHS ,6 
LARK ARO, #P-TBL~ADDR 
ZAC 
SACL ACCESS-CTR-WD ,§ 
CALL EXT-MEM-INIT 
LOoP-1 LACK #ADDR-S 
SUB *,6,ARO 
ADD REIN-INS,11 
ADD LOAD- INS ,8 
SACL INSTR ,@ 
OUT INSTR, INTRFC-PORT 
LOOP-2 IN S, RAM-PORT-1 
IN ST, RAM-PORT-2 
ZALS S,é 
SUB ST,6 
ABS 
ADD AMDF-L ,6 
ADD AMDF-H,15 
SACL AMDF-L,6 
SACH AMDF~H ,@ 
B10Z LOOP-2 
ZALS MIN-AMDF-L 6 
ADDH MIN-AMDF-H ,6 
SUB AMDF-L ,6 
SUB AMDF~H,15 
BLZ SAME 
ZALS AMDF-L,6 
ADDH AMDF-H ,6 
SACL MIN-AMDF~L ,6 
SACH MIN-AMDF-H ,6 
ZALS * ARO 
SACL T,6 


Compute a pitch estimate from one of 60 values stored in a table in internal 
RAM. Assumes speech data to be stored in external RAM which is accessed via 
External memory interface is initial- 


Initialize minimum AMDF value to some SAME MAR 
big number ZALS 
SUB 
SACL 
Initialize counter for number BGEZ 


of pitch values 
Initialize AR# to point to pitch table 
Clear RAM access control word 
to indicate AMDF mode in 
initialization of external memory 
interface 
Init external pointers to s[n] and s[n-T] 
Issue Reinitialization Instructions to 
AM2940's 


Output Instructions over Data 
Port 
Read s[n] and s[n-T] from external RAM 


Compute /s[n]-s{n-T] / 


Update AMDF Value 


Use hardware to detect end of loop 


Compare current AMDF value w/ 
previous minimum 


If saaller, update minimum 


Save present pitch value 


APPENDIX II (continued) 


a+ , ARS 

NUM-PLTCHS , 6 

ONE ,@ 

NUM-PITCH ,6 

LOOP-1 Loop if not finished 
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INIT 


LOOP-4 


LOOP-1 


LOOP-2 


LACK 
SACL 
SUB 

SACL 
LACK 
SACL 


APPENDIX V 


COMBINED 1ST RESIDUAL AND AUTOCORRELATION COMPUTATION 


Computes 5 autocorrelation values along with the let residual direct 
from the speech signal. 


Aseumes that speech is stored in external 


> 

#ORDER 
COR-COUNT, 6 
ARL, # ~ADDR 


*+,AR1,6 
*- JARI 6 
*- ,AR1,6 


a+ ’ ARI s ) 

*+ ,ARL 6 
COR-COUNT ,6 
ONE 


RAM and that it is accessed using TBLR 


Initialize pointers to speech 
data 


Initialize main loop counter 


Initialize autocorrelation values 
to zero 


Read s{n} 


Read s[n-T] 


Compute el(n) = a{n]-  s[n-T] 


Place el [n} value on a Pirst-in 
first-out, stack which retains 
five most previous residuals values 


Set up loop to compute correlation 
values 


Update, R# through R4 
values 


Loop-3 


APPENDIX V (continued) 


COR-COUNT , 6 
LOOP~2 

ARO, #e4-ADDR 
ARI, #ORDER 


Reorder values in the residual FIFO 


Update loop counter 


REDO LOOP 
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* 


LPC~INIT 


INIT-LP-1 


INIT-LP-2 


INIT~PTRS 


LG-REC 


APPENDIX VI 


LEROUX-GUEGUEN RECURSION FOR COMPUTING LPC PARCOR COEFFICIENTS 


#4 
INIT-COUNTER 6 
ARO, #ADDR-RO 
ARL, #ADDR-E6 


A+ ARO 

+, AR] 

*+ ARO, 
INIT-COUNTER 
ONE ,6 
INIT-COUNTER 
INIT-LP-1 


#3 
INIT-COUNTER 
ARO, #ADDR-R1 
ARI, #ADDR-E-1 


INIT-COUNTER 
INIT-COUNTER ,6 
INIT-LP-2 


ARO, #ADDR E-ARRAY 
ARO, E-ARRAY-START 
ARO, #ADDB-ké 
ARO, K-PTR 

#7 
NUM-INTERATIONS, 6 
ARO, #ADDR-END 
ARO, ENO-PTR 

ARO, #ADDR-ENI 
ARO, ENI-PTR 


ARO, EN1-PTR 
A+ ARS 
NUMERATOR 
ARG, ENT-PTR 
ARO, FNO-PTR 
*-ARO 
DENOMINATOR 
DIVIDE 


Set up pointers to transfer 
autocorrelation values to init recursion 
Point to R[6] 

Point to e(¢g} 

Auto correlations are double word 


(use upper word only) 
(skip a word) 


Load the other three autocor. values 


Aux registers pointed to data 


skip word for double worded 
autocorrelation 


init pointer to e(9) [-ptnt2} 
init number of iterations 


init pointer to e(n) (6) 


init pointer to e(®) [n+1] 


LG LOOP 


TERLP 


APPENDIX VI (continued) 


QUOTIENT 
ARO ,K~-PTR 
bd » ARO A) 
+, ARG 


ARO, K-PTR 

ARO, #ADDR-E-END 
AR], E~ARRAY-START 
AR1, CROSS-PTR 
AR], #ADDR-SCR-END 
AR1, TO-PTR 
NUM-ITERATIONS 
DONE 

LG-COUNTER 


*+, ARO, 6 
AR1, CROSS-PTR 
#+, ARI 


AR1, CROSS~PTR 


NUM- ITERATIONS 
LG-COUNTER, ¢ 
ARO, #ADDR-E-END 


NUM-ITERATIONS 
ONE, 6 
NUM-ITERATIONS, 6 


ARO, E-ARRAY-START 


*+, ARS 


ARO, E-ARRAY-START 


LG-REC 


Init pointers for from data 


Init ptr to put cway deta 


e)(4] Accumulator 


kL aje’™ [n+1-1) 
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APPENDIX VII 


PREDICTIVE QUANTIZER LOOP - VERSION I 


* UPDATE 
* Assumes speech data as well as the pitch predictor state space 
* is stored in external RAM. The pitch predictor state space is kept 
* ina circular buffer which is accessed using TBLR and TBLW 
* instructions 
INIT ZALS N, 6 
SACL COUNTER, $ Initialize Loop Counter 
ZALS #S-ADDR, 6 Initialize Pointer to Incoming speech 
SACL S~PTR 
LOOP-1 LARK ARG, #A-ADDR ARG points to predictor coefficients 
LARK AR1, # El-ADDR AR] points to spectral filter 
ZAC state space 
LT &+, ARI Compute spectral predictor OUTPUT-R 
MPY w+, ARO 
LOOP-2 LTD *+, AR] 
MPY *+, ARO 
BANZ LOOP-2 
APAC 
SACH Y, 6 Y contains Spectral Prediction 
ZAC COMPUTE PITCH PREDICTION 
LT ALPHA 
MPY RT 
APAC INPUT-R 
SACH xX, 6 X CONTAINS PITCH PREDICTION 
ZALS S~PTR, 6 
TBLR Ss 
ADD ONE, 6 
SACL S-PTR, 6 
ZALS s Subtract two predictions from 
SUB xX, 6 input speech 
SUB Y, @ 
BGEZ DIFF~-POS Quantize residual, if neg. 
ZAC tranemit zero 
SACL D, 6 Scale variance of quantized 
LT 6 residual 
MPYK MINUS~1 
APAC 
B UPDATE 
DIFF-POS LACK ONE If residual is positive, transmit 
SACL D, @ one 
ZALS Q, 6 


APPENDIX VII (continued) 


B-OUT- PTR ,6 
ONE ,# 
R-OUT-PTR,6 
R-BOTTOM, ¢ 
OUTPUT-R 
#ADDR-RBUP 


R-OUT-PTR, 6 
R 


R-IN-PTR, 6 
ONE, 6 

R-IN-PTR, 6 
R~BOTTOM, 6 
INPUT-RB 

#ADDR-RBUF 
R-IN-PTR, 6 


R-IN-PTR, 6 
RT 


COUNTER, 6 


Compute spectral prediction filter 
state variable 


Compute Pitch Predictor State 
variable and store it 


Compute pointer into circular buffer 
for storing data 


If at end of buffer, point 
back to beginning 


Compute pointer into circular 
buffer for retrieving data 


APPENDIX VIII APPENDIX VIII (continued) 


PREDICTIVE QUANTIZER LOOP - VERSION II 
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ZAC 
* SACL D,o If quantized residual is negative 
* Aseumes input speech and reconstructed speeech data to be stored LT Q transmit zero 
* in external RAM. The input speech is accessed using TBLR. The MPYK MINUS 1 
* reconstructed speech is accessed using the I/O porte and the exter- APAC 
* nal memory interface. The serial quantized residual signal is packed B UPDATE 
* into 16-bit words. 
DIFF-POS LACK #0NE If residual is positive transmit one 
INIT ZALS N,6 Initialize loop counter SACL D6 
SACL COUNTER, 6 ZALS Q,6 
LACK #S-ADDR,® Initialize Pointer to input speech 
SACL SPTR,O data UPDATE ADD Y,6 Combine quantized first residual 
LACK #D-ADDR SACK El, 6 
SACL D-OUT-PTR,6 ADD xX, 6 Compute reconstructed speech and 
ZALS THREE store it 
SACL ACCESS-CTR-WD ,6 Initialize external memory OUT R, RAM-PORT-1 
CALL EXT-MEM-INIT interface for read/write mode IN RT, RAM-PORT-2 
LACK #16 
SACL COUNTER, 6 ZALS BYTE-COUNTER ,6 Pack residual bit stream 
ZAC BYTE SUB ONE, 6 
SACL RESIDUAL-BYTE ,6 SACL BYTE-COUNTER 
BLEZ NEW-OUT-BYTE 
Loop-l LARK ARO, #A-ADDR Compute spectral prediction ZALS RESIDUAL-BYTE, 1 
LARK AR1,#E1-ADDR ADD D,O 
ZAC B GO-ON 
LT *— ARI 
MPY * ARO NEW ZALS D-OUT-PTR,# 
TBLW RES IDUAL-BYTE 
Loop-2 LTD *- ARI ADD ONE, 6 
MPY *-, ARO SACL D-OUT-PTR,6 
BANZ LOOP-2 ZALS Do 
APAC SACL RESIDUAL-BYTE 
SACH ¥,0 LACK #16 
ZAC Compute pitch prediction SACL BYTE-COUNTER ,f 
LT ALPHA 
MPY RT GO-ON ZALS COUNTER ,6 
APAC SUB ONE ,@ 
SACH X,6 SACL COUNTER, 6 
ZALS S-PTR,O BGEZ LOOP-1 
TBLR S- PTR Read input speech sample 
ADD ONE, 6 
SACL S-PTR 
ZALS s 
SUB X,6 
SUB Y,6 Compute residual and quantize 
BGEZ DIFF-POS 
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INIT 


LOOP~1 


SHIFT 


Ie is accessed using TBLR. 
buffer in external RAM. 
Ports. 


APPENDIX IX 


RECEIVER LOOP 


Assumes residual input is packed in 16-bit words in external RAM. 
Synthesized speech is stored in a circular 


This buffer is accessed via the external 1/0 


#N 

COUNTER ,6 
#D-IN-ADDR 
D-IN-PTR,6 

FOUR 
ACCESS-CTR-WD ,6 
EXT-MEM-INIT 


BYTE~COUNTER 


BYTE~COUNTER 
SHIFT 

D-IN-PTR 
RESIDUAL-BYTE ,# 
ONE ,6 

D-IN-PTR 

#16 
BYTE-COUNTER ,6 


RESIDUAL-BYTE ,I 
RESIDUAL~BYTE ,f 


ONE ,é 
BYTE-COUNTER 6 


Oo 


S-HAT ,@ 

S-HAT, RAM-PORT-1 
ARO, #A-ADDR 

ARI, #£-ADDR 

2 »ARI 

* ‘ ARO 


Initialize a loop counter 


Initialize pointer to input residual 


Initialize external memory interface 
for Read/Write mode 
Initialize counter for parallel 
to serial conversion of input residual 


Obtain quantized input residual 
from 16-bit residual word 


Current residual value is high order 
bit of residual-byte 


Scale variance of residual 


Add in spectral prediction 
put result on spectral pred. 
filter state space 

Store reconstructed speech 
in circular buffer 

Compute next spectral prediction 
value 


APPENDIX IX (continued) 


Compute next pitch prediction 
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* a/D portion 


® 


ZALS 


APPENDIX X 
ADDA A/D-D/A SERVICE ROUTINE INVOKED BY 


INTERRUPT FROM A/D CLOCK 


SN, ADC Input speech from ADC Register 

SN, 6 Pre-emphasis 

OLDSN 

PRE-FAC 

TEMP ,6 Store preemphasized speech temp. 
S~IN-PTR Load pointer to input speech buffer 
TEMP Write out preemphasized speech in buffer 
ONE ,6 increment pointer 

S-IN-PTR,O 

SN,¢ delay s[n) 

OLDSN 

S-OUT~PTR-1 ,@ Retrieve pntr to output speech buffer 
YN Real in processed speech sample 

ONE , 0 Increment pointer 

S-OUT-PTR-1,6 Re~store pointer 

YN Do De~emphasis 

OLD SHATN 

PRE-FAC 

OLD SHATN,#@ Delay output speech sample 

OLD-SHATN , DAC Output speech sample 


® Check for end of buffer. If the end, switch speech buffers. 
* This is done by switching pointers. 


DONE 


ZALS 
SUB 
BGZ 
ZALS 
XOR 
SACL 
ZALS 
XOR 
SACL 


RET 


S-PTR-1,06 Check for end of Data 

S-OUT~END ,@ 

DONE 

S-OUT-PTR-1 ,6 Toggle bit 8, awitching from page 
H'100 5 to 6 (and vice versa) 


S-OUT-PTR-1 ,# 
S-OUT-PTR-Z ,6 
H'100 
S-AT-PTR-2 


Return from interrupt 
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Recognizing speech patterns is a complex process. However, a 
firmware-programmable microcomputer with special algorithms forms 
the foundation for a simple but accurate speech-recognition system. 


Firmware-programmable uC aids 
speech recognition 


This article is the third in a series on signal 
processing with a fast single-chip microcomputer. 
The series began with an interview with Kevin 
McDonough, programmable-products design man- 
ager at TI, in the May 27 issue (p. 42). The first article, 
in the same issue (p. 129), described how to imple- 
ment a vocoder with the chip; the second, in the June 
24 issue (p. 109), applied the chip to implementing 
high-speed modems digitally. 

Microprocessor-based VLSI systems can easily 
provide effective speech analysis and synthesis, but 
speech recognition is still a very formidable task. A 
general-purpose, high-performance speech-recogni- 
tion system requires a computer system of substan- 
tial power. Some low-cost, low-performance systems 
have been built, but they have proved too specialized 
and too inflexible to adapt to any reasonably wide 
range of applications (see “Key Features of Speech 
Recognizers”). The programmability and algorithms 
of a digital signal processor, however, offer speech 
recognition the cost-effective flexibility that this 
developing field needs. 

Currently, speech-recognition algorithms are 
rather primitive. But with a signal processor like the 
TMS320 microcomputer, only firmware changes will 
be required as recognition technology advances. 
System redefinition, which would necessitate new 
integrated-circuit designs, will not be necessary. 

Basically, speech recognition involves four steps: 
feature extraction, pattern-similarity measurement, 
time registration, and a decision strategy. An addi- 
tional task, called enrollment, creates speaker-de- 
pendent reference patterns, which key the recogni- 
tion to a certain speaker. 

Feature extraction transforms speech signals into 
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time-varying parameters (or features) that not only 
can be recognized by the system but also can reduce 
the amount of data in normal speech to something 
more manageable. Of all the many available feature- 
extraction methods, linear predictive coding (LPC) 
is usually the most effective. 

A 320 programmed as a feature extractor based 
on a tenth-order LPC model (which runs at an 8- 
kHz sample rate) operates on a 30-ms Hamming 
window with a frame period of 20 ms, which is short 
enough to capture most dynamic-speech events ade- 
quately. However, extracting the compressed-speech 
features from the input signal and encoding them 
into LPC-10 parameters are not simple. Neverthe- 
less, a modified Le Roux-Gueguen algorithm can be 
readily implemented in just part of a 320 to take care 
of this speech-recognition step. 

The next step—computing the similarity, or dis- 


© 
a) 
2 
a 
E 
< 


Frequency 


Inverse-filter 
operation 


Residual signal 
A(z) Y(n) 


1. Input speech utterances, after conversion into LPC-10 
form, are compared for similarity with stored templates of 
speech patterns via a digital inverse-filter operation. 
Evaluation of the residual output signal from the filter 
determines whether a match (recognition) has been 
obtained. 
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Simplified speech recognition 


tance, measured between the extracted speech 
parameters and stored reference patterns—is 
basically a frame-by-frame comparison of speech 
data with reference data, or templates. This opera- 
tion can be looked upon as passing the input signal 
for the current frame through an all-zero inverse 
filter that represents the reference data. When the 
reference data match the input data—the spectral 
valleys in the inverse match the spectral peaks in 
the input signal—a low-energy residual output will 
be passed through (Fig. 1). If the inverse filter 
matches the input data perfectly, the residual-error 
energy will be normalized at a value of 1. Typical 
normalized values of error, to be within recognizable 
limits, must be less than 1.2 in speaker-dependent 
systems within a 1.05 to 1.8 range. 

More precisely, however, the parameters that 
measure distance between input samples and stored- 
template reference patterns are computed by 
autocorrelation and residual-energy algorithms (Fig. 
2). Autocorrelation is a fairly simple software pro- 
cedure, since it consists mostly of a long sequence 
of register loadings and multiplication operations. 
The residual is thus easy to compute: multiplying 
respective terms of the two sequences—the inner 
(dot) product of the autocorrelation functions of the 
input speech parameters and the impulse response 
of an inverse filter—and summing them. 

The dynamic-programming function then scans 
for minimum-distance matches of the spoken-word 
parameters among all the stored templates to find 
suitable word-recognition candidates. The can- 


Input-speech Auto- 
correlation 
analysis 


didates are passed to a high-level decision-logic 
operation, which applies a threshold-comparison test 
and a next-closest-error threshold test to determine 
which word or words are to be selected. 


Compensating for timing length 


One important aspect of the dynamic-program- 
ming operation is a procedure that compensates for 
variations in the length and timing of the input 
utterances, but which does not change the meaning 
of the inputs—performing a so-called time warping 
procedure of the input parameters. A straightfor- 
ward approach would attempt to derive the 
reference-data and the speech-signal parameters 
from frames of the same period (20 ms in the 320’s 
LPC-10), and make a comparison for every frame. 

However, for speech recognition, comparing at 
every second input frame is adequate and leads to 
two important advantages: the amount of reference 
data that must be stored and the number of dynamic 
programming computations that must be performed 
are cut in half, and the dynamic-programming com- 
putation is simplified by eliminating a memory 
element that would be required when comparing 
every frame (Fig. 3). 

On the other hand, although the time-warping 
approach can accept speech inputs with a varying 
time factor of 2 to 1, the system prefers that 
utterances be spoken at a more or less constant 
average rate for greatest accuracy. In addition, the 
time-warping routine need not start and finish on 
specific speech-input frames. Although the process- 
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2. The algorithm flow diagram represents the sequence of operations required to implement 
the inverse-filter comparison operations—a very complex procedure. 
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Key features of speech recognizers 


Advances in the capabilities and performance of 
speech-recognition equipment center on three ma- 
jor features: 

= Connected-speech vs single-word recognition. 

s Speaker-independence. 

= Vocabulary size. 

Most current speech recognizers depend on a 
small period of silence between words—typically 
200 ms—to determine word end points. Such speech 
recognizers are discrete, single-word types. 
Connected-speech systems, on the other hand, can 
recognize words without explicit knowledge of the 
end-points. Such systems are more complex and 
expensive, but they do perform far better than 
discrete word systems and can accommodate much 
wider timing variations of the words. 

Often, a relatively simple speaker-dependent 
recognition system can offer reasonable perform- 
ance for a limited range of speakers other than the 
one for which the system is tuned. But to handle 
a wider range of speakers, the speech recognizer 
would have to contain enough reference patterns 
for each vocabulary word to be representative for 
the expected speaker population. 

Accordingly, speaker-independent recognition 
systems should be characterized by population- 
performance statistics—able to provide less than 
a certain percentage of the population. The better 
the system, the smaller the error and the larger 
the population. Despite the theoretical superiority 
that speaker-independent systems should offer, of 
the currently available systems, the speaker-de- 
pendent types perform better than the speaker- 
independent systems. 

For now, vocabulary size is limited mainly by a 
system’s ability to keep up with the rate of incom- 
ing speech data. The system’s computer processing 
time is spent mostly in making speech-input to 
reference-data comparisons. The time this process 
consumes is linearly proportional to the size of the 
vocabulary. In addition, the error rate usually 
increases with the size of the vocabulary. 

For that matter, the similarity of various words 
to one another in the composition of the vocabulary 
generates errors. Thus, the larger the vocabulary, 
the greater the potential for this type of error. Even 
a two-word vocabulary might be trouble, if the 
words happen to be similar sounding like “seen” 
and “seem.” In general, better performance is 
obtained with vocabularies consisting of long 
multisyllabie words. 


22. Firmware-Programmable uC Aids Speech Recognition 


ing time then increases substantially, the un- 
constrained end points offer two major benefits: 
system reliability improves greatly, and words em- 
bedded in adjacent or overlapping utterances can still 
be recognized. 

For 98%-accurate recognition or better with fixed 
end points, the reliability of the end-point determina- 
tion must be at least 98% accurate. This expectation 
is unrealistic, since end-point information is not 
normally specific to words as normally spoken. 
However, with an unconstrained end-point dynamic- 
programming algorithm, the system can recognize 
vocabulary words even when not discretely 
separated in time. 

But because of wide variations among different 
speakers, today’s speech-recognition devices must be 
tuned, or enrolled, to accept speech from only one 
(or a few) users. (Recognizing only a specific in- 
dividual may even be a desired attribute.) 


Matching speaker to machine 


Enrollment creates a set of feature vectors for each 
vocabulary word in the system’s repertoire. The 
vectors, which are included in the similarity 
measurement as part of the recognition process, 
define the spectral shape of the reference pattern for 
individual vocabulary words. 

Before enrollment, the initial end points of the 
reference patterns are established by energy 
measures. Actual start and stop times include some 
leeway—one frame, typically—to allow more re- 
liable end-point detection. This data becomes part 
of the stored speech autocorrelation coefficients for 
each frame that determine the inverse filters. 

After the initial template end points are formed, 
samples from the speaker update, the original 
templates. For each training input, the utterance is 
compared with the stored reference data for that 
word. If the utterance is recognized—if the detected 
error is under a certain threshold—the template will 
be updated by averaging the appropriate input- 
speech autocorrelation coefficients with those pre- 
viously stored as the reference. If the utterance is 
not recognized, then it must be reprompted, as many 
times aS necessary. 

Training applies both to the individual- and 
multiple-word utterances of the overall vocabulary. 
Successive repetition of the training procedure leads 
to significant performance improvement, and the 
amount of updating, or substitution, decreases sub- 
stantially with each additional training session. 
After five training inputs, for example, substitutions 
will decrease to about one-third that of the first 
training pass. 

To implement all those speech-recognition and 
training algorithms, more memory than the 320 
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4. To make even a minimal working speech-recognition 
system, the TMS320 needs the support of more memory and 
an analog |/O circuit. Mounted on a standard circuit board 

with the 320, such support must be coordinated by an external 
host computer and controller for speech recognition. 
However, the board, or peripheral speech processor, is not 
limited to speech recognition. A generalized configuration, 

it can be used for many other signal-processing applications. 


of the host computer via an external controller. Four 
registers support controller-to-PSP communica- 
tions, as follows, 

« An 8-bit command register, when loaded from the 
external speech-recognition-system controller, gen- 
erates an interrupt to the 320 microcomputer. 

« A 16-bit input-data register passes data from the 
controller to the PSP. 

« An 8-bit status register, an exact analogy of the 
command register, forces an interrupt of the external 
controller when the 320 commands it. 

« A 16-bit output-data register passes data from the 
PSP back to the controller. 

The interface between the controller and the PSP 
generally supports just the relatively low-bandwidth 
communications, as required by the command and 
control signals. However, the interface can support 
data transfers at rates from 10° to 1.5 X 10® words 
per second, depending on the complexity of the 
interrupt software.O 
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INTRODUCTION 


Graphics systems are now commonplace with the 
proliferation of personal and professional computers, 
particularly in the engineering and business workplace. 
Workstations and other graphics display stations, which in 
the past were too expensive to be widely available, are now 
cost-effective in a larger variety of.applications. As the 
technology has advanced into more areas, the increasing 
requirement for graphics resolutions has demanded 
proportionally greater computational and data throughput 
capabilities. Since graphics is at least two-dimensional, a 
doubling of the display resolution in two dimensions (X and 
Y) requires a quadrupling of memory, throughput, and 
computational resources. 

A graphics system, in general, consists of a display 
monitor, a display controller, a list memory, and an image 
or main control processor that modifies the list memory. Each 
basic element in the system represents an item of choice, 
which impacts the overall performance of the system. 

Increasing graphics resolution requires advances in 
display and display memory control technologies and in 
processors that can provide the increasing throughput and 
computational demands of image generation and 
manipulation. The TMS32020 Digital Signal Processor 
(DSP)!-3 is capable of meeting these demands. Its ability to 
perform 16 x 16-bit multiplies yielding a 32-bit result in a 
single instruction cycle is significant for graphics, which 
includes matrix operations for translation, rotation, and 
scaling. 

In this application report, an example of a graphics 
display subsystem based on a raster-scan display is described. 
The system consists of the TMS32020 functioning as the 
image control processor, dynamic video memories, and a 
display/memory controller. A proposed system configuration 
is presented and discussed from both hardware and software 
viewpoints, including some of the necessary tradeoffs and 
some available options. This report also includes a working 
schematic and the software used to test the design. 


OVERVIEW 


System designers for graphics display systems have 
been faced with classic tradeoffs of cost versus quality. For 
higher-end systems where crisp images of many line 
segments are necessary, the decision has frequently led to 
vector-scan display systems. This has been especially true 
in such areas as computer-aided design (CAD) systems, 
which must support high-quality, wire-frame-based images 
and dynamic operations. At the other end of the spectrum, 
raster-scan graphics systems have been able to take advantage 
of the advances and popularity of television-type display 
systems. This has reduced the cost to affordable levels, but 
has had a pronounced effect on the display quality. 


A raster graphics system is basically a pipeline, such 
that a bottleneck anywhere in the pipe drastically reduces 
performance and inhibits other hardware in the system from 
operating at peak efficiency. Upgrades with these systems 
have gone from text display drivers to bit-mapped graphics 
systems with limited input/setup access. The present state 
of bit-mapped graphics is limited by memory access times 
and processing capacity. 

In the past, computer systems utilizing a raster graphics 
display might have relied on the host processor in the system 
to update the bit-mapped pixel display. The bit-mapped 
display represents the screen image by a series of memory 
bits read and displayed on the monitor by the raster-scan 
display system. Since memory access was limited, two 
display processors were often used: one to drive the display 
from a bit-mapped memory and the other to drive the display 
from a text-code memory. By allowing the text to be 
displayed by a separate processor, the host could avoid the 
bottleneck of memory access by simply supplying the ASCII 
code of the characters to be displayed rather than the actual 
bit pattern of the character itself. This improved the total 
throughput on a bit-rate basis, but limited the flexibility in 
the display image. Additionally, the display was limited in 
the extent of bit-mapped graphic interaction due to the long 
setup access time required. 

The new generation of video controllers,4-6 together 
with the multiported video memories,’-9 permits a unified 
text and graphics display system with unlimited access to the 
display memories. Figure 1 shows a system-level block 
diagram of a unified text and graphics display system. The 
memory access changes the location of the bottleneck in the 
system away from the video display system itself and back 
to the host processor or central processing unit (CPU) 
supplying the information to be displayed. Since the memory 
access limitation has been eliminated, the system is now 
limited more by the CPU. Dynamic modification of high- 
quality, wire-frame-based displays and other applications are 
now feasible with a raster-scan display system. This dynamic 
modification involves updating the pixels in the bit-mapped 
display memory using seed fills and multiple line draws. The 
computational load required for object rotation, scaling, 
translation, and other numeric-intensive calculations to 
determine the orientation, size, and position of the object 
is also involved. Even with coprocessors, some graphics 
applications seem slow to the user since the coprocessor 
cannot compute the numeric-intensive algorithms fast enough 
to simulate a realistic response. Therefore, a need exists for 
faster processors, such as the TMS32020, if realistic 
responses are desired. 

The TMS32020 Digital Signal Processor (DSP) is 
capable of multiplying a 16-bit number by another 16-bit 
number and producing the 32-bit result in a single instruction 
cycle of 200 ns. This feature makes the TMS32020 proficient 
at graphics operations, such as translation, scaling, and 
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Figure 1. Unified Text and Graphics Display System 


rotation. Furthermore, a TMS32020-based graphics system 
has a high I/O throughput and is easy to interface to any host 
processor. Additional features, such as a large address reach 
(common with more generic microprocessors), separate 
program and data memory spaces (64K words each), 
multiprocessor interface capabilities, an architecture designed 
with auxiliary registers to aid with indirect addressing, and 
a powerful, easy-to-program instruction set, make the 
TMS32020 an attractive master processor for a graphics 
display subsystem. Figure 2 shows a block diagram of a 
graphics display subsystem, which utilizes a TMS32020 as 
the graphics “‘engine’’ 

This application report provides an example of a 
graphics display subsystem using the TMS32020 (DSP), the 
TMS34061 Video System Controller (VSC), and TMS4161 
dynamic video memories (VRAMsS). The proposed system 
serves as the basis for discussions on both hardware and 
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software and insights into several tradeoffs that a designer 
must consider. By building the hardware and programming 
some basic graphics software functions, as shown in the 
appendices, the design has been taken from a mere paper 
design through the first several steps toward a practical 
application. Design flexibility of the TMS32020 is thereby 
demonstrated as well as the efficiency of the TMS32020 in 
some of the numeric-intensive tasks required for graphics 
applications. 

The benefits of the proposed system are inherent in the 
design itself. Basic features of the TMS32020, TMS34061, 
and TMS416ls simplify the design since the devices are 
easily interfaced with one another and with other parts of 
a system. After the hardware is designed, it remains flexible 
because a large portion of the design is user-definable due 
to the programmability of the TMS32020 and the TMS34061. 
Application reports,!9-12 including discussions on 
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Figure 2. TMS32020-Based Graphics Display Subsystem 


606 


23. A Graphics Implementation Using the TMS32020 and TMS34061 


programming the matrix operations!9 required for graphics 
applications for the TMS32020, are useful references. 


SYSTEM CONFIGURATION 


A graphics system implemented using the TMS32020 
consists of a host processor, the TMS32020 as the graphics 
processor, the TMS34061 Video System Controller (VSC), 
the multiported video RAMs (VRAMs), and various random 
logic. The host processor interface may be designed in 
various ways to accommodate the host itself and the 
integration of various processing units in the system. A wide 
variety of host processors used in graphics systems may 
actually be used. Since the potential is largely user 
preference, it is not discussed further in this application 
report. More information and an example of a TMS32020 
host interface can be found in the application report, 
‘*TMS32020 and MC68000 Interface.’’!1 

The graphics system discussed here uses the TMS34061 
VSC and TMS4161 VRAMs to drive a display monitor with 
720 x 300 display-bit resolution. In this system, the VSC 
supplies all of the necessary RAS and CAS signals to the 
VRAMs and provides the“necessary synchronization and 
blanking signals to the display monitor. The VSC expands 
the addressing range of the TMS32020 to the VRAMs’ 20-bit 
address, thus allowing the TMS32020 to access the large 
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address space required for graphics applications. The 
TMS4161 VRAMs are 150-ns dynamic RAMs with dual- 
port access, designed specifically for graphics applications. 

From the standpoint of the TMS32020, the system has 
been designed around a VSC Evaluation Module (EVM) 
board. This board is characterized by an eight-bit data bus 
and a memory array that can support display resolutions up 
to 1024 x 512 pixels with 4 bits per pixel. 

With this design, the TMS32020 can provide an 
efficient implementation of the numeric- and pixel-processing 
capabilities required. The VSC supplies the necessary 
memory and display control for the system. Figure 3 shows 
a block diagram of a graphics system configuration. The 
detailed schematic of the circuit actually built is found in 
Appendix A. 


TMS32020 Digital Signal Processor 

The TMS32020 Digital Signal Processor is a high- 
speed processor well designed for computation-intensive 
applications such as graphics. Its 200-ns instruction cycle 
time, modified Harvard architecture, hardware multiplier, 
multiprocessor control features, and powerful instruction set 
make it particularly efficient in many applications. Some of 
the additional features of the TMS32020 (see Figure 4) are 
a 32-bit accumulator, parallel shifters, on-chip data RAM, 
and an on-chip timer. 
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Figure 3. Graphics System Block Diagram 
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Figure 4. Block Diagram of the TMS32020 Digital Signal Processor 
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TMS34061 Video System Controller to access the memory either directly or indirectly through 
The TMS34061 Video System Controller, shown in X-Y address registers. When the indirect access method is 
Figure 5, is a high-performance device that controls the video used, the VSC further assists the host by allowing on-the- 


display and the dynamic memories of a bit-mapped graphics _ _ fly address adjustments (increment, decrement, clear, and 
system. The VSC relieves the host of memory control, no change). By using the VSC in conjunction with the 
memory refresh, and VRAM shift-register reload functions. multiport’ VRAMs, memory access time by the host is 
It is easily programmed via 18 registers to accommodate a virtually unlimited (better than 94 percent). 


wide variety of display sizes. The VSC also allows the host 
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Figure 5. Block Diagram of the TMS34061 Video System Controller 
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TMS4161 Dynamic Video RAM 

The TMS4161 multiport video RAM is a 64K x 1 
memory array with a 256-bit shift register. The shift register 
is divided into four cascaded 64-bit shift register segments, 
allowing data access in a serial mode to be tapped at four. 
different lengths. A functional block diagram of a TMS4161 
is shown in Figure 6. Data may be shifted into or out of the 
internal shift register while normal random accesses are 
taking place with the memory array. In a graphics system, 
the data is moved by block transfer from the memory array 
to the shift register and shifted out at the dot/chip rate. 


Program and Data Memory Requirements 

The TMS32020 provides independent 64K-word 
address spaces for both program and data memory. The 
program space may be divided among all memory types as 
the system designer deems necessary. This enables allocation 
or utilization of memory resources both from a cost and a 
size perspective since the memory may be RAMs (both static 
and dynamic), ROMs, PROMs, EPROMs, or EEROMs of 
any and/or differing speeds. Slower memories are typically 
less expensive and can prove to be a significant cost 
advantage for portions of the system infrequently accessed. 
Where time is critical, faster memories are required to 
enhance the system performance. In the design described in 
this report, the program memory consists of 4K words of 
bipolar PROM (TBP28S166). 

This program memory contains the initialization code, 
basic display control or function algorithms, a command 
parsing routine, and a table of commands and required data. 
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By designing the software to be structured around a table- 
driven approach, the system can easily be modified to allow 
the host to supply and update the command and data table 
contents. Through the use of shared memory or other 
multiprocessing communication designs, a host processor can 
be conveniently interfaced to the TMS32020. As an example, 
the system could consist of initialization and interface 
procedures and some basic functional routines programmed 
into PROM and additional program memory space 
implemented with RAM. The RAM area allows the design 
to expand in capability by having the host download routines 
to the TMS32020. The routines can then be executed to 
enhance the system by building on the basic functions. 
The circuit presented utilizes only the data memory 

internal to the TMS32020 (544 words). For systems designed 
to maintain large blocks of data, such as the display 
information for character formation in text displays or tables 
of display coordinates of a large number of dynamically 
changing objects, it is necessary to also include RAM and/or 
ROM in the external data memory space. This memory, 
along with the internal data memory of the TMS32020, the 
video system controller, and video memories, must be 
mapped into the 64K-word data memory space. 

Considerations for memory selection and interface 
requirements have been discussed in significant detail in an 
application report, ‘‘Hardware Interfacing to the 
TMS32020.’’12 This report provides a comprehensive 
discussion on interfacing the TMS32020 to memory devices 
(PROMs, EPROMs, static RAMs, and dynamic RAMs) and 
peripherals. It also supplies circuit and timing diagrams for 
each of the interfaces. 


TIMING AND CONTROL 


OUMMY CELLS 


DATA 
(1/2) MEMORY ARRAY IN 
REG 


(1/2) 4 OF 256 COLUMN DECODE 


256 SENSE . REFRESH 


AMPLIFIERS 


] 

POG TRANSFER CONTROL 

ae REGISTER | REGISTER | REGISTER 7, 
00 01 10 WW 


eon seus COLUMN 19, 
O 
10F 4 
SELECTOR re sour 


Figure 6. Block Diagram of the TMS4161 Multiport Video RAM 
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TMS32020 and TMS34061 Interface 

Access to the VSC (and the VRAMs) is gained through 
a data memory-mapped design. The VSC contains a number 
of internal registers and mode-function selections in addition 
to video memory access to the video memories, thus 
providing a homogeneous access methodology for the 
TMS32020 software design. The VSC is mapped into the 
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Figure 7. TMS32020 Data Memory Map 


23. A Graphics Implementation Using the TMS32020 and TMS34061 


lower 32K words of the data memory space, as shown in 
Figure 7. By paging the VSC at pages 8 through 255, the 
software differentiates between it and the internal memory 
of the TMS32020. In addition to the address bus lines and 
the bidirectional data bus, the TMS32020 supplies the VSC 
with enable and read/write signals while the VSC in turn 
supplies a ready or hold signal to the TMS32020. 

Figure 8 shows the signal and bus interface between 
the TMS32020 and the VSC. The design maps the VSC into 
the lower 32K-word address range of the data memory space 
by combining the DS signal and A15 address line to gate the — 
STRB signal that allows access to the VSC. This single enable 
signal is the input to three individual enable inputs (ALE, 
CEH, CEL), providing implementation of all modes of VSC 
access. Since the VRAMs have a 150-ns access time, MSC 
(from the TMS32020) is used to provide an automatic one- 
wait state and allow time for the VSC to activate its 
RDY/HLD line to generate additional wait states to the 
TMS32020. 

On the data bus, the lower eight bits are isolated from 
the VSC by using an SN74LS245 transceiver to avoid bus 
conflicts with the VRAMs. The three-state outputs of this 
buffer are enabled whenever STRB is gated to the VSC. 
The R/W signal from the TMS32020 is used to determine 
the direction that the transceiver drives the bus. In this 
configuration, the upper eight bits of the TMS32020 data 
bus are not used whenever access is made to the VSC or 
VRAMs. However, the eight MSBs of the data bus are 
supplied the low output of an SN74LS244 buffer in such 
cases. This buffer is normally in a high-impedance state and 
supplies the low-signal output only when the VSC or VRAMs 
are being read. The benefit here is that the software of the 
TMS32020 does not have to mask the upper eight bits 
whenever the VSC or VRAMs are read. Note that the circuit 
being presented in this report does not support host-direct 
access to the VRAMs. 

The algorithms that are coded for this design use the 
X-Y indirect addressing capabilities of the VSC to access 
each bit location in the VRAMs. Actually, for each address 
enable to the VRAMs, two pixels (four bits each) are read 
or written. This approach makes full use of the eight-bit data 
bus already utilized for VSC register access. However, 
whenever a single pixel is to be modified, a pixel pair must 
be read, the selected pixel modified, and then the pixel pair 
written back to memory. This produces a significant overhead 
for line-drawing algorithms since a read-modify-write must 
be performed each time a pixel is changed, rather than a 
simple write. A possible alternative is to access the video 
memories on the basis of a single pixel bus. In the case of 
a four-bit bus, the throughput of a line-drawing algorithm 
may be nearly doubled since the pixel access is reduced to 
a simple write. On the other hand, for cases where the 
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Figure 8. TMS32020 and TMS34061 Interface 


algorithias test the pixel value, such as in seed fills, it is 
advantageous to have the data bus of maximum width (16 
bits) and actually read four pixels at a time. This minimizes 
the number of read accesses per potential test condition, 
thereby maximizing throughput. It is also beneficial to use 
a 16-bit data bus for block transfers of data into (or out of) 
the VRAMs, as in the case when bit-mapped text is to be 
displayed. For a 16-bit bus, its appropriate use can nearly 
double the throughput since the number of decisions per pixel 
group access is maximized. 

The consideration involved in choosing among all these 
cases is the requirement of additional logic to configure the 
data bus to various widths. The 16-bit data bus, as described 
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for the bit-mapped text, also benefits from the TMS32020 
being able to perform host-direct accesses to the VRAMSs. 
If a programmable-width (4-, 8-, or 16-bit) bus is necessary 
for speed considerations in the design, then additional logic 
is required to configure the bus to each of the desired widths. 

The VSC needs a 10-MHz system clock, which may 
be created from the 20-MHz input clock to the TMS32020 
in one of two ways. In the current diagram shown in Figure 
8, the 10-MHz clotk is produced by an exclusive-OR of the 
two 5-MHz clock output signals (CLKOUTI and 
CLKOUT2) of the TMS32020. An alternate approach 
consisis of using a flip-flop to convert the original 20-MHz 
signal down to the desired 10-MHz clock. 
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TMS34061 and TMS4161 Interface 

The interface between the VRAMs and the VSC is 
straightforward. Eight address output lines (MA7-MAQ0) on 
the VSC are tied to the address lines (A7-A0) of each of the 
VRAMs. RASO-RAS3 and CASHI are control )1 signal outputs 
of the VSC used to provide the necessary RAS and CAS 
signal inputs to each of the VRAMs, thus properly enabling 
access to the pixels being addressed. CASHI is tied 
commonly to all the CAS inputs of the VRAMs, while 
RASO-RAS3 split the VRAMs into four blocks. The other 
control lines, TR/QE and W, are tied commonly to all 
VRAMs, providing signals that enable shift register transfer 
cycles and write cycles, respectively. Since the VRAMs are 
each a 64K x 1-bit memory array, an address access of any 
single pixel (represented by four bits) actually causes four 
of the VRAMs to be enabled simultaneously. 

The shift-register output of the VRAMs is used to 
provide the RGBI (red, green, blue, and intensity) signals 
to the display monitor. The overall display memory array 
in this design is a 1024 x 512-pixel array. Note that the active 
display area of the total array is a 720 x 300-pixel array, 
as shown in Figure 9. 

Output from each of the VRAMs is shifted in parallel 


to an eight-bit external shift register for each of the RGBI. 


signals (see Figure 10). More detailed information of the 
VSC and VRAM interface may be found by consulting the 
TMS34061 Evaluation Module User’s Guide. 13 


Color Palette Option 

An option to be considered for a graphics display 
system is the addition of a color palette. The TMS34070 color 
palette!4 is designed to generate up to 4096 colors on a single 
display. It provides a 16-register lookup, expanding the color 
selection to 4 bits each of RGB in three 4-bit Digital-to- 


Analog Converters (DAC). The analog output of each DAC 
is capable of directly driving standard 75-ohm monitor 
cables. The 16 14-bit registers of the color lookup table are 
loaded directly from the video memory, eliminating the need 
for a separate processor interface through which the registers 
can be loaded. A mode selection allows the color registers 
to be loaded (1) prior to the start of each individual scan line, 
(2) prior to the start of each frame, or (3) only on explicit 
command. 

Figure 11 shows the internal architecture of a 
TMS34070 video palette, and Figure 12 illustrates a typical 
video memory system that includes the palette. 


SOFTWARE DESIGN 


In addition to the selection of the hardware to be used, 
consideration must be given to the design of the software 
system. When using a raster-scan display system, it is 
necessary to determine which functions must be included, 
which would be useful and efficient if included, and which 
are infrequently used and able to be created from other basic 
functions by the user. Some functions, such as clearing the 
screen and drawing lines, are quite obviously necessary. 

The TMS32020, together with the VSC, can provide 
efficient implementations of incremental algorithms utilized 
in pixel processing. This is possible because of the single- 
cycle instruction execution of most TMS32020 instructions 
and the X-Y indirect addressing capabilities of the VSC. In 
addition, matrix multiplication and other object manipulations 
are efficiently implemented on the TMS32020 by using the 
single-cycle hardware multiplier and block processing 
capabilities. Many other features may need to be evaluated 
in designing an interactive graphics system (i.e., x-y 
locators/cursors, output primitives, coordinate 
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transformations, windows and clipping, zooming, panning, incorporating a VSC and VRAMs mapped into the data 
and library functions). 15,16 memory space must be designed with certain considerations. 


Software to run on a TMS32020 for a graphics system The TMS32020 is designed to access data memory in a direct 
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Figure 12. Video Memory System with a Video Palette Enhancement 


memory accessing mode that limits accesses to pages of 
memory (128 words in length) or in an indirect memory 
accessing mode through the use of the auxiliary registers. 
Therefore, accessing a VSC or VRAMs along with the 
internal data memory space of the TMS32020 requires the 
simultaneous use of both addressing techniques. (If only 
direct addressing is used, frequent use of the LDPK 
instruction is required, resulting in a less efficient coding 
of the algorithms.) Because of the demand to use both 
addressing techniques, it is necessary to make the clearest, 
most efficient use of the auxiliary registers, The VSC and 
VRAMs are best accessed through a layered software 
approach with algorithms of basic functionality being called 
and utilized by routines with a higher level of functionality. 

In the following sections, a description of the software 
design is given. Various considerations with regard to the 
necessity and level of each functional routine are discussed. 
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Accessing the VSC 

Access to the VSC (and to the VRAMs) is provided 
through the data memory address space of the TMS32020. 
The VSC (and consequently the VRAMs) is enabled 
whenever an external memory access is made with A15 in 
a low state. The VSC is function-selected by the TMS32020 
using a three-bit function-select code input on FS2-FSO. 
Table 1 gives the function-select codes and their descriptions. 
These inputs are driven by address lines A13-A11 of the 
TMS32020. Whenever All, Al2, and A13 are all zero, the 
A10 address line should be set to a one to assure that an 
external address is generated. 

From Table 1, it can be determined that the registers 
may be accessed at TMS32020 addresses starting at > 0400 
or at > 1000. X-Y indirect access occupies the address range 
starting at >0800. The host-direct access, although not fully 
implemented in this circuit, begins with addresses at > 1800. 
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Table 1. Function-Select Decoding 


TMS32020 
STARTING 
DATA MEMORY 
ADDRESS 


Shift-register accesses are located at either >2000 or >2800 
depending on whether data is being moved from the shift 
register to memory or from memory to the shift register. 

Descriptions of the VSC internal registers 
corresponding to specific TMS32020 data memory addresses 


Register access 
X-Y indirect access 
Register access 


Host-direct access 
Shift-register access (SR to memory) 


Shift-register access (memory to SR) 


Reserved (put VSC in test mode) 


SELECTED FUNCTION CYCLE 


are shown in Table 2. Note that in the accesses to the VSC 
internal registers, the useful addresses are actually eight 
address units apart; i.e., eight consecutive addresses actually 
access the same internal register. 


Table 2. VSC Register Address Map 


TMS34061 REGISTER DESCRIPTION 


TMS32020 DATA 
MEMORY ADDRESS 
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0488 Display Address Update - High Byte 
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0480 Display Address Update - Low Byte 


TMS32020 DATA TMS34061 REGISTER DESCRIPTION 


MEMORY ADDRESS 


0490 
0498 


Display Start - Low Byte 


Display Start - High Byte 


23. A Graphics Implementation Using the TMS32020 and TMS34061 


Information is provided about the X-Y indirect address 
access in Table 3. Note that in the accesses to the X-Y indirect 
mode, the useful addresses are actually eight address units 
anart; 1.e., eight consecutive addresses actually perform the 
same X-Y indirect function. For more general information 
about accessing the VSC, refer to the TMS34061 User’s 
Guide. 


Table 3. Address Map of X-Y Adjustment Codes 


MEMORY ADDRESS X Y 


0878 Clear X Clear Y 


Initializing the VSC Registers 

Whenever a reset signal is applied to the VSC, the 
internal registers of the VSC assume specific values. These 
registers must be initialized to values matching the actual 
hardware configuration before any memory accesses to the 
VRAMs take place. The code to initialize these values is 
contained in the appendix in the section beginning with the 
label VSCIN. 

The first four registers define the timing for each 
horizontal line. Note that the number of time units between 
the ‘‘End Blank’’ signal at the beginning of the line and the 
‘*Start Blank’’ signal at the end of the line is 90. The reason 
for this is that each shift clock signal applied to the VRAMs 
accesses data from eight VRAMs, shifting data in parallel 
to an external shift register. This provides 720 pixels per 
line in total (90 time units x 8 pixel data bits/time unit = 
720 pixels). The next four registers define the vertical timing 
control, with the difference in the two blanking times 
resulting in 300 lines on the display. The next two registers 
set up the starting address of the display in the VRAMs and 


define the address increment to be applied whenever the 
internal shift registers must be reloaded. The remaining 
registers are set to provide feedback to the host, basic 
function control, and the X-Y description for host-indirect 
access to the VRAMs. 

The X-Y Offset Register is set to split the X-Y Address 
Register into nine bits of Y and seven bits of X, and to supply 
an additional two bits of X-Y address extension as the two 
LSBs of X. This splits the eighteen bits of X-Y address 
evenly, allowing X and Y addressing from 0 to 511. The 
X addressing is further expanded by accessing two pixels 
at a time (eight-bit data bus), resulting in effective X 
addressing from 0 to 1023. 


Clearing the Display Screen 

Certain operations are absolutely vital in the design 
of a graphics system. One of these vital operations or 
functions is the ability to clear the entire display to a single 
color. Several features of the VSC and the TMS32020 are 
used to optimize the implementation of this function. The 
flow diagram shown in Figure 13 describes the process 
required to clear the screen to any given color. 

To optimize the task of clearing the screen, the VSC 
provides an active strobe on RASO, RASI ? RAS2, and RAS3 
simultaneously. The end result is the expansion of the data 
bus to effectively four times its width. In this system, the 
same pixel data can be written to eight pixels (instead of two) 
for each write operation from the TMS32020. 

In an efficient implementation of the screen-clearing 
algorithm, all the pixels are written in a row that is not being 
displayed. Because of the use of the internal shift register 
in the VRAMs, writing a row of VRAM data that is 
consistent with the shift-register size requires 256 writes. 
Since eight pixels are being written with each write, it 
requires only 256 writes for a full row of data to be written 
to all the memories. The TMS32020 RPTK instruction aids 
in the program execution of this operation by providing a 
single instruction execution loop, as shown below. 


RPTK 255 
SACL * 4 


After a row of off-display memory is filled, the 
TMS32020 must wait for the vertical blanking period to 
occur. When the vertical blanking period begins, a second 
feature of the VSC and VRAMs is utilized. The VSC is 
modified to inhibit display-update cycles, and the contents 
of the off-screen memory that were filled in previously are 
transferred to the shift registers of the VRAMs. Using the 
RPT instruction, the shift-register contents are moved to 
every row of the display memory. Because of the 
organization of the VRAMs in representing the display, the 
contents of the shift register are equivalent to two rows of 
data on the display. The number of writes required to fill 
display memory are equal to one-half the actual number of 
rows in the display. Utilization of the auxiliary registers in 
the TMS32020 and the capability to modify the contents of 
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Figure 13. Clear Screen Flow Diagram 


an auxiliary register by the contents of the base auxiliary 
register are critical to the efficient implementation of this 
operation. 

The routine labeled CLEAR and contained in the code 
listed in the appendix provides an example of this 
implementation. 


Bresenham’s Line-Drawing Algorithm 

A figure or object is constructed of lines that must be 
drawn on a pixel-by-pixel basis in any bit-mapped, raster- 
scan graphics system. The lines used must produce an equally 
satisfying image with consideration given for actual display 
resolution when the object is drawn in any orientation on 
the screen; i.e., when a line of a given length is drawn 
between two points, the line must be as smooth as possible 
and without holes or gaps. It is also important that the lines 
be drawn in an efficient manner with the minimum number 
of calculations possible since the basic line-drawing algorithm 
is used frequently in creating and modifying images. An 
attractive and popular choice for vector generation in a raster- 
scan display system is Bresenham’s algorithm. The algorithm 
requires only integer arithmetic, thus the reason for its 
popularity. A discussion of the theory of Bresenham’s 
algorithm is found in reference [17]. The specific TMS32020 
implementation is described in the following paragraphs and 
in the flow diagram shown in Figure 14. The code for this 
implementation is provided in the appendix beginning with 
the label LINE. 

Figure 15 shows the approach to a line drawing in a 
two-dimensional Cartesian-coordinate system. Figure 15(a) 
shows the coordinate system divided into octants. Given two 
points A and B (see Figure 15(b)), it is convenient to translate 
the points so that one of them (e.g., A) lies at the coordinate 
system’s origin (see Figure 15(c)). It can then be seen that 
the line from A to B must lie in one of the octants. (Here 
it is assumed that the lines shown to divide the octants actually 
belong to one or the other of the two octants.) Using 
Bresenham’s algorithm, a line may be drawn from A to B 
in any of the octants. Unfortunately, the coding for each of 
the octants is unique, and a significant amount of program 
space is required. A technique used to reduce the program 
memory requirement by half is to perform the translation 
of the points in such a way that the point not at the origin 
always has a positive y-coordinate value (see Figure 15(d)). 
In this way, it is necessary only to provide code for four of 
the eight octants. 

Each of the four octants begins at the point that has 
been translated to the origin and utilizes the X-Y indirect- 
addressing function of the VSC to access memory and modify 
the address register. The modification of the address for each 
octant is one of the four combinations of incrementing and 
decrementing X and Y. On a pixel-by-pixel basis, an 
increment (or decrement) in the x coordinate does not always 
result in an increment (or decrement) in the X address. Table 
4 shows the selection of X-Y adjustment codes loaded into 
the auxiliary registers to facilitate access to the video 
memories. 
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Figure 14. Flow Diagram for a Line-Drawing Algorithm 
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Figure 15. Line Drawing in a Cartesian-Coordinate System 
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The algorithm is designed using the following approach 
for all octants. The distance function that determines the 
location of the next point is tested before writing to the 
present point. Based on whether the distance function is 
positive or negative, a branch is taken. Then, the 
corresponding increment constant is added to the difference 
function to modify it in preparation for the decision at the 
next point. The pixel pair with the desired pixel is read from 
memory and masked using the access through auxiliary 
register 1 (AR1). After the new color value has been added, 
the pixel pair is written back to video memory using an 
auxiliary register (AR2, AR3, or AR4), which provides the 
appropriate address adjustment for the next pixel 
modification. Auxiliary register 0 (ARO) contains the number 
of pixels that must be modified to connect the two designated 
points. ARO is decremented, with appropriate branch control 
to the next difference function test, until all pixels between 
A and B are modified. 

Extensive use is made of the auxiliary registers to allow 
interleaved access of the video memories and internal 
memory of the TMS32020. This implementation minimizes 
execution time, a primary concern, as well as reduces the 
program memory requirement. In addition, tests for 
horizontal and vertical lines are also made to improve the 
rate at which those lines are drawn. Table 5 shows the 
performance per pixel for line drawing in the present 
implementation. 


Table 5. TMS32020 Performance per Pixel for Line 
Drawing 


FUNCTION CYCLES TIME AT 
200 NS/CYCLE 
Horizontal line a 


Vertical line 
Diagonal line 

Further optimization can be accomplished by designing 
the hardware to establish a single pixel-width data bus. By 
implementing this modification, it is no longer necessary to 
perform read-modify-writes with the video memory. The 
reduction to write-only for line drawing nearly doubles the 
throughput of the TMS32020 for this function. A second 
benefit of a single pixel-width bus implementation is the 
compression of the algorithm to the equivalent of the code 
required for a single octant, thus further reducing the amount 
of program memory required. 


Rotating a Wire-Frame Cube 

The usefulness of a line-drawing function can easily 
be seen in the construction of such simple objects as a wire- 
frame cube. The importance of the efficiency of the function 
is observed when the task is expanded to include the rotation 
of that object. The discussion here encompasses the definition 
of a three-dimensional object and the rotation of that object 
in its coordinate space. It further includes the viewpoint 
perspective of that object and its perspective projection onto 
the two-dimensional display space. 


An object can be defined by its vertices and a list of 
the connections of those vertices. In the case of a cube, the 
definition is relatively simple; eight vertices are connected 
to form 12 edges. To draw the cube on the screen, the 
vertices in the three-dimensional space of the cube must first 
be translated, rotated, and scaled with respect to an observer’s 
viewpoint, and then projected onto the two-dimensional 
display space. The net result of these operations is a second 
list of points describing the cube in a two-dimensional 
projection. After the second list of cube vertices is generated, 
a list that links those points appropriately can be used to 
supply the line-drawing algorithm with pairs of points, thus 
generating the image of a wire-frame cube on the screen. 

In order to effect a rotation of the wire-frame cube on 
the display, it is necessary to rotate the vertices of the cube 
in its originally defined three-dimensional space, apply the 
viewpoint and projection, and redraw the object on the 
screen. Two additional elements must be considered. First, 
in order not to leave multiple images of the object on the 
screen, it is necessary not only to draw the object in its 
modified position but also to erase the previous image of the 
object. Second, the erasure and modified drawing must occur 
within a period of time that has an upper bound of the refresh 
cycle of the display (typically 1/60 second). This assures that 
the viewer always sees a complete object on the display. An 
alternative, which can allow the time to be longer for display 
modification and still permit continual observation of fully 
present objects, is to expand the video memory. By 

expanding the video memory, a complete screen can be 
created in memory, which is not bing actively displayed. 
During the vertical blanking period following completion of 
the image modification, the starting display address can be 
modified to show that portion of memory, while the next 
cycle of modification can begin with the memory previously 
being displayed. In other words, the video memory can be 
divided into two display windows, and displayed and 
modified in a ‘‘ping-pong’’ approach. 

A code implementation of the first approach discussed 
for the rotation of a wire-frame cube is included in the 
appendix beginning with the label CUBER. This routine calls 
or uses five other subroutines: CLEAR and LINE (mentioned 
previously), and ROTZ, MTRX4, and PRJCTN (to be 
discussed later). The flow diagram shown in Figure 16 
describes the process that is used to perform the wire-frame 
cube rotation. 

The basic process of cube rotation begins by loading 
the definition table of the cube and the matrices into the 
designated areas of memory. If it is required that the 
viewpoint matrix and projection matrix be dynamically 
defined, the matrix definition may occur at this point. After 
this overhead is completed, the overall rotation loop is 
entered. 

Inside the main rotation loop (abel CTLOOP), the first 
task is to build the modified rotation matrix. Two alternatives 
exist to continuously rotate an object. One of the alternatives 
is to rotate the present coordinate values for all vertices using 
a fixed-rotation matrix, and update the coordinates with the 
modified (rotated) values. This method allows a degradation 
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Figure 16. Flow Diagram for a Wire-Frame Cube Rotation 


in the object definition since the modified results are 
recursive, requiring truncation or rounding, and eventually 
propagate an observable error. The other alternative and the 
one chosen here is to maintain the original coordinate values 
and modify the rotation matrix. The benefit of this approach 
is that the modification can be integer, beginning with zero 
and increasing to a maximum value at which point a reset 
to zero occurs. This is expected since the rotation is a circular 
function. After the rotation matrix has been determined, the 
matrix manipulation of each of the vertices can begin. 
Following all the matrix calculations, the screen coordinates 
of each vertex are in a table. Using straightline code as a 
list implementation, the screen is cleared, and coordinates 
are supplied to the line-drawing routine in 12 pairs. When 
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all 12 lines are drawn, forming the cube, the angle of rotation 
is updated for the next iteration, and the main loop is 
repeated. 

An inner loop is used to perform the matrix 
manipulation of each of the vertices. Within this loop, the 
coordinates of a point are loaded, the three matrix subroutines 
for rotation, viewpoint perspective, and projection are called 
in succession, and the results are stored in a table of screen 
coordinates. This loop is executed once for each vertex point 
that is used to define the object (eight in the case of a cube). 
Table 6 summarizes the performance of the TMS32020 in 
cube rotation and in matrix manipulation that converts the 
object coordinates to screen coordinates on a per-vertex 
basis. 
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Table 6. TMS32020 Performance per Vertex for Cube Rotation 


Total computation for eight vertices 


Some assumptions were made in implementing each 
of the matrix subroutines. These assumptions relating to 
rotation, viewpoint perspective, and projection are 
sufficiently general for most cases but are worthy of an 
explanation. The overall matrix transformation for each point 
is defined by the following matrix equation: 


Riz Ri2 Ry 0 
R21 R22 R33 «0 
[Xp Yp Zp] = [X YZ 1] © R31; R32 R33 0 
0 0 0 1 
Cyy Ci2 C3 «0 Px, O 0 0 
Co} C22 C33 0 0 PP 0 90 
C31 C32 C33 O| * |[O 0 P O 
C4; C42 C43 1 Tx Ty Tz 1 
(1) 
Rotation 


In rotating each vertex of the object, it is assumed that 
the object is simply rotating about some axis through the 
body. The actual cube definition, used as the example in the 
code presented in the appendix, begins with the cube centered 
at the origin of a Cartesian-coordinate system with the cube 
faces being orthogonally centered on the x, y, and z axes. 
This orientation is shown in Figure 17. By using this 
orientation and rotating the cube about the z axis, the cube 
is observed to spin on one of its faces. If it is desired to have 
the cube spin on one of its corners, then the original 
orientation may be altered to be like the one shown in Figure 
18. The key to the assumption is not the axis about which 
the object is rotating, but rather the fact that the object is 
simply rotating. If the desire is to observe the object in a 
more complicated motion, i.e., by adding precession and 
nutation and/or revolution, then the rotation matrix, which 
describes the object’s motion, necessarily increases in 
complexity. 
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Figure 17. ‘‘Real-World’’ Cube Definition 
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Figure 18. Alternate ‘‘Real-World’’ Cube Definition 
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With the assumptions as described, the rotation matrix, 
which begins as a 4 x 4 matrix using homogeneous 
coordinates, can actually be reduced to a simple 2 x 2 matrix 
for the necessary calculations. This is possible since the 
z-coordinate value of each point does not change with rotation 
and the motion is a simple rotation, involving no additional 
translation in x or y. 


Riz Riz Ri3 0 
R21 R22 R33 0 
[Xr Yr Zr] = [Xo Yo Zo 1) * | Ra, R32 R33 0 


0 oO OO 1 
(2) 


Ri Ri2 
[X; Yr] = [Xo Yo] * R2; R22 (3) 


One additional condition is assumed for this example. 
The cube being rotated is a ‘‘unit’’ cube in its coordinate 
system, with no side longer than one. The orthogonal 
orientation of the cube results in all coordinate values being 
+ 0.5 for all coordinates (x,y,z). This condition is required 
so that the coordinate values are always defined as a Q15 
number (i.e., 15 bits following an assumed binary point), 
thus simplifying the calculational load for the TMS32020 
code implementation. 


Viewpoint Perspective 

Placing the cube in a perspective position is a basic, 
yet involved, operation that must be performed before the 
object can be projected onto the screen. In most cases, if 
a stationary viewpoint is used, a combination matrix can be 
developed prior to its real-time use and stored as a single 
matrix. The combination matrix is actually the product of 
seven individual matrices for the specific case of the cube 
rotation code in the appendix. 

A matrix supplying the viewpoint perspective of an 
object includes translation, rotation, and scaling. Its general 
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form is therefore a complete 4 x 4 matrix, although for 
purposes of implementation, the calculations reduce to a 
4 x 3 matrix since the known zeroes and ones can be 
eliminated. 


Cii Ci2 C13 


C21 C22 C33 
[Xe Ye Ze 1] = [X; Y; Z, 1] © C3, C32 C33 


C4, C42 Caz 


—- OOO 


(4) 


To determine the composition of the viewpoint matrix, 
the following elements must be considered: First, a point of 
observation in the ‘‘real-world’’ coordinate system, outside 
the object, must be selected. The coordinates of this point 
actually become the origin, and the object is subject to a 
translation equivalent to this from the origin. Second, the 
X-Y address coordinates of a point on the display screen are 
like those shown in Figure 19. Since a right-hand coordinate 
system is the normal expectation, the z axis in the screen 
coordinate system points into the screen. The observer is thus 
looking parallel, down the z axis at the point of observation 
rather than at the object. This means that the next three simple 
matrices used to build the combination matrix turn the line 
of sight from parallel with the z axis to pointing at the center 
of the object. Third, it is generally desirable to observe 
positive z-coordinate values as if they were projecting from 
the screen. The next matrix accomplishes this task, thus 
converting from a right-hand to a left-hand coordinate 
representation, as shown in Figure 20. In the same way, 
positive y values are normally observed as going up rather 
than down. Another similar matrix is utilized, which converts 
back to a right-hand coordinate system (see Figure 21). 
Fourth and finally, the last element going into the 
combination matrix is a matrix that provides a scale 
normalization defining the size of the object relative to the 
viewing position of the actual observer. In other words, this 
matrix incorporates the relative scaling of the object with 
respect to the assumed distance of the actual observer 
from the display screen. 
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Figure 19. X-Y Address Coordinates of a Display System 
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Figure 20. Z-Axis Conversion to the Normal Observation 


RIGHT-HAND SYSTEM 


Figure 21. Y-Axis Conversion to the Normal Observation 


Projection 

Thus far, discussion has been presented with regard 
to orienting an object with respect to “‘normal’’ screen 
observation, and all of the matrices have defined and 
manipulated the object in three dimensions. It is necessary 
to convert the coordinates to only two dimensions, 
corresponding to the X-Y addressing of the video memory, 
in order to ‘‘draw’’ the object on the screen. The conversion 
must incorporate the z-coordinate value into both the x and 
y coordinates, producing the X and Y memory addresses. 


Figure 22 shows the method used to incorporate z into 
x and y in the viewer’s coordinate system. The fundamental 
process is to modify the value of x and y, and correspondingly 
the position of the point on the screen relative to the depth 
or projection of the z-coordinate value. In other words, in 
a screen-projected coordinate system, a line positioned close 
to the observer is seen as a longer line than one positioned 
farther away, assuming that both have the same relative 
length in the ‘‘real-world’’ coordinate system. 
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Figure 22. Z-Coordinate Projection onto the X-Y Screen 


The matrix to apply this projection is a diagonal matrix 
plus translation. The diagonal elements also apply a scaling, 
thus allowing the object to be sized to fill a designated area 
of the screen. This sizing factor is equivalent to using a 
‘‘zoom.’’ The translation elements allow the object to be 
positioned with its center at any point on the screen. 


[Xp Yp Zp 1] = [Xe Ye Ze 1] 


vsx' 0 0 0 
0 vsy’' 0 O 
. 0 0 vsz' 0 ee 
VCX VCY VCZ 1 
VSX' = VSX/Z, -VSY' = VSY/Ze (6) 


An analysis of the calculations for this matrix shows 
that it requires the equivalent computations of a 2 x 2 matrix, 
in addition to the division necessary to calculate 1/Z,. This 
is due to the fact that the matrix is diagonal. The parameters, 
VSX' and VSY'’, are dynamically generated for the projection 
matrix, with the stored parameters for the matrix being the 
X and Y screen scaling (VSX, VSY) and centering (VCX, 
VCY) parameters. 
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Filled-Ellipse Drawing 

An additional graphics function for filled-elipse 
drawing is worthy of discussion although it represents 
somewhat of a special case. This function is technically 
composed of two separate functions: curve drawing and 
filling regular objects. While the task of filling as presented 
here is specialized, it contains some useful insight into 
utilizing the power of the VSC. A curve-drawing algorithm, 
as an extension to the line-drawing algorithm previously 
discussed, has an even wider scope of potential. 

Curve drawing, in general, provides a useful function 
in many graphics applications. In some display systems, even 
drawing what appears to be a circle requires a general ellipse- 
drawing algorithm. This is true for the display monitor used 
in the project described in this report. An efficient algorithm 
for drawing ellipses is the midpoint algorithm.!8 This 
algorithm represents a general approach that can be used to 
build incremental algorithms for drawing curves other than 
circles or ellipses. 

The basis of an incremental curve drawing is a 
minimization of the linear error of the drawn curve in 
reference to the true curve. This is similar to the basis and 
approach of the line-drawing algorithm. A second basis for 
the approach must be its efficiency and accuracy in 
implementation. 
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The linear error can be minimized to a maximum of 
one-half the distance between vertically- or horizontally- 
adjacent pixels if an eight-way-stepping technique is used or 
minimized to a maximum of one-half the distance between 
diagonally adjacent pixels if a four-way-stepping technique 
is used. A comparison of the pixel-stepping directions is 
shown in Figure 23. Since the task here is to draw a 
filled ellipse, rather than the ellipse boundary, the 
four-way-stepping technique is utilized. 

Although the distance function used to determine the 


(a) FOUR-WAY STEPPING 


linear error in a line-drawing algorithm is linear, the distance 
function in a curve-drawing algorithm is quadratic. The 
consequence of a quadratic distance function is that the 
calculations of the function must be carried out with greater 
than 16-bit precision. The TMS32020 code implementation 
for determining the curve boundary of an arbitrary ellipse 
is found in the appendix beginning with the label FELIPS. 
Figure 24 shows a flow diagram of the filled-ellipse drawing 
routine. 


(b) EIGHT-WAY STEPPING 


Figure 23. Alternative Pixel-Stepping Directions 


23. A Graphics Implementation Using the TMS32020 and TMS34061 


627 


INITIALIZE 
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(a) FILLED-ELLIPSE SETUP PROCEDURE 


Figure 24. Filled-Ellipse Drawing Flow Diagram 
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(b) UPPER-HALF PLOTTING PROCEDURE 


Figure 24. Filled-Ellipse Drawing Flow Diagram 
(continued) 
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Figure 24. Filled-Ellipse Drawing Flow Diagram 
(concluded) 
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In the filled-ellipse routine, after the parameter 
initialization has taken place, the x-coordinate value (in the 
memory-space coordinate system) is saved on a stack for each 
y-coordinate value in a single quadrant arc. An example of 
this approach is shown in Figure 25. The values stored in 
the memory are the x-coordinate values relative to the center 
of the ellipse being located at the origin. 

To draw the filled ellipse on the screen, four loops are 
used instead of one or two to allow RAS3, RAS2, RASI, 
and RASO to be utilized simultaneously wherever possible, 
thereby enhancing the performance. This technique is 
possible since the approach to drawing a filled ellipse is to 
draw a series of horizontal lines. 

Starting at the top of the ellipse, the horizontal lines 
may be of the short variety and include no groups of eight 
pixels addressed as a single block. Going down the ellipse 
toward its center, the lines get longer and begin to include 
groups of eight pixels addressed as a single block. (This is 
only guaranteed to be true if the ellipse is specified to have 
a horizontal axis value of eight or larger.) Figure 26 depicts 
the difference between a ‘‘short’’ line and a ‘‘long’’ line. 
When the loop drawing the short lines is terminated by 
detecting the presence of eight pixel blocks, the second loop 


DATA DATA 
ADDRESS VALUE 
>0200 >O000A 
>0201 >O00A 
>0202 >000A 
>0203 >0009 B=6 
>0204 >0008 
>0205 >0007 
>0206 >0004 


is entered. This loop breaks the line up into a left segment 
part of one to eight pixels, a center segment part of a number 
of eight pixel blocks, and a right segment part of one to eight 
pixels. When the stack of x-coordinate values has been 
exhausted, the center of the ellipse is reached and the top half 
of the ellipse is drawn. 

Since the stack is a memory implementation, processing 
begins by accessing the stack in the reverse direction. The 
series of wide lines that include the eight pixel blocks are 
drawn until it is determined that a line contains no such 
blocks. When this condition is observed, the loop is 
terminated. The last loop, which draws the short lines, is 
executed until the stack is exhausted, and the drawing of the 
ellipse is completed. 

Several specific features are noted when using this 
coding of a filled-ellipse algorithm on the TMS32020. First, 
the symmetry of the ellipse is utilized to reduce the loop 
calculations to determine the boundary points. Second, in 
the loops to draw the short lines, the coding is similar to the 
special-case horizontal-line coding in the standard line- 
drawing algorithm. Third, the multiRAS capability of the 
VSC and the RPT instruction of the TMS32020 enhance the 
throughput on the longer lines. 
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Figure 25. Filled-Ellipse Plotting Stack 
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Figure 26. Horizontal-Line Construction of a Filled Ellipse 
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SUMMARY 


A proposed graphics system is presented in this report, 
which consists of an image control processor (the 
TMS32020), a display controller (the VSC), video RAMs, 
and program memory for the TMS32020. A display monitor 
is incorporated to view the graphics memory. Although a 
specific host processor and interface to the TMS32020 is not 
included in this specific project, a typical system may include 
a host. The TMS32020 is easily interfaced to any of a wide 
range of host processors. On the other hand, a designer may 
choose to incorporate the host functions into the TMS32020, 
in which case the present hardware can be expanded by 
adding additional program memories and a communications 
interface. 

The TMS32020, while designed for optimizing digital 
signal processing tasks, has a large number of features 
commonly found on many general-purpose microprocessors. 
By combining these features with a high-speed architecture, 
the TMS32020 provides significant advantages over those 
same microprocessors. The high-speed architecture is based 
on separate program and data memory spaces that allow full 
overlap of instruction fetch and execution, a 16 x 16-bit 
hardware multiplier with a 32-bit accumulator capable of 
performing multiply-accumulates in a single instruction cycle 
of 200 ns, auxiliary registers with a separate arithmetic unit 
to provide efficient indirect addressing and loop control, and 
a powerful instruction set that includes multifunction and 
repeat instructions. 

The system presented is efficient in graphics image 
generation because of several important features. First, by 
using the VSC and VRAMs, a large video display address 
space can be addressed with essentially no additional support 
logic, and video memory accesses are virtually unlimited. 
Second, the TMS32020 is easily interfaced to the VSC and 
VRAMs, again with essentially no additional logic. Third, 
the TMS32020 provides an instruction set that executes 
largely in a single instruction cycle, thus benefiting even 
highly optimized graphics algorithms, such as Bresenham’s 
line-drawing algorithm. Finally, the hardware multiplier in 
the TMS32020 minimizes execution time for matrix 
multiplications utilized in graphics for rotation, translation, 
and scaling. 

The designer is thus presented with a comprehensive 
solution to the problem of designing a graphics display 
system. A large number of product applications, such as 
matrix printers, laser printers, PC-based engineering 
workstations, graphics display terminals, CAD systems for 
logic design, and interactive application simulators, are all 
benefited with a TMS32020-based design. The powerful 
TMS32020 processor provides speed comparable to that of 
custom-logic designs and ease of development that matches 
conventional processors. The programmability of the 


TMS32020, as well as the VSC, permits shorter design 
cycles, simpler maintenance, and easier upgrade capability. 
The inherent flexibilities of a TMS32020-based system allow 
the design of a system powerful today and expandable 
tomorrow. 
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APPENDIX A 


TMS32020 AND TMS34061 GRAPHICS SYSTEM SCHEMATIC 
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X2_ 


U64 


R1,R2 
R3-R8 
R9-R19 


C1-C4 
C5 
C6-C13 
C14-C63 


Sl 


Parts List for the Graphics System Schematic 


TMS32020 
TMS34061 


TBP28S 166 
TMS4161 


SN74S138 
SN74LS244 
SN74LS125 
SN74LS04 
SN74S00 
SN74832 
SN74S86 
SN74LS245 
SN74AS74 
SN74S113 
SN74LS241 
SN74AS02 
SN74AS27 
SN74AS08 
SN74F299 


Digital Signal Processor 
Video System Controller 


2K x 8 Programmable ROM 
64K x 1 Multiport Video RAM 


3-to-8 Line Decoders/Demultiplexers 

Octal Buffers/Line Drivers/Line Receivers 

Quad Bus Buffer Gates with Three-State Outputs 
Hexadecimal Inverters 

Quad 2-Input Positive-NAND Gates 

Quad 2-Input Positive-OR Gates 

Quad 2-Input Exclusive-OR Gates 

Octal Bus Tranceivers 

Dual D-Type Positive-Edge-Triggered Flip-Flops 
Dual J-K Negative-Edge-Triggered Flip-Flops 
Octal Buffers/Line Drivers/Line Receivers 

Quad 2-Input Positive-NOR Gates 

Triple 3-Input Positive-NOR Gates 

Quad 2-Input Positive-NAND Gates 

8-Bit Bidirectional Shift/Store Registers 


20.000 MHz Oscillator 
18.432 MHz Oscillator 


33 Ohm 8-Resistor Pack 


390 Ohm Resistors 
1K Ohm Resistors 
4.7K Ohm Resistors 


39 wF Bypass Capacitors 
0.1 »F Capacitor 

0.1 »F Bypass Capacitors 
0.001 »F Bypass Capacitors 


SPST Momentary Contact Switch 
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APPENDIX B 
TMS32020 GRAPHICS SOFTWARE ALGORITHMS 
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32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 15:17:30 12-05-85 
PAGE 000! 
COPY PRELUDE . ASM 
TOT “GRAPHIC’ 
OPTION XREF 


FF80 
0022 


FF80 
0020 


FF80 
0020 


FF80 
0020 


FF80 
0020 


FF80 
0020 


FF80 
0020 


FF80 
0020 


FF80 
0020 


TARR ERE SEESESESZESEZESEE ERE ESE ERE SESE REE EERE EERE RE EARS EERE EE SE 


* 


This is the source module for the 1TMS32020 for a graphics 


* demonstration. The system is built using a VSC EVM board 
* containing the TMS34061 (VSC) and 2 Mbit of TMS4161 (VRAM) 
* configured in a 1024 x 512 pixel array with 4 bit pixels. 
* The VSC displays and maintains the contents of the VRAMs. 
* The TMS32020 ts responsible for creating (plotting) the 
* graphic images which are displayed. Access to the VRAMs 
* by the TMS32020 is provided through the VSCs X-Y indirect 
* addressing access. Memory control by the VSC is 
* programmable through 18 accessible registers to tailor the 
* available memory and memory access to the selected 
* monitor. The memory on the VSC EVM has been configured 
* for use with a TIPC monitor which has a 720 x 300 pixel 
* display. 
(RESP RREPERERESE EEE SERS ESSE ESSEC SESE ESE ESSERE ERS AREER EEE EEE 
# 
AORG >0000 
* 
RESET B INIT * branch to processor initialization 
# 
INTO 8 DUMMY * no interrupts are uttlized 
* 
INT1 B OUMMY 
a 
INT2 8 DUMMY 
# 
AORG >0018 
* 
TINT B DUMMY 
* 
RINT B OUMMY 
* 
XINT B DUMMY 
* 
TRAP B DUMMY 
* 
DUMMY 8 DUMMY 
COPY INI T.ASM 
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BO002 : 
680003 TPE RRR SEEZERARE RSE ESE REZ ARES ESSE SESE ESE SERS EEE SESS EES SESE SS 
80004 * This routine is executed each time the RS- pin is pulled 
60005 * active low and released. Al! internal registers 
B0006 * (memory-mapped, auxiliary, status, data-memory) of the 
BOO007 * TMS32020 are fnitialized. All control registers and video 
B0008 * data-memory of the 1MS34061 are initialized . 
B0009 TRB RARE PERE RE RE REE RE RES SES ERE ES EES ESSERE ERTS ESSERE SEE SSE ESS Y 
BO0010 * 
80011 0022 ~=OINIT EQU $ 
BO0!2 0022 CEO] DINT * disable all {interrupts 
B0013 0023 C800 LDPK 0 * set-up to access Block B2 
B0014 0024 CE02 ROVM * reset overflow mode 
BOOTS 0025 5588 LARP ARO * set auxiliary pointer to reg 0 
BOOI6 0026 CE08 SPM 0 * set P reg shift mode to no shift 
BOO17 0027 CE20 RTXM * set external serial framing 
BO018 0028 CEND SXF * set external flag pin 
BO019 0029 CE07 SSXM * set ALU sign-extensfion mode 
B0020 002A CE04 CNFD * configure Block BO as data memory 
B0021 002B 0001 LALK >FFFF 
002C FFFF 
BOO022 0020 6003 SACL PRD * initialize period register 
80023 002E CA00 ZAC 
BO0024 002F 6004 SACL IMR * disable each interrupt 
B0025 0030 6005 SACL GREG * declare all memory local 
BO026 0031 CAO! LACK 1 
B0027 0032 6062 SACL ONE 
BO028 * 
B0029 0033 VSCIN EQU $ 
B0030 0033 C008 LARK ARO,8 
B0031 0034 5589 LARP ARI 
B0032 0035 D100 LRLK ARI, VSCREG 
0036 0400 
80033 0037 CAOA LACK >000A * load Horizontal End Sync 
B0034 0038 60E0 SACL *0+ 
BO0035 0039 68E0 SACH *0+ 
BO036 003A CAI7 LACK >0017 * load Horizontal End Blank 
B0037 0038 60E0 SACL *0+ 
BO038 003C 68E0 SACH *0+ 
BO039 003D CA7I LACK >007! * load Horizontal Start Blank 
BO0040 003E 60E0 SACL *0+ 
BO041 OO3F 68E0 SACH *0+ 
BO042 0040 CA76 LACK >0076 * load Horizontal Total 
BO043 0041 60E0 SACL *0+ 
B0044 0042 68E0 SACH 0+ 
B0045 0043 CA02 LACK >0002 * load Vertical End Sync 
B0046 0044 60E0 SACL *0+ 
B0047 0045 68E0 SACH #*0+ 
B0048 0046 CAI3 LACK >0013 * load Vertical End Blank 
B0049 0047 60E0 SACL *0+ 
B0050 0048 68E0 SACH *0+ 
BO051 0049 0001 LALK >103F * load Vertical Start Blank 
O04A 103F 
B0052 004B 60E0 SACL *0+ 
BO053 O004C 6CEO SACH *0+,4 
BO00S54 0040 000! LALK >1042 * load Vertical Total 
OO4E 1042 
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B0055 
B0056 
60057 
60058 
B0059 
B0060 
B0061 
B0062 
80063 
B0064 
B0065 
B0066 
B0067 
B0068 
80069 
680070 
60071 
B0072 
80073 
B0074 
B0075 
B0076 
80077 
60078 
80079 
B0080 
B0081 
60082 
B0083 


B0084 
60085 
B0086 
80087 


60088 
80089 
680090 
80091 
B0092 
80093 
B0094 
B0095 
B0096 
B0097 
80098 
B0099 


60100 
B0101 
B0102 
80103 
BO104 
80105 
B0106 
B0107 
60108 


OO4F 
0050 
0051 
0052 
0053 
0054 
0055 
0056 
0057 
0058 
0059 
005A 
0058 
00SC 
0050 
00S5E 
OOSF 
0060 
0061 
0062 
0063 
0064 
0065 
0066 
0067 
0068 


0069 
006A 
0068 


006C 
0060 
006E 
O06F 
0070 
0071 
0072 
0073 
0074 
0075 
0076 


0077 
0078 
0079 
OO7A 
0078 
007Cc 
0070 
OO7E 
OO7F 
0080 
0081 
0082 
0083 
0084 


32020 FAMILY MACRO ASSEMBLER PC 1.0 
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60E0 SACL "0+ 
6CEO SACH  *0+,4 
CA02 LACK 0002 
60E0 SACL "0+ 
68E0 SACH "0+ 
CA00 LACK  >0000 
60E0 SACL "0+ 
68E0 SACH "0+ 
CA00 LACK  >0000 
60E0 SACL "0+ 
68E0 SACH "0+ 
CALO LACK 0010 
68E0 SACH "0+ 
60E0 SACL "0+ 
CA20 LACK >0020 
68E0 SACH "0+ 
60E0 SACL "0+ 
CA00 LACK  >0000 
60E0 SACL "0+ 
68E0 SACH "0+ 
CA20 LACK 0020 
60E0 SACL "0+ 
68E0 SACH  *0+ 
CA00 LACK  >0000 
60E0 SACL "0+ 
68E0 SACH "0+ 

# 
CA00 LACK 0000 
FESO CALL CLEAR 
040C 

* 
CA00 ZAC 
5588 LARP ARO 
0000 LRLK ARO, >0200 
0200 
CBFF RPTK 255 
60A0 SACL “+ 
CBFF RPTK 255 
60A0 SACL "+ 
C063 LARK ARO, 963 
CBIC RPTK 28 
60A0 SACL "+ 

* 
CAO! LACK 
C806 LOPK 6 
602E SACL ONEG 
0001 LALK ROMDAT 
OBED 
5837 TBLR —_XMAX 
002E ADD —ONE6 
5839 TBLR —-YMAX 
002E ADD —-ONE6 
583F TBLR —-AMAX 
002E ADD —-ONE6 
5840 TBLR AMIN 
002E ADD ONE 
584! TBLR BMAX 
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PAGE 0003 


Display Update Register 


Display Start Register 


Vertical! Interrupt 


Control Register | 


Control Register 2 


Status Register 


X-Y Offset Register 


X-Y Address Register 


initialize VRAMs 


initialfze Block BO 


initialize Block B61 


{nitfalfze Block 82 


load 


pixel processing constants 


maximum.X address (TIPC = 719) 


maximum Y address (TIPC = 299) 


ellipse demo constants 
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B0109 
BO110 
BOIL! 
B0112 
80113 
B0114 
BO115 
B0116 
B0117 


0003 


0085 
0086 
0087 
0088 
0089 
008A 
0088 
008C 
0080 
008E 
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002E ADD ONE6 
5842 TBLR = BMIN 
CA03 LACK 3 
6043 SACL _ THREE6 
CA0S LACK 5 
6044 SACL FIVE6 
CAdv9 LACK 9 
6045 SACL  NINE6 
FF80 B PARSER 
008F 

COPY PARSER. ASM 
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™S32020 - TMS3406! DEMONSTRATION PAGE 0005 T™MS32020 - 1TMS34061 DEMONSTRATION : PAGE 0006 
C0002 i C0048 * 
C0003 ESET ECTS ESI IS ETT ECE SCTESSVILT TV TVIS TTT TTT TTT TTT TT C0049 OOAB FF80 CMDTBL 8B INIT * 9000 
C0004 * The PARSER routine controls the demonstration of the GOAC 0022 
C0005 * graphics display system by reading the contro! data table C0050 OOAD FFS8O B DELAY * 0001 
C0006 * and routing execution to the appropriate function. A OOAE OODC 
C0007 * synchronization word, 32020, must occur at the beginning C0051 OOAF FF8O B CLRSCR 9 * 0002 
C0008 * of the command table and between each command. The parser 00B0 00F7 
C0009 * uses this synchronization word to recover from errors in C0052 00B1 FF8O B PIXELC * 0003 
C0010 * entry of data in the command table. Following the 00B2 0100 
C0011 * synchronization word, the command word is read and used to C0053. 0083 FFS8O B LINEC * 0004 
C0012 * select one of the appropriate functional routines. In the 0084 O10F 
C0013 * event that the command word {fs not within the command C0054 0085 FF80 B OUMONE * 0005 
co014 * sequence, searching continues until the next 0086 00DB 
C0015 * synchronization word is found and the process repeats. C0055 0087 FF8O B POL YC * 0006 
C0016 * When a valid command word {fs identified, execution is O00B8 0120 
C0017 * transfered to the appropriate routine which must read all C0056 00B9 FF8O B OUMONE * 0007 
C0018 * data from the list appropriate to that routine. 0OBA 0008 
C0019 CeCe PSE ESSE STIS IIIT TTT T IT IV ISTIC TIC ITITTTI TT TTT TTT) C0057 00BB FF8O B OUMONE * 0008 
C0020 7 0oBC 0008 
c0021 CO8F PARSER EQU $ C0058 O0B8D FF8O B FELPSC * 0009 
C0022 OO08F C800 LOPK 0 * set-up to access Block B2 OOBE 0158 
C0023 0090 000! LALK BEGIN * top of command table C0059 OOBF FF80 B SPIN * OOOA 
0091 0B9B 00co 0169 
C0024 0092 606C SACL CMDADR * save current table address C0060 O00CI FFS8O B ELPSET * 0008 
C0025 0093 C800 PLOOP  LODPK 0 * set-up to access Block B2 00Cc2 0100 
C0026 0094 0000 LRLK ARO, 32020 * load synchronization word C006! 00C3 FFS8O 8 WELPSC * 000C 
0095 7014 00C4 0212 
C0027 0096 5589 LARP ARI C0062 00C5 FF8O B DUMONE * 0000 
C0028 0097 206C LAC CMDADR * get current table address 00cé6é 000B 
C0029 0098 5869 SYNC TBLR SYNCH * read word; expect a sync value C0063 00C7 FF8O B OUMONE * OQOOE 
C0030 0099 0062 ADD ONE 00Cc8 00D0B 
C0031 009A 3169 LAR AR! ,SYNCH C0064 00C9 FFS80 B STOP * QOOF 
C0032 009B 55D0 MAR *O- OOCA 0009 
C0033 009C FB90 BANZ SYNC * test for sync value c0065 00CB FF8O B DEMOO0 * 0010 
0090 0098 00cc 0280 
C0034 ; * c0066 O00CD FF80 B OEMO! * 0011 
C0035 O009E 586B TBLR CMND * read word; expect a valid command OOCE 0281 
C0036 O09F 0062 ADD ONE C0067 OOCF FF80 B DEMO2 * 0012 
C0037 00A0 606C SACL CMDADR * save current table address 00D0 O2FC 
C0038 OO0A! CAI7 LACK CMDLEN C0068 0001 FF8O B DEMO3 * 0013 
C0039 O0A2 1068 SUB CMND 0002 0343 
C0040 O0A3 F280 BLEZ PLOOP * verify command fs valid C0069 0003 FFBO B OEMO4 * 0014 
00A4 0093 0004 03BF 
C0041 . C0070 0005 FF8O B DEMOS * 0015 
C0042 OO0A5 0001 LALK CMOTBL 0006 0450 
OOA6 OOAB C007! 0007 FF80 B CUBER * 0016 
C0043 O0A7 0168 AOD CMND, | ; 0008 O8FC 
C0044 O0A8B CE24 CALA * go execute selected command C0072 OO17 CMOLEN €QU $-CMOTBL /2 
C0045 OOA9 FFBO B PLOOP C0073 * 
QOOAA 0093 
C0046 : 
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c0075 
C0076 0009 
000A 
C0077 00DB 
C0078 
C0079 
C0080 
c0081 000C 
C0082 0000 
C0083 OODE 
C0084 OODF 
C0085 00E0 
C0086 00E! 
C0087 00E2 
Co08s 00E3 
OOE 4 
C0089 00E5 
C0090 O0E6 
C0091 O0E7 
C0092 O0E8 
C0093 00E9 
C0094 OO0EA 
C0095 O0EB 
C0096 OOEC 
C0097 OOED 
C0098 OOEE 
C0099 OOEF 
C0100 OOFO 
OOF | 
00F 2 
00F3 
CO102 OOF 4 
OOFS 
C0103 OOF6 
C0104 
C0105 
C0106 
C0107 OOF7 
C0108 O0F8 
C0109 OOFS 
CO110 OOFA 
COLL! OOFB 
C0112 OOFC 
C0113 OOFD 
OOFE 
COI14 OOFF 
C0115 
C0116 
C0117 
C0118 0100 
COll9 0101 
C0120 0102 
Col21 0103 
C0122 0104 
C0123 0105 
C0124 0106 
C0125 0107 


co101 
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FF80 
0009 
CE26 


206C 
586A 
0062 
606C 
306A 
C168 
5588 
FF80 
OOF 4 
c808 
CA01 
2060 
0868 
6080 
C809 
6020 
2888 
6828 
2050 
2050 
D004 
0001 
F680 
OOEF 
FB99 
OO0ES 
CE26 


206C 
5868 
0062 
606C 
CAFF 
4£68 
FE80 
040C 
CE26 


206C 
C806 
5800 
5802 
002E 
5801 
5803 
002E 


STOP 


OUMONE 


DELAY 


DELAY! 


WAITI 


WAITD 


DECDLY 


# 
5 
5] 


CLRSCR 


PIXELC 


B 


RET 
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STOP 


TIME DELAY CONTROL 


BZ 
BANZ 


RET 


CMDADR * get current table address 
TIMDLY * read word; expect a time value 
ONE 

CMDADR * save current table address 
ARO, TIMOLY 

ARI, TMPI 

ARO 

DECOLY 


>0008 * VSC register page 

1 

VSBLL * read Vertical Start Blank 
VSBLH,8 

* 


>0009 * VSC register page 

VINTL * load Vertical Interrupt 
*,8,0 

VINTH 

STATL 

STATL * wait for vertical 
>0001 


tnterrupt 


WAITO 


WAITI,*-,1 * decrement delay counter 


CLEAR SCREEN CONTROL 


CMDADR * get current table address 

TMP I * read word; expect a color value 
ONE 

CMDADR * save current table address 
>00FF 

TMP I 

CLEAR * fnitialize the VRAMs 


WRITE PIXEL CONTROL 


CMDADR * get current table address 


XI * read word; expect x coordinate 


Yl * read word; expect y coordinate 
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c0126 0108 
C0127 0109 
C0128 O10A 
C0129 0108 
C0130 010C 
0100 
C0131 O10E 
C0132 
C0133 
C0134 
C0135 O10F 
C0136 0110 
C0137 O111 
C0138 0112 
C0139 0113 
CO140 0114 
C0141 0115 
C0142 0116 
C0143 O117 
C0144 0118 


C0145 0119 


C0146 OLA 
CO147 O118 
C0148 O1IC 
C0149 0110 
O1IE 
CO15O OLIF 
cos} 
C0152 
C0153 
C0154 0120 
C0155 012! 
C0156 0122 
C0157 0123 
C0158 0124 
C0159 0125 
C0160 0126 
C0161 0127 
C0162 0128 
0129 
C0163 012A 
012B 
C0164 
C0165 012C 
C0166 0120 
CO167 O12E 
C0168 O12F 
C0169 0130 
C0170 0131 
COI71 0132 
C0172 0133 
C0173 0134 
C0174 0135 
0136 
co175 
C0176 0137 
C0177 0138 
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5804 
C800 
0062 
606C 
FE8O 
0524 
CE26 


206C 
C806 
5800 
002E 
5801 
002E 
5802 
002E 
5803 
002E 
5804 
c800 
0062 
606C 
FE80 
0524 
CE26 


206C 
C806 
5830 
002E 
5804 
002E 
3030 
5588 
FB90 
012C 
FF80 
0154 


7030 
5822 
5824 
002E 
5823 
5825 
c800 
0062 
606C 
FF80 
0145 


7030 
C800 


LINEC 


POLYC 


POLY! 


POLYL 
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COLOR * read word; expect a color value 
0 

ONE - 

CMDAODR * save current table address 
LINE * plot the point 


LINE DRAWING CONTROL 


CMDAOR * get current table address 
6 


Xl * read word; expect x! coordinate 
ONE6 
Yi * read word; expect yl! coordinate 
ONE6 
X2 * read word; expect x2 coordinate 
ONE6 
Y2 * read word; expect y2 coordinate 
ONE6 
_ COLOR * read word; expect a color value 
0 
ONE 
CMDADR * save current table address 
LINE * plot the line 


DRAWING CONTROL 
e 


CMDADR 
6 
NPOINT 
ONE6 
COLOR 
ONE6 
ARO,NPOINT 
ARO 

POLY 1 


* get current table address 


* read word; expect # polygon sides 


* read word; expect a color 


POL YD 


ARO, NPOINT 


XA * read word; expect x! value 


YA * read word; expect yl value 


ONE 
CMDADR 
POLYN 


* save current table address 


ARO, NPOINT 
0 


45] 
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C0178 
C0179 
C0180 
C0181 
C0182 
C0183 
C0184 
C0185 
C0186 
C0187 
C0188 


C0189 
C0190 
CO191 
C0192 
C0193 
C0194 
C0195 
C0196 


C0197 
C0198 
C0199 
C0200 
c0201 


c0202 
C0203 
C0204 
C0205 
C0206 
C0207 
C0208 
C0209 
C0210 
C0211 
C0212 
C0213 
C0214 
C0215 
C0216 
C0217 
c0218 
C0219 


C0220 
c0221 


C0222 
C0223 
C0224 
C0225 


C0226 
C0227 
C0228 
c0229 
C0230 


0139 
O13A 
0138 
013C 
0130 
013€ 
013F 
0140 
0141 
0142 
0143 
0144 


0145 
0146 
0147 
0148 
0149 
O14A 
0148 
014C 
0140 
O14E 
014F 
0150 
0151 
0152 
0153 


0154 
0155 
0156 
0157 


0158 
0159 
O15A 
0158 
015C 
0150 
O15E 
O1SF 
0160 
0161 
0162 
0163 
0164 
0165 
0166 
0167 
0168 


0169 
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206C 
C806 
5802 
5824 
002E 
5803 
5825 
C800 
0062 
606C 
FE80 
0524 


C806 
2024 
6000 
2025 
6001 
3030 
FB90 
0137 
2022 
6002 
2023 
6003 
FE80 
0524 
CE26 


POLYN 


C800 POLYD 
0062 
606C 
CE26 


206C FELPSC 
C806 
5810 
002E 
5811 
002E 
5812 
002E 
5813 


002E 
5604 


c800 
0062 
606C 
FESO 
06BB 
CE26 


206C SPIN 


LAC 

LOPK 
TBLR 
TBLR 
AOD 

TBLR 
TBLR 
LOPK 
ADD 

SACL 
CALL 


CMDADR * 


X2 ‘ 


CMDADR * 


c- 
z 
m 

= 


X1 . 


Yl . 
ARO, NPOINT 
POLYL * 


ONE 
CMDAOR * 


x(n) 
y(n) 


last 


x(Nt1) = 


save 
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current table address 


word; expect next x value 


word; expect next y value 


current table address 
the line 


= x(n+l) 
= y(nt+l) 


point 2 


x(1) 


* y(nt+i) = y(t) 
* plot 


the line 


current table address 


FILLED ELLIPSE ORAWING CONTROL 


LAC 
LOPK 
TBLR 
ADD 
TBLR 
ADD 
TBLR 
ADD 
TBLR 
ADD 
TBLR 
LDPK 
ADD 
SACL 
CALL 


RET 


CMDAOR * 
6 

XC . 
ONE6 

yc * 
ONES 

A * 
ONE6 

B a 
ONE6 
COLOR * 
0 

ONE 
CMDADR * 
FELIPS * 


get current table 
read word; expect 
read word; 
read word; 
read word; 


read word; 


address 

center x value 
expect center y value 
expect horfzontal axis 
expect vertical axis 


expect a color 


save current table address 
draw a filled ellipse 


SPINNING FILLED ELLIPSE DRAWING CONTROL 


LAC 


CMDAOR 


* get current table address 
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c0231 
C0232 
C0233 
C0234 
c0235 
C0236 
C0237 
C0238 
C0239 
C0240 
C0241 
C0242 
C0243 
C0244 
C0245 
C0246 
C0247 
C0248 
C0249 
C0250 
C0251 
C0252 
C0253 
C0254 
C0255 
C0256 
C0257 
C0258 
C0259 
C0260 
C0261 


C0262 
C0263 
C0264 
C0265 
C0266 
C0267 
C0268 
C0269 
C0270 
c0271 
C0272 
C0273 
C0274 
c0275 
C0276 


C0277 
c0278 


C0279 
C0280 
C0281 


C0282 
C0283 


O16A 
0168 
016C 
0160 
O16E 
O16F 
0170 
0171 
0172 
0173 
0174 
0175 
0176 
0177 
0178 
0179 
O17A 
0178 
017C 
0170 
O17E 
OL7F 
0180 
0181 
0182 
0183 
0184 
0185 


0186 
0187 
0188 


0189 
O18A 
0188 
018C 
0180 
O18E 
018F 
0190 
0191 
0192 
0193 
0194 
0195 
0196 
0197 
0198 
0199 
OL9A 
0198 
019C 
0190 
019E 
O19F 
01A0 
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C806 
5822 
002E 
5823 
002E 
5824 
5832 
002E 
5825 
002E 
5831 
002E 
5834 
C800 
0062 
606C 
2162 
606A 
C806 
2831 
6833 
CAOF 
4E31 
6031 
2032 
CBOF 
4734 
6032 


3032 
FF80 
01A0 


702F 
2022 
6010 
2023 
6011 
2024 
6012 
1034 
6024 
2025 
6013 
2031 
6004 
FE80 
0688 
cs00 
FE80 
00E0 
C806 
CA00 
FESO 
040C 
302F 


LOPK 
TBLR 
AOD 
TBLR 
ADD 
TBLR 
TBLR 
A0O0 
TBLR 
ADD 
TBLR 
AOD 
TBLR 
LOPK 
AOD 
SACL 
LAC 
SACL 
LOPK 
LAC 
SACH 
LACK 
AND 
SACL 
LAC 
RPTK 


SAR 
LAC 
SACL 
LAC 
SACL 
LAC 
SACL 
SUB 
SACL 
LAC 
SACL 
LAC 
SACL 
CALL 


LOPK 
CALL 


LOPK 
ZAC 
CALL 


LAR 


5588 SPINIE LARP 


SAVCLR,8 
CLRSAV 
>000F 
SAVCLR 
SAVCLR 
SAVEA 

15 

SPEED 
SAVEA 


ARO, SAVEA 


SPINIE 


ARO, SAVARO 


XA 
XC 
YA 


DELAY! 
6 
CLEAR 


ARO ,SAVARO 


ARO 


read 


read 


read 


read 


read 


read 


save 


load 


seperate colors : 
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word; expect center x value 


word; expect center y value 


word; expect hor{fzontal axis 


word; expect vertical axis 


word; expect the colors 


word; expect rate of change 


current table address 


time delay constant 


back side 


: front side 


# cycles = a / delta 


load 
load 
load 
a= 

load 
load 


draw 


wait 


center x value 
center y value 


hor {zontal axis value 


8 - delta 


vertical axis value 


color 
a filled ellipse 


a while 


clear the screen 
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C0284 


C0285 
C0286 
C0287 
C0288 
C0289 
C0290 
C0291 
C0292 
C0293 
C0294 
C0295 
C0296 


C0297 
C0298 


C0299 
C0300 
C0301 


C0302 
C0303 
C0304 


C0305 
C0306 
C0307 
C0308 
C0309 
C0310 
co311 
C0312 
C0313 
C0314 
C0315 
C0316 
C0317 
C0318 
C0319 


C0320 
C0321 


C0322 
C0323 
C0324 


C0325 
C0326 
C0327 


C0328 
C0329 
C0330 
C0331 


OIA] 
01A2 
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FB90 
0189 


2022 
6000 
6002 
2023 


SPIN2 


SPIN2E 


BANZ 


LAC 
SACL 
SACL 
LAC 
SUB 
SACL 
AOD 
SACL 
LAC 
SACL 
CALL 


LDOPK 
CALL 


LOPK 


ZAC 
CALL 


LAR 


SPINI 

XA 

X] iy 
X2 , 
YA 

Y8 

Yl * 
YB, 1 

Y2 . 
SAVCLR 
COLOR . 
LINE * 
0 

DELAY! * 
6 

CLEAR ° 
ARO, SAVEA 
SPINZE 
ARO, SAVARO 
XA 

XC . 
YA 

Yc . 
XB 

A * 
SPEEDO 

XB £ 
YB 

8B # 
CLRSAV 
COLOR . 
FELIPS ® 
0 

DELAY! * 
6 

CLEAR * 
ARO, SAVARO 
ARO 

SPIN2 
SAVCLR * 
CLRSAV 
CLRSAV 


15:17:30 12-05-85 
PAGE 0011 


Xl 
X2 


XC 
XC 


= YC - B 
Yc + 8 
load color 


the line 


a while 


clear the screen 


load center x value 

load center y value 

toad horizontal axis value 
a=a+ delta 

load vertical axis value 
load color 


Graw a filled ellipse 


wait a while 


clear the screen 


swap colors : front and back 
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C0332 
C0333 
C0334 
C0335 
C0336 


C0337 
C0338 
C0339 
C0340 
C034! 
C0342 
C0343 
C0344 
C0345 
C0346 
C0347 
C0348 
C0349 
C0350 
C0351 
C0352 
C0353 
C0354 
C0355 
C0356 
C0357 
C0358 
C0359 
C0360 
C0361 
C0362 
C0363 
C0364 


C0365 


C0366. 


C0367 
C0368 
C0369 
C0370 
C0371 
C0372 
C0373 
C0374 
C0375 
C0376 
C0377 
C0378 
C0379 
C0380 
C0381 
C0382 
C0383 
C0384 


C0385 


01D6 
0107 
0108 
0109 
O1DA 
0108 
010C 


0100 
O1DE 
O1DF 
O1E£0 
O1E1 
O1E2 
O1E3 
O1E4 
O1ES 
O1E6 
O1E7 
O1E8 
O1E9 
O1EA 
O1EB 
O1EC 
O1ED 
O1EE 
O1EF 
OIFO 
OIF! 


O1F2 
O1F3 
OIF 4 


O1FS 
O1F6 
OIF? 
O1F8 
O1F9 
OIFA 
O1FB 
O1FC 
O1FO 
OIFE 
OFF 
0200 
0201 
0202 
0203 
0204 
0205 
0206 
0207 
0208 
0209 
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6031 
2025 
112E 
6025 
F180 
0186 
CE26 


ELPSET 


RET 


FILLED ELLIPSE SET 


LAC 
LOPK 
TBLR 
AOD 


SAVCLR 

YB 

ONE6, | 

YB bd 
SP INO 


CMDADR * 


SAVCLR * 
OA * 
DB . 
SAVEA id 
ONE 
CMDADR * 
6 

ARO ,SAVEA 
SETEIE 
ARO, SAVARO 


XC * 


SAVCLR * 
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b=b- 2 


DRAWING CONTROL 
get current table address 


read word; expect center x value 


read word; expect center y value 


read word; expect horizontal axis 


read word; expect vertical axis 


read word; expect a color 
read word; expect horizontal delt 
read word; expect vertical delta 


read word; expect # of ellipses 


save current table address 


Joad center x value 

load center y value 

load horfzontal axis value 
a=a+ da 

load vertical axis value 
b = b + db 

load a color 


color = (color + 1) &7 
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TMS32020 - TMS34061 DEMONSTRATION PAGE 0013 TMS32020 - TMS34061 DEMONSTRATION PAGE 0014 
C0386 020A 603! SACL SAYCLR * if color = 0 then color = 1 C0440 023E 6024 SACL XB *"a=a+da 
C0387 020B FE80 SETE2 CALL FELIPS * draw a filled ellipse C0441 023F 103F SUB AMAX 

020C 0688 C0442 0240 F180 BGZ WELP2 *" a> amax ? 
C0388 0200 302F LAR ARO, SAVARO 0241 0246 
C0389 O20E 5588 SETEIE LARP ARO C0443 0242 003F ADO AMAX 
C0390 O20F FB90 BANZ SETE1 C0444 0243 1040 SUB AMIN 

0210 O1F5 C0445 0244 F180 BGZ WELP3 “a < amin ? 
C0391 0211 CE26 RET 0245 0249 
C0392 7 C0446 0246 2035 WELP2 LAC DA 
C0393 - FILLED ELLIPSE DEMONSTRATION CONTROL C0447 0247 CE23 NEG 
C0394 * C0448 0248 6035 SACL DA * da = -da 
C0395 0212 206C WELPSC LAC CMDADR * get current table address C0449 0249 2025 WELP3 LAC YB 
C0396 0213 C806 LOPK 6 C0450 024A 6013 SACL 8 * load vertical axis value 
C0397 0214 5822 TBLR XA * read word; expect center x value C0451 024B 0036 ADD OB 
C0398 0215 002E ADO ONE6 C0452 024C 6025 SACL YB * b= b + db 
C0399 0216 5823 TBLR YA * read word; expect center y value C0453 0240 1041 SUB BMAX 
C0400 0217 002E ADD ONE6 C0454 024E€ F180 BGZ WELP4 "b> bmax ? 
C0401 0218 5824 TBLR XB * read word; expect horizontal axis O24F 0254 
C0402 0219 002E ADD ONE6 C0455 0250 004! AOD BMAX 
C0403 O21A 5825 TBLR Y8 * read word; expect vertical axis C0456 0251 1042 SUB BMIN 
C0404 02!1B 002E ADD ONE6 C0457 0252 F180 BGZ WELPS *b < bmin ? 
C0405 021C 5831 TBLR SAVCLR * read word; expect a color 0253 0257 
C0406 0210 002E ADD ONE 6 C0458 0254 2036 WELP4 LAC DB 
C0407 O21E 5835 TBLR DA * read word; expect hor{fzontal delt C0459 0255 CE23 NEG 
C0408 O21F 002E ADD ONE6 C0460 0256 6036 SACL 0B * db = -db 
C0409 0220 5836 TBLR 08 * read word; expect vertical delta C0461 0257 2022 WELPS LAC XA 
C0410 0221 002E ADD ONE6 C0462 0258 6010 SACL XC * load center x value 
C0411 0222 5805 TBLR OX * read word; expect center x delta C0463 0259 0005 ADD OX 
C0412 0223 O02E ADD ONE6 C0464 025A 6022 SACL XA * xe = xc + dx 
C0413 0224 5806 TBLR OY * read word; expect center y delta C0465 025B 103B SUB XMAXA 
C0414 0225 002E AOD ONE6 C0466 025C F180 BGZ WELP6 * xc > xcmax ? 
C0415 0226 5832 TBLR SAVEA * read word; expect # of ellipses 0250 0262 
C0416 0227 C800 LOPK 0 C0467 O25€ 003B ADO XMAXA 
C0417 0228 0062 ADD ONE C0468 025F 103C SUB XMINA 
C0418 0229 606C SACL CMDADR * save current table address C0469 0260 F180 BGZ WELP7 * xe < xemin ? 
C0419 022A C806 LOPK 6 0261 0265 
C0420 022B 2037 LAC XMAX C0470 0262 2005 WELP6 LAC DX 
C0421 022C 1024 SUB XB C0471 0263 CE23 NEG 
C0422 022D 6038 SACL XMAXA * xcmax = xmax - a C0472 0264 6005 SACL DX * dx = -dx 
C0423 022E 2038 LAC XMIN C0473 0265 2023 WELP7 LAC YA 
C0424 022F 0024 ADD XB C0474 0266 6011 SACL Yc * load center y value 
C0425 0230 603C SACL XMINA * xcmin = xmin + a C0475 0267 0006 ADO DY 
C0426 0231 2039 LAC YMAX C0476 0268 6023 SACL YA * ye = yc + dy 
C0427 0232 1025 SUB YB C0477 0269 1030 SUB YMAXB 
C0428 0233 6030 SACL YMAXB * yomax = ymax - b C0478 026A F180 BGZ WELP8 * yc > ycomax ? 
C0429 0234 203A LAC YMIN 026B 0270 
C0430 0235 0025 ADO YB C0479 026C 003D A00 YMAXB 
C0431 0236 603E SACL YMINB * yemin = ymin + b C0480 0260 !03E SUB YMINB 
C0432 . C0481 026E F180 BGZ WELP9 * ye < yemin ? 
C0433 0237 3032 WELPO LAR ARO,SAVEA O26F 0273 
C0434 0238 FF80 B WELPIE C0482 0270 2006 WELP8 LAC DY 

0239 027C C0483 0271 CE23 NEG 
C0435 - C0484 0272 6006 SACL DY * dy = -dy 
C0436 023A 7O2F WELPI SAR ARO, SAVARO C0485 0273 2031 WELP9 LAC SAVCLR- 
C0437 0238 2024 LAC XB C0486 0274 6004 SACL COLOR * load a color 
C0438 023C 6012 SACL A * load horizontal axis value C0487 0275 002E ADD ONE6 
C0439 023D 0035 AOD DA C0488 0276 D004 ANDK >0007 
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THS32020 - THS3406!1 DEMONSTRATION PAGE 0015 THS32020 - TMS34061 DEMONSTRATION PAGE 0016 
0277 0007 00002 . 
C0489 0278 6031 SACL SAVCLR * color = (color + 1) & 7 00003 0280 CE26 DEMOO RET 
C0490 0279 FE8O CALL FELIPS °° draw a filled ellipse 00004 - 
027A 0688 00005 - LINE DRAWING DEMONSTRATION #1 
C049! 0278 302F LAR ARO, SAVARO 00006 . 
C0492 027C 5588 WELPIE LARP ARO 00007 0281 C806 DEMO! LOPK 6 
C0493 0270 FB90 BANZ WELP! 00008 0282 CAFO LACK 240 
O27E 023A 00009 0283 6022 SACL XA * x! = 240 
C0494 027F CE26 RET 00010 0284 CA64 LACK 100 
C0495 * 0001! 0285 6023 SACL YA * y2 = 100 
0004 COPY LOEMOS.ASM 00012 0286 CA00 ZAC 
00013 0287 6024 SACL XB * x2 = 0 
00014 0288 6025 SACL YB "y2 = 0 
00015 0289 CAO! LACK I 
00016 028A 6031 SACL SAVCLR * color = |! 
00017 * 
00018 0288 COEE LARK ARO, 238 
00019 O28C 702F DIL! SAR ARO, SAVARO 
00020 028D 2022 LAC XA 
0002! O28E 6000 SACL X1 * load xl coordinate value 
00022 O28F 002E ADO ONE6 
00023 0290 6022 SACL XA *xl=xl +1 
00024 0291 2024 LAC XB 
00025 0292 6002 SACL X2 * load x2 coordinate value 
00026 0293 0043 ADD THREE6 
00027 0294 6024 SACL XB "x2 = x2 + 3 
00028 0295 2023 LAC YA 
00029 0296 6001 SACL Y! " load yl coordinate value 
00030 0297 2025 LAC YB 
0003! 0298 6003 SACL Y2 ® load y2 coordinate value 
00032 0299 2031 LAC SAVCLR 
D0033 029A 6004 SACL COLOR * load a color 
00034 0298 002E ADD ONE6 
00035 029C 6031 SACL SAVCLR ®* color = (color + 1) & 7 
00036 0290 132E SUB ONE6, 3 
00037 029E F380 BLZ DILIA 
O29F 02A2 
00038 O2A0 202E LAC ONE6 
00039 O2A! 603! SACL SAVCLR * if color = 0 then color = 1! 
00040 02A2 FEB8O DILIA CALL LINE * plot the line 
02A3 0524 
00041 O02A4 302F LAR ARO, SAVARO 
00042 02A5 5588 LARP ARO 
00043 02A6 FB90 BANZ DILI 
02A7 028C 
00044 * 
00045 02A8 C062 LARK ARO,98 
00046 02A9 702F DIL2 SAR ARO, SAVARO 
00047 O2AA 2022 LAC XA 
00048 02AB 6000 SACL X1 * load x! coordinate value 
D0049 O2AC 2024 LAC XB 
00050 O02AD 6002 SACL X2 " load x2 coordinate value 
00051 O2AE 2023 LAC YA 
00052 O2AF 6001 SACL Yl * load yl coordinate value 
00053 0280 002E ADD ONE6 
D0054 0281 6023 SACL YA "yl =yl +i 
00055 02B2 2025 LAC YB 


IOOPESILL PUR OZOZESWLL O41 BuIsA UONeSUIZ|duy sorydeInH YW *€Z 


GRAPHIC 32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 15:17:30 12-05-85 GRAPHIC 32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 15:17:30 12-05-85 


TMS32020 - THS34061 DEMONSTRATION PAGE 0017 T™4S32020 - TMS34061 DEMONSTRATION PAGE 0018 
00056 0283 6003 SACL Y2 * load y2 coordinate value 00107 O2EA 2025 LAC YB 
DO0057 02B4 0043 AOD THREES 00108 O2EB 6003 SACL Y2 * load y2 coordinate value 
00058 0285 6025 SACL YB ® y2 = y2 + 3 00109 O2EC 1043 SUB THREES 
00059 0286 2031 LAC SAVCLR D0110 O2ED 6025 SACL YB * y2 = y2 - 3 
00060 0287 6004 SACL COLOR * load a color 00111 O2EE 2031 LAC SAVCLR 
00061 0288 102E SUB ONE6 * color = (coior - 1) & 7 00112 O2EF 6004 SACL COLOR * joad a color 
00062 0289 F180 BGZ OIL2A 00113 O2FO 102E SUB ONE6 * color = (color ~ 1) & 7 
02BA 02BC 00114 O2F1 F180 BGZ DIL4A 
00063 0288 CA07 LACK 7 O2F2 O2F 4 
D0064 02B6C 6031 DIL2A SACL SAVCLR ®* {ff color = 0 then color = 7 00115 O2F3 CA07 LACK 7 
00065 0280 FESO CALL LINE * plot the lfne 00116 O2F4 6031 DIL4A SACL SAVCLR * {f color = 0 then color = 7 
O2BE 0524 00117 O2FS FESO CALL LINE * plot the line 
00066 O026F 302F LAR ARO, SAVARO O2F6 0524 
D0067 02CO 5588 LARP ARO 00118 O2F7 302F LAR ARO, SAVARO 
00068 02C1 FB90 BANZ DIL2 00119 O2F8 5588 LARP ARO 
02C2 02A9 00120 02F9 FB90 BANZ O1L4 
00069 * O2FA O02E1 
00070 02C3 COEE LARK ARO, 238 00121 O2FB CE26 RET 
DOO7T! 02C4 702F ODIL3 SAR ARO, SAVARO 00122 « 
00072 02C5 2022 LAC XA 00123 ad LINE DRAWING DEMONSTRATION #2 
00073 02C6 6000 SACL x1 * load xi coordinate value 00124 * 
00074 02C7 102E SUB ONE6 00125 O2FC C806 OEMO2 LOPK 6 
00075 02C8 6022 SACL XA *xl=sxi-! 00126 O2FO CA00 ZAC 
00076 02C9 2024 LAC XB 00127 O2FE 6022 SACL XA *xl = 0 
00077 O2CA 6002 SACL X2 * load x2 coordinate value 00128 O2FF 6023 SACL YA *yl = 0 
00078 02CB 1043 SUB THREE6 00129 0300 2037 LAC XMAX 
00079 02CC 6024 SACL XB * x2 2 x2 - 3 00130 0301 002E ADD ONE6 
00080 02CD 2023 LAC YA 00131 0302 6024 SACL XB * x2 = xmax + | 
DO0081 O2CE 600! SACL Y!1 “ load yl coordinate value 00132 0303 2039 LAC YMAX 
DO0082 O2CF 2025 LAC Y8 00133 0304 002E AOD ONES 
DO00B3 0200 6003 SACL ¥2 * load y2 coordinate value 00134 0305 6025 SACL YB * y2 = ymax + J 
00084 0201 2031 LAC SAVCLR 00135 0306 202E LAC ONE6 
DO085 0202 6004 SACL COLOR * load d color 00136 0307 6031 SACL SAVCLR * color = 1 
00086 0203 002E ADD ONES 00137 7 
DO087 0204 6031 SACL SAVCLR * color = (color + 1) & 7 00138 0308 CO4F LARK ARO, 79 
00088 0205 132E€ SUB ONE6, 3 00139 0309 7O02F O2L! SAR ARO, SAVARO 
00089 0206 F380 BLZ DIL3A 00140 030A 2022 LAC XA 
0207 O2DA 00141 030B 6000 SACL X1 * load x! coordinate value 
00090 0208 202E LAC ONE6 00142 030C 0045 ADD NINE6 
D009! 0209 6031 SACL SAVCLR *® {ff color = 0 then color = | 00143 0300 6022 SACL XA "xl =xlb+9 
00092 O2DA FE8O ODIL3A CALL LINE * plot the lfne 00144 O030E 2024 LAC XB 
020B 0524 00145 O30F 6002 SACL X2 * load x2 coordinate value 
00093 020C 302F LAR ARO, SAVARO 00146 0310 1045 SUB NINE6 
00094 0200 5588 LARP ARO D0147 0311 6024 SACL XB "x2 = x2 - 9 
00095 O2DE F890 BANZ OIL3 00148 0312 2023 LAC YA 
O2D0F 02C4 00149 0313 600) SACL Y! " load y! coordinate value 
00096 * 00150 0314 2025 LAC YB 
00097 02E0 C062 LARK ARO, 98 00151 0315 6003 SACL Y2 * load y2 coordinate value 
00098 O2E1 702F DIL4 SAR ARO, SAVARO 00152 0316 2031 LAC SAVCLR 
00099 02E2 2022 LAC XA 00153 0317 6004 SACL COLOR * load a color 
00100 02E3 6000 SACL Xl * load xl coordinate value 00154 0318 002E ADD ONES 
00101 02E4 2024 LAC XB 00155 0319 6031 SACL SAVCLR * color = (color + 1) & 7 
00102 02E5 6002 SACL X2 * load x2 coordinate value 00156 O31A 132E suB ONE6,3 
00103 O2E6 2023 LAC YA DO157 031B F380 BLZ O2L1A 
DO104 02E7 6001 SACL Yl * load yl coordinate value O31C O3IF 
00105 O2E8 102€ SUB ONE6 00158 0310 202E LAC ONE6 


00106 02E9 6023 SACL YA *ylezyl- i! DO159 O31E 6031 SACL SAVCLR * if color = 0 then color = ! 
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D0160 


00161 
00162 
00163 


00164 
00165 
00166 
00167 
D0168 
00169 
00170 
00171 
D0172 
00173 
DO174 
00175 
00176 
00177 
00178 
00179 
D0180 
00181 
00182 
00183 
DO0184 


00185 
00186 
00187 


00188 
00189 
00190 


00191 
00192 
D0193 
00194 
00195 
DO0196 


00197 
00198 
00199 
00200 
00201 
D0202 
D0203 
D0204 
00205 
00206 
00207 
00208 
00209 
D0210 


O31F 
0320 
0321 
0322 
0323 
0324 


0325 
0326 
0327 
0328 
0329 
032A 
0328 
032C 
0320 
032E 
032F 
0330 
0331 
0332 
0333 
0334 
0335 
0336 
0337 
0338 
0339 
033A 
0338 
033C 
0330 
033E 
033F. 
0340 
0341 
0342 


0343 
0344 
0345 
0346 
0347 
0348 
0349 
034A 
034B 
034C 
0340 


034E 
034F 
0350 
0351 
0352 
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FE80 
0524 
302F 
5588 
FB90 
0309 


C038 
702F 
2022 
6000 
2024 
6002 


D2L1A 


D2L2 


D2L2A 


DEMO3 


D311 


CALL 


LAR 
LARP 
BANZ 


LINE 


LOPK 
LALK 


SACL 
SACL 
ZAC 

SACL 
LACK 
SACL 
LACK 
SACL 


LARK 
SAR 
LAC 
SACL 
LAC 


LINE * 
ARO, SAVARO 
ARO 

O2L1 
ARO,59 
ARO, SAVARO 
XA 

Xl . 
XB 

X2 hs 
YA 

Yl * 
FIVE6 

YA . 
YB 

Y2 . 
FIVE6 

Y8 . 
SAVCLR 
COLOR . 
ONE6 
SAVCLR * 
ONE6 , 3 
D2L2A 
ONE6 
SAVCLR * 
LINE * 
ARO, SAVARO 
ARO 

D2L2 


6 

360 

XA 

XB ” 
YA * 
150 

YB . 
1 

SAVCLR * 
ARO, 29 
ARO, SAVARO 
XA 

Xl ™ 
XB 
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plot the line 


load xl coordinate value 
load x2 coordinate value 
toad yl coordinate value 
yl = yl + 5 
load y2 coordinate value 
y2 = y2-5 
load a color 


color = (color + 1) & 7 


{f color = 0 then color = 1 
plot the tine 


DRAWING DEMONSTRATION #3 


xl = 360 
x2 = 360 
yl = 0 

y2 = 150 
color = | 


toad x! coordinate value 
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00211 0353 
00212 0354 
00213 0355 
00214 0356 
00215 0357 
00216 0358 
00217 0359 
00218 035A 
00219 035B 
00220 035C 
00221 
00222 035E 
00223 035F 
00224 0360 
00225 0361 


00226 0363 
00227 0364 
00228 0365 


00229 0367 
D0230 0368 
00231 


00232 

00233 0368 
00234 036C 
00235 0360 
00236 036E 
00237 036F 
00238 0370 
00239 0371 
00240 0372 
00241 
00242 0374 
00243 0375 
00244 0376 
00245 0377 
00246 0378 
00247 0379 
00248 037A 
00249 0378 
DO250 037C 


00251 
00252 037F 
00253 0380 


00254 0382 
D0255 0383 
DO286 0384 


D0257 
00258 0386 
00259 0387 
00260 0388 
D0261 
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TMS34061 DEMONSTRATION 


6002 
0243 
6024 
2023 
6001 
0044 
6023 
2025 
6003 
2031 
6004 
002E 
6031 
132E 
F380 
0365 
202E 
6031 
FE80 
0524 
302F 
5588 
FB90 
034F 


Co10 
702F 
2022 
6000 
2024 
6002 
1243 
6024 
2023 
6001 
0044 
6023 
2025 
6003 
2031 
6004 
102E 
F180 
037F 
CA07 
6031 
FE80 
0524 
302F 
5588 
FB90 
036C 


CO1D 
702F 
2022 
6000 


D3L1A 


D3L2 


D3L2A 


D3L3 


SACL 
ADD 


PC 1.0 85.157 15:17:30 12-05-85 
PAGE 0020 
X2 * load x2 coordinate value 
THREE6, 2 
XB * x2 = x2 + 12 
YA 
Yl ° * load yl coordinate value 
FIVE6 
YA *yl = yl + 5 
YB 
Y2 * load y2 coordinate value 
SAVCLR 
COLOR * load a color 
ONE6 
SAVCLR * color = (color + 1!) & 7 
ONE6,3 
D3L1A 
ONE6 
SAVCLR * if color = 0 then color = 1 
LINE * plot the line 
ARO, SAVARO 
ARO 
D3L1 
ARO, 29 
ARO, SAVARO 
XA 
X1 * load xl coordinate value 
XB 
X2 * load x2 coordinate value 
THREE6,2 
XB * x2 = x2 - 12 
YA 
Y! * load yl coordinate value 
FIVE6 
YA * yl = yl +5 
YB 
Y2 * load y2 coordinate value 
SAVCLR 
COLOR * load a color 
ONE6 * color = (color - 1) & 7 
D3L2A 
7 
SAVCLR * if color = 0 then color = 7 
LINE * plot the line 
ARO, SAVARO 
ARO 
D3L2 
ARO,29 
ARO, SAVARO 
XA 
XI * load xl coordinate value 
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T™MS32020 - TMS34061 DEMONSTRATION PAGE 0021 ™™S32020 - TMS34061 DEMONSTRATION PAGE 0022 
00262 038A 2024 LAC XB 00313 O38F C806 DEMO4 LOPK 6 
D0263 038B 6002 SACL X2 * load x2 coordinate value 00314 03CO CA00 ZAC 
00264 038C 1243 SUB THREE6,2 00315 03C! 6022 SACL XA * xi = 0 
00265 0380 6024 SACL XB "x2 = x2 - 12 00316 03C2 6024 SACL XB * x2 = 0 
D0266 038E 2023 LAC YA 00317 03C3 6025 SACL YB * y2 = 0 
00267 O038F 6001 SACL Y! * load yl coordinate value 00318 03C4 CA96 LACK 150 
D0268 0390 1044 suB FIVE6 DO0319 03C5 6023 SACL YA * yl = 150 
DO0269 0391 6023 SACL YA *yl = yl - 5 00320 03C6 CAO! LACK | 
DO0270 0392 2025 LAC YB 0032! 03C7 6031 SACL SAVCLR * color = | 
D027! 0393 6003 SACL Y2 * load y2 coordinate value DO322 03C8 COIE LARK ARO ,30 
00272 0394 2031 LAC SAVCLR 00323 03C9 702F OD4L1 SAR ARO, SAVARO 
00273 0395 6004 SACL COLOR * load a color D0324 03CA 2022 LAC XA 
00274 0396 002E AOD ONE6 00325 03CB 6000 SACL X1 * load xl coordinate value 
00275 0397 603) SACL SAVCLR * color = (color + 1) &7 00326 03CC 2024 LAC XB 
00276 0398 132E SUB ONE6 ,3 00327 03CD 6002 SACL X2 * load x2 coordinate value 
00277 0399 F380 BLZ O03L3A 00328 O3CE 0243 ADO THREEG,2 
039A 0390 00329 O3CF 6024 SACL XB * x2 = x2 + 12 
00278 0398 202E LAC ONE6 DO0330 0300 2023 LAC YA 
00279 039C 6031 SACL SAVCLR * if colot = 0 then color = 1 00331 030! 600! SACL Yl * load y! coordinate value 
00280 039D FE80 O03L3A CALL LINE * plot the line 00332 03D2 1044 SUB FIVE6 
039E 0524 00333 0303 6023 SACL YA *yl=yl - 5 
00281 039F 302F LAR ARO, SAVARO 00334 0304 2025 LAC YB 
00282 03A0 5588 LARP ARO 00335 0305 6003 SACL Y2 * load y2 coordinate value 
00283 O3Al FB90 BANZ 03L3 00336 0306 2031 LAC SAVCLR 
03A2 0387 00337 0307 6004 SACL COLOR * load a color 
00284 * 00338 03D8 002E ADD ONE6 
DO285 03A3 C010 LARK ARO, 29 00339 0309 6031 SACL SAVCLR * color = (color + 1) &7 
00286 03A4 702F 03L4 SAR ARO, SAVARO 00340 O3DA 132E SUB ONE6,3 
00287 03A5 2022 LAC XA 00341 0308 F380 BLZ D4L1A 
DO288 03A6 6000 SACL Xl * load xl coordinate value 030C 03DF 
00289 03A7 2024 LAC XB 00342 0300 202E LAC ONE6 
DO290 03A8 6002 SACL X2 * load x2 coordinate value 00343 O3DE 6031 SACL SAVCLR * if color = 0 then color = 1 
00291 03A9 0243 ADD THREE6,2 00344 O3DF FE80 D4LIA CALL LINE * plot the line 
D0292 O3AA 6024 SACL XB "x2 = x2 + 12 03E0 0524 
00293 03AB8 2023 LAC YA 00345 O3€! 302F LAR ARO, SAVARO 
00294 O3AC 6001 SACL Y! * load yl coordinate value D0346 03E2 5588 LARP ARO 
00295 O3AD 1044 SUB FIVE6 00347 03E3 FB90 BANZ D4L1 
00296 O3AE 6023 SACL YA *yl = yl - 5 03E4 03C9 
00297 O3AF 2025 LAC YB 00348 * 
00298 0380 6003 SACL Y2 * load y2 coordinate value 00349 03E5 2037 LAC XMAX 
00299 038: 2031 LAC SAVCLR 00350 03E6 6022 SACL XA * x1 = xmax 
DO0300 03B2 6004 SACL COLOR * load a color 0035! 03E€7 000! LALK 359 
DO0301 0383 102E SUB ONE6 * color = (color - 1) &7 O3E8 0167 
DO0302 0384 FI80 BGZ D03L4A 00352 03€9 6024 SACL XB * x2 = 359 
0385 03B7 00353 O3E£A CA00 ZAC 
D0303 0386 CA07 LACK 7 00354 03EB 6023 SACL YA *yl = 0 
00304 0387 6031 O3L4A SACL SAVCLR * if color = 0 then color = 7 D0355 O3EC 6025 SACL YB *" y2 = 0 
D0305 0388 FE80 CALL LINE * plot the line 00356 03€D0 COIE LARK ARO, 30 
03B9 0524 DO357 O3EE 7O2F D4ti2 SAR ARO, SAVARO 
00306 03BA 302F LAR ARO, SAVARO 00358 O3EF 2022 LAC XA 
00307 0388 5588 LARP ARO D0359 O3FO 6000 SACL xX! * load xl coordinate value 
00308 03B8C FB90 BANZ D03L4 D0360 03F1 2024 LAC XB 
03BD 03A4 00361 03F2 6002 SACL X2 * load x2 coordinate value 
00309 O3BE CE26 RET 00362 03F3 0243 ADD THREE6,2 
00310 7 D0363 03F4 6024 SACL XB * x2 = x2 + 12 
00311 * LINE ORAWING DEMONSTRATION #4 D0364 03F5 2023 LAC YA 
00312 7 D0365 03F6 6001 SACL Yl * toad yl coordinate value 
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™S32020 — TMS34061 DEMONSTRATION PAGE 0023 ™S32020 - TMS34061 DEMONSTRATION PAGE 0024 

00366 03F7 0044 ADO FIVE6 D0417 042E D001 LALK 360 

D0367 03F8 6023 SACL YA "yl = yl +5 042F 0168 

00368 03F9 2025 LAC YB 00418 0430 6024 SACL XB * x2 = 360 

D0369 03FA 6003 SACL Y2 * load y2 coordinate value D0419 0431 2039 LAC YMAX 

D0370 03FB 2031 LAC SAVCLR 00420 0432 6023 SACL YA * yl = ymax 

00371 O3FC 6004 SACL COLOR * load a color D042! 0433 6025 SACL YB * y2 = ymax 

00372 03FD 102E SUB ONE6 * color = (color - 1) &7 00422 0434 COIE LARK ARO, 30 

00373 O3FE F180 BGZ D4L2A 00423 0435 702F D4L4 SAR ARO, SAVARO 
O3FF 0401 00424 0436 2022 LAC XA 

00374 0400 CA07 LACK 7 00425 0437 6000 SACL Xl * load xl coordinate value 

00375 0401 6031 O4L2A SACL SAVCLR * if color = 0 then color = 7 00426 0438 2024 LAC XB 

00376 0402 FE80 CALL LINE * plot the line 00427 0439 6002 SACL X2 * load x2 coordinate value 
0403 0524 D0428 043A 1243 SUB THREE6, 2 

00377 0404 302F LAR ARO, SAVARO D0429 043B 6024 SACL XB * x2 = x2 - 12 

00378 0405 5588 LARP ARO 00430 043C 2023 LAC YA 

00379 0406 F890 BANZ D4L2 00431 043D 6001 SACL Y1 * load yl coordinate value 
0407 O3EE 00432 043€ 1044 SUB FIVE6 

00380 be 00433 043F 6023 SACL YA *"yl =yl - 5 

00381 0408 2039 LAC YMAX 00434 0440 2025 LAC YB 

00382 0409 6025 SACL YB * y2 = ymax 00435 0441 6003 SACL Y2 * load y2 coordinate value 

00383 040A CA95 LACK 149 00436 0442 2031 LAC SAVCLR 

00384 0408 6023 SACL YA *yl = 149 00437 0443 6004 SACL COLOR * load a color 

00385 040C 2037 LAC XMAX 00438 0444 102E SuB ONE6 * color = (color - 1) &7 

00386 0400 6022 SACL XA "xl = «max 00439 0445 F180 BGZ D4L4A 

00387 040E 6024 SACL XB * x2 = xmax 0446 0448 

00388 O40F COIE LARK ARO, 30 00440 0447 CA07 LACK 7 

00389 0410 702F O4L3 SAR ARO, SAVARO 00441 0448 6031 O4L4A SACL SAVCLR * if color = 0 then color = 7 

00390 0411 2022 LAC XA D0442 0449 FE8O CALL LINE * plot the line 

D0391 0412 6000 SACL Xl * load x1 coordinate value 044A 0524 

00392 0413 2024 LAC XB 00443 0448 302F LAR ARO, SAVARO 

00393 0414 6002 SACL X2 * load x2 coordinate value D0444 044C 5588 LARP ARO 

00394 0415 1243 SUB THREE6,2 00445 044D FB90 BANZ D4L4 

00395 0416 6024 SACL XB * x2 = x2 - 12 O44E 0435 

D0396 0417 2023 LAC YA 00446 O44F CE26 RET 

00397 0418 6001 SACL Yl * load yl coordinate value 00447 " 

00398 0419 0044 ADO FIVE6 00448 : LINE DRAWING DEMONSTRATION #5 

D0399 041A 6023 SACL YA *yl = yl +5 00449 ® 

D0400 041B 2025 LAC YB 00450 0450 C806 DEMOS  LDPK 6 

00401 041C 6003 SACL Y2 * load y2 coordinate value D045! 0451 CA00 ZAC 

00402 041D 2031 LAC SAVCLR 00452 0452 6022 SACL XA * xl = 0 

00403 O41E 6004 SACL COLOR * load a color 00453 0453 6023 SACL YA *yl = 0 

00404 O41F O002E AOD ONE6 00454 0454 D001 LALK 360 

00405 0420 6031 SACL SAVCLR * color = (color + 1) & 7 0455 0168 

00406 0421 132E SUB ONEG,3 00455 0456 6024 SACL XB * x2 = 360 

00407 0422 F380 BLZ D4L3A D0456 0457 CA96 LACK 150 
0423 0426 00457 0458 6025 SACL YB * y2 = 150 

00408 0424 202E LAC ONE6 00458 0459 CA01 LACK | 

00409 0425 6031 SACL SAVCLR * {ff color = 0 then color = | 00459 045A 6031 SACL SAVCLR * color = 1 

00410 0426 FE80 OD4L3A CALL LINE * plot the line 00460 £ 
0427 0524 00461 0458 COID LARK ARO,29 

00411 0428 302F LAR ARO, SAVARO 00462 045C 702F OD5LI1 SAR ARO, SAVARO 

D0412 0429 5588 LARP APO 00463 045D 2022 LAC XA 

00413 042A FB90 BANZ 04L3 00464 045E 6000 SACL xX] * load xl coordinate value 
042B 0410 00465 O045F 0243 AOD THREE6, 2 

00414 * DO0466 0460 6022 SACL XA *“xl = xl + 12 

00415 042C CA00 ZAC 00467 0461 2024 LAC XB 

00416 0420 6022 SACL XA *xi = 0 00468 0462 6002 SACL X2 * load x2 coordinate value 
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00469 0463 0243 ADD THREE6,2 
00470 0464 6024 SACL XB * x2 = x2 + 12 
00471 0465 2023 LAC YA 
00472 0466 600! SACL Yl * load y! coordinate value 
00473 0467 0044 ADD FIVE6 
00474 0468 6023 SACL YA *"yl = yl +5 
00475 0469 2025 LAC YB 
00476 046A 6003 SACL Y2 * load y2 coordinate value 
00477 0466 1044 SUB FIVE6 
00478 046C 6025 SACL YB * y2 = y2 - 5 
00479 0460 2031 LAC SAVCLR 
00480 046E 6004 SACL COLOR * load a color 
00481 046F 002E ADD ONE6 
00482 0470 603! SACL SAVCLR * color = (color + 1) & 7 
00483 0471 132E SUB ONE6,3 
00484 0472 F380 BLZ D5LIA 
0473 0476 
00485 0474 202E LAC ONE6 
00486 0475 6031 SACL SAVCLR * if color = 0 then color = 1 
00487 0476 FE8O0 OSLIA CALL LINE * plot the line 
0477 0524 
00488 0478 302F LAR ARO, SAVARO 
00489 0479 5588 LARP ARO 
D0490 047A F890 BANZ O5L1 
047B 045C 
00491 * 
D0492 047C COID LARK ARO, 29 
00493 047D 702F O5L2 SAR ARO, SAVARO 
00494 O047€ 2022 LAC XA 
D0495 047F 6000 SACL xX] * load x! coordinate value 
00496 0480 0243 ADD THREE6,2 
00497 048! 6022 SACL XA * xi = xl + 12 
00498 0482 2024 LAC XB 
00499 0483 6002 SACL X2 * load x2 coordinate value 
00500 0484 1243 SuB THREE6,2 
0050! 0485 6024 SACL XB * x2 = x2 - 12 
00502 0486 2023 LAC YA 
00503 0487 6001 SACL Y! * load yl coordinate value 
00504 0488 0044 ADO FIVE6 
00505 0489 6023 SACL YA *yl =yl +5 
00506 048A 2025 LAC YB 
00507 048B 6003 SACL Y2 * load y2 coordinate value 
D0508 048C 0044 AOD FIVE6 
00509 048D 6025 SACL YB *"y2 = y2 +5 
00510 048E 2031 LAC SAVCLR 
00511 O48F 6004 SACL COLOR * toad a color 
00512 0490 102E SUB ONE6 * color = (color - 1) & 7 
00513 0491 F180 BGZ D5L2A 
0492 0494 
00514 0493 CA07 LACK 7 
00515 0494 603! O5L2A SACL SAVCLR * if color = 0 then color = 7 
00516 0495 FE80 CALL LINE * plot the line 
0496 0524 
00517 0497 302F LAR ARO, SAVARO 
00518 0498 5588 LARP ARO 
D0519 0499 FB90 BANZ O05L2 
049A 0470 
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00520 

D0S21 0498 
D0522 049C 
00523 
00524 049E 
00525 049F 
D0526 04A0 
00527 04A1 
00528 04A2 
00529 04A3 
D0530 04A4 
00531 
D0532 04A6 
00533 04A7 
00534 04A8 
00535 
DO536 O4AA 
00537 04A8B 
DO0538 O4AC 
00539 04AD 
D0540 04AE 
00541 
00542 0480 
00543 0481 
00544 0482 


00545 0484 
00546 04B5 
00547 0486 


D0548 0488 
00549 04B9 
00550 04BA 


00551 

00552 048C 
00553 04BD 
D0554 04BE 
00555 O4BF 
00556 04C0 
00557 04C) 
00558 04C2 
DO559 04C3 
00560 04C4 
00561 
00562 04C6 
D0563 04C7 
00564 04C8 
00565 04C9 
D0566 O4CA 
D0567 04CB 
00568 04CC 
D0569 04CD 
00570 O4CE 
D057! 
00572 0400 
00573 0401 
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* 


co10 LARK ARO,29 
702F O5L3 SAR ARO ,SAVARO 
2022 LAC XA 
6000 SACL X} . 
1243 SUB THREE6, 2 
6022 SACL XA 7 
2024 LAC XB 
6002 SACL X2 " 
1243 SUB THREE6, 2 
6024 SACL XB * 
2023 LAC YA 
600! SACL Y1 7 
1044 SUB FIVE6 
6023 SACL YA id 
2025 LAC YB 
6003 SACL Y2 : 
0044 ADD FIVE6 
6025 SACL YB * 
2031 LAC SAVCLR 
6004 SACL COLOR * 
002E ADO ONE6 
6031 SACL SAVCLR * 
132E SUB ONE6, 3 
F380 BLZ DSL3A 
04B6 
202E LAC ONE6 
6031 SACL SAVCLR * 
FE8O DSL3A CALL LINE . 
0524 
302F LAR ARO, SAVARO 
5588 LARP ARO 
FB90 BANZ O5L3 
049C 

* 
COID LARK ARO,29 
702F D5L4 SAR ARO, SAVARO 
2022 LAC XA 
6000 SACL XI ed 
1243 SUB THREE6, 2 
6022 SACL XA a} 
2024 LAC XB 
6002 SACL X2 . 
0243 AOD THREE6,2 
6024 SACL XB * 
2023 LAC YA 
6001 SACL Y * 
1044 SUB FIVE6 
6023 SACL YA = 
2025 LAC YB 
6003 SACL Y2 id 
1044 SUB FIVE6 
6025 SACL YB . 
2031 LAC SAVCLR 
6004 SACL COLOR 
102E SUB ONE6 ig 
F180 BGZ D5L4A 


157 15:17:30 


PAGE 0026 


coordinate value 
xl = xl - 12 
load x2 coordinate value 
x2 = x2 - 12 
coordinate value 
=o5 
load y2 coordinate value 
y2 = y2 - § 
load a color 


color = (color + 1) &7 


if color = 0 then color = | 
plot the line 


load xl coordinate value 
xt = xl - 12 
load x2 coordinate value 
x2 = x2 + 12 
load yl coordinate value 
yl = yl - 5 
load y2 coordinate value 


y2 = y2- 5 


* load a color 


color = (color - 1) & 7 
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00574 
00575 
00576 


00577 
00578 
D0579 


DO0580 
0005 


04D2 
0403 
0404 
0405 
0406 
0407 
0408 
0409 
04DA 
0408 
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TMS3406! DEMONSTRATION 


0404 
CA07 
6031 
FE80 
0524 
302F 
5588 
FB90 
04BD 
CE26 


D5L4A 


LACK 
SACL 
CALL 


LAR 
LARP 
BANZ 


RET 
COPY 


PAGE 0027 


7 
SAVCLR * if color = 0 then color = 7 
LINE * plot the line 


ARO, SAVARO 


ARO 
D5L4 


CLEAR. ASM 
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™™S32020 - 1TMS34061 DEMONSTRATION PAGE 0028 
E0002 * 

£0003 CARRERE EERE SEESE REESE SERS ERTS ERE ETE RRR REE ER EERE AES 
E0004 * ROUTINE: CLEAR 

£0005 * 

E0006 * This routine is called with the display color to be used 
£0007 * to clear the display screen fn the least significant byte 
E0008 * of the accumulator. The following approach is taken: 
E0009 . 

E0010 * 1. The present control register contents are read and 
E0011 * saved. 

E0012 * 

E0013 * 2. Control Register 2 is modified to allow eight pixels 
£0014 * to be written with each indirect access through the X-Y 
€0015 * register. 

E0016 ® 

E0017 * 3. The last row of the display memory (off screen) is 
£0018 * filled with the display information. Each write access 
E0019 * fills eight pixels. Each row of display memory 

E0020 * corresponds to two lines of CRT data. 

E0021 * 

£0022 * 4. The Vertical Start Blank register is read to determine 
E0023 * the size of the active display and to update the Vertical 
£0024 * Interrupt register to generate a vertical interrupt at the 
£0025 * next vertical sync. 

£0026 * ; 

E0027 * 5. The Status Register is read and tested until! the 
£0028 * vertical interrupt occurs. 

£0029 * 

E0030 * 6. Screen refresh and the display are disabled. 

£0031 . 

£0032 * 7. The VRAM shift registers are loaded from the last row 
E0033 * of VRAM memory. 

£0034 # 

E0035 * 8. The rows in VRAM memory corresponding to the active 
E0036 * display area are loaded with the data in the VRAM shift 
E0037 * registers. 

£0038 * 

£0039 * 9. Screen refresh and the display are re-enabled for 
E0040 * normal operation. 

E0041 TES RRERSERERS ERE RESESEAL ERE REE EER ES ERE RETREAT EES  Y 
E0042 : 

£0043 040C CLEAR EQU $ 

£0044 040C 7860 SST STSO * save current data page pointer 
€0045 0400 C809 LOPK >»0009 * VSC register page 

£0046 O4DE C263 LARK AR2,FILLC * internal mem on DP=0 
£0047 O40F 558A LARP AR2 

£0048 04E0 60A0 SACL *+ * save fill/clear color 

£0049 04E1 2030 LAC CTLRIL * save Control Register |} 

£0050 O04E2 60A0 SACL #4 

£0051 04E3 2038 LAC CTLRIH 

£0052 04E4 60A0 SACL "+ 

£0053 04E5 2040 LAC CTLR2L * save Control Register 2 

E0054 04E6 60A0 SACL #4 

E0055 0467 2048 LAC CTLR2H 

E0056 04E8 60A0 SACL #4 

E0057 04E9 CABC LACK >00BC * X-Y mode RAS override 

£0058 O4EA 6040 SACL CTLR2L * load Control! Register 2 
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E0059 
E0060 
£0061 
£0062 
E0063 
£0064 
£0065 
E0066 
E0067 
£0068 
E0069 


£0070 
E0071 
E0072 
£0073 
E0074 
E0075 
£0076 
E0077 
E0078 
E0079 


£0080 


E0081 
£0082 


£0083 
£0084 
E0085 


E0086 
E0087 
E0088 
E0089 
E0090 


£0091 
E0092 
£0093 


£0094 
£0095 
£0096 
£0097 
E0098 
E0099 
£0100 
E0101 
£0102 
£0103 
£0104 
E0105 
£0106 
£0107 
£0108 


04EB 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 15:17:30 12-05-85 GRAPHIC 
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CAFF 
6870 
6078 
C263 
2090 
C810 
CBFF 
6008 
C808 
2060 
D004 
OOFF 
0868 
OOA0 
6080 
C809 
6020 
2880 
6828 
2050 
2050 
0004 
0001 
F680 
O4FF 
2030 
005 
0020 
6030 
2048 
D004 
OODF 
6048 
2080 
CE19 
6089 
0100 
2BFC 
608A 
C004 
0100 
2000 
A4BAQ 
60E0 
60E0 
558A 
20A0 
6030 
20A0 
6038 
20A0 
6040 
20A0 
6048 
c80o 
5060 
CE26 


WAITI 


LACK 
SACH 
SACL 
LARK 
LAC 


>OOFF = * 
XYADRL* 
XYADRH 

AR2 ,FILLC 

+. 

0010s * 
255 

XINC 
>0008 ss * 
VSBLL* 
>OOFF 


VSBLH,8 
e+ * 


>0009 : 
VINTL . 


STATL * 


CTLRIL * 
>0020 


CTLRIL 
CTLR2H * 
>O00F 


CTLR2H 
#* 


*.0,1 * 
ARI, >2BFC 


*.0,2 * 
ARO,4 
AR! ,>2000 


*+, 1 
#O+ * 
*0+ 

AR2 

*+ 

CTLRIL * 
#+ 

CTLRIH 

*+ 

CTLR2L , 
*+ 

CTLR2H 

0 

STSO . 


row 255 of VRAM 0006 
load X-Y Address Register 


X-Y indirect control page 


* write 8 pixels 256 times: 2 lines 


VSC register page 
read Vertical Start Blank 


increment to one line below last 


VSC register page 
load Vertical Interrupt 


wait for vertical interrupt 


turn screen refresh off 


blank display 


load display line count 
divide by 2: update 2 lines/write 
save for repeat count 


load shift register from memory 


* load loop count for screen clear 


load memory from shift register 


load Control! Register 1 


load Control! Register 2 


restore data page pointer 


COPY 
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FO002 * 

FOQ003 HRHEHHHRAHHSHHKREHHHEKRKSHHAKLHKHFKAKKHKHHHKHEHKRKRHFHKRKHKKHGHHKHHHRHHE REE 
FO004 * ROUTINE: LINE 

FOO05 * 

FO006 * This routine is called with the following information 
F0007 * located in the 5 consecutive words on page 6 labeled as 
F0008 * X1, Yl, X2, Y2, COLOR. The following approach fs taken to 
FO009 * draw a line of the color specified from one of the 

FOo010 * specified points to the other following Bresenham’s Line 
F001) * Algorithm. 

F0012 * 

FO013 * octant \ #3 1 #2 / 

FOOI4 * definition \ \ / 

FOOI5 * #4\ | / #1 

FO016 7 \t/ 

FOOIT7 . I\ 

FOO018 . #5 / | \ #8 

Fo019 ‘ / H \ 

F0020 ® / #6 | #7 \ 

Fo021 . 

FO022 * 1. Determine the values for dx and dy, swapping the 
F0023 * points if necessary to assure that dy Is positive. 

F0024 . dx = x2 - xl 

F0025 * dy = y2 - yl 

FO0026 * This maps octants 5 through 8 onto 1 through 4 

F0027 * respectively and reduces the code correspondingly. Each 
F0028 * octant is coded seperately to simplify the decisions in 
F0029 * selecting between pixels. 

F0030 * 

FOO31 * d c c d 

F0032 * (@] 6) 0 ie) 

F0033 * ! / \ ' 

FO034 * OCTANTS 1 & 2 | / Ned OCTANTS 3 & 4 
FO0035 * H \} 

FO0036 * X------0 Q------ x 

F0037 * a b b a 

F0038 + 

F0039 * 2. Since the data bus fis eight bits wide and addresses 
F0040 * two pixels at a time, it is necessary to determine whether 
F004! * the first pixel of the line is the left or right pixel of 
F0042 * the pair (BFLAG). For each of the four octants there are 
F0043 * at least two cases depending on the location left/right of 
F0044 * the pixel that {is to be modified. 

F0045 

F0046 * OCTANT 1 OCTANT 2 

F0047 * 

F0048 # $-~--- +----~ + 4----- +----- + +----- +----- + +----- +----- + 
F0049 *1'007!00; !} 00 } 070} ' 0707} 00! } 0 07! 020 } 
FO050 * +----- +----- + +----- +----- + +----- +-~---- + 4----- +----- + 
FOO51 *'xX0?7! 00% ! 0X !} 0270 } '‘XO!o0o;};};Oo0x;iood!} 
F0052 # 4----- +----- + +----- +----- + +----- +----- + +----- +----- + 
F0053 * 

F0054 ba 

F0055 * OCTANT 3 OCTANT 4 

F0056 * 

FO0O57 # 4+----- +----- + +----- +----— + +----- +----- + +----- +----- + 
FO058 * | 02707! 00 | } O O07} 070 } {070 !00:! | 002! 00} 
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F0059 # 4---—- +----- + +----- +----- + Fenn n= +----- + +----- +----- + 
F0060 *“1ox!1oo!r!oo;xo} 'O7X !}! 00! $002! X00} 
F0061 # 4----- +----- + +----- $----- + 9 $—---= +----- + +----- +----- + 
F0062 * 
F0063 * 3. Octants ! and 2 are seperated from octants 3 and 4 by 
F0064 * determining whether dx is positive or negative (ASIGN). 
F0065 * 
F0066 * 4. Octant ! can be seperated from octant 2 by determining 
F0067 * whether the absolute value of dx is greater than dy 
F0068 * (octant |) or tess than dy (octant 2). Correspondingly 
F0069 * octant 3 is seperated from octant 4 by determining whether 
F0070 * the absolute value of dx is less than dy (octant 3) or 
F0071 * greater than dy (octant 4). (ABSIGN) 
F0072 * 
F0073 * §. Within the coding for each octant, processing 
F0074 * oscillates through two basic blocks (untque for each 
F0075 * octant). These blocks assume the pixel! to be modified is 
F0076 * the left or right pixel of a pixel pair and determine 
F0077 * where the next pixel in the line is located. 
F0078 * 
F0079 * 6. Two special cases of horizontal! and vertical lines are 
F0080 * isolated to speed up the processing for those cases. 
F * 
See IRR RRR RRERERERESER REESE ERE RSE RES ER EERE ERE SERS ERS ERE EERE REESE SE 
F0083 . 
F0084 0524 LINE EQU $ 
F0085 0524 7860 SST STSO * save current data page pointer 
F0086 0525 C806 LOPK 6 
FO087 0526 C008 LARK ARO,8 * set-up access to VSC registers 
F0088 0527 D100 LRLK AR1, VSCREG+>0080+XYOFFL 

0528 04£0 
F0089 0529 5589 LARP ARI 
F0090 052A 2004 LAC COLOR 
F009! 052B 600A SACL COLORL 
F0092 052C 6408 SACL COLORH, 4 
F0093 052D 0404 ADD COLOR, 4 
F0094 052E 600C SACL COLOR2 
FO095 052F 2003 DELTA LAC Y2 * dy = y2 - yl 
FO096 0530 1001 SUB Y1 
F0097 0531 F480 BGEZ POSY * test for positive dy 

0532 053D 
F0098 0533 4001 ZALH Y1 * negative dy : swap y2 and yl 
F0099 0534 4903 ADDS Y2 
FO100 0535 6803 SACH Y2 
F010! 0536 6001 SACL Yt 
FO102 0537 4000 ZALH Xl * swap x2 and xl 
F0103 0538 4902 ADDS X2 
F0104 0539 6802 SACH X2 
FO!05 053A 6000 SACL Xl 
FO106 0538 FF80 B DELTA 

053C 052F 
FO107 0530 6006 POSY “SACL DY * positive dy 
FO108 053E F680 BZ HL INE 

053F 066A 
F0109 0540 2002 LAC X2 * dx = x2 - xi 
FO110 0541 1000 SUB Xi 
FO111 0542 6005 SACL DX 


ps9 
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FO1I2 
F0113 
FO114 
FOII5 
FO1I6 


FO117 
FO118 
FO1I9 
F0120 
F012! 
F0122 
F0123 
FOl24 
FO0125 
F0126 
F0127 


F0128 


F0129 
F0130 
FO131 
F0132 
F0133 


F0134 
F0135 
F0136 
F0137 


F0138: 


F0139 
F0140 
F014] 
F0142 
F0143 
F0144 
FO145 


F0146 


FO147 
F0148 


FO149 
F0150 


FOI5I 
F0152 
F0153 
FO154 


FO155 


FO156 
FOI57 


0543 
0544 
0545 
0546 
0547 
0548 
0549 
054A 
0548 
054C 
0540 
054E 
O54F 
0550 
0551 

0552 
0553 
0554 
0555 
0556 
0557 
0558 
0559 
055A 
0558 
055C 
0550 
055E 
055F 
0560 
0561 

0562 
0563 
0564 
0565 
0566 
0567 
0568 
0569 
056A 
0568 
056C 
056D 
056€ 
OS6F 
0570 
0571 


0572 
0573 
0574 
0575 
0576 
0577 
0578 


32020 FAMILY MACRO ASSEMBLER 
TMS34061 DEMONSTRATION 


CA20 
60E0 
2F00 
6000 
DF 04 
0006 
68E0 
2000 
6800 
2701 
0000 
60E0 
2F01 
6880 
2005 
680E 
F680 
06A2 
F180 
0558 
CE23 
6005 
1006 
680F 
F480 
0561 
4005 
4906 
6806 
6005 
2106 
6008 
1005 
6007 
1005 
6009 
3005 
0100 
0800 
0200 
0820 
200F 
F380 
058C 
200E 
F380 
0610 


D300 
0828 
0400 
0808 
2000 
F380 
0592 


POSDX 


OTEST 


O10R4 


oct! 


LACK 
SACL 
LAC 

SACL 
ANDK 


SACH 
LAC 
SACH 
LAC 
ADD 
SACL 
LAC 
SACH 
LAC 
SACH 
BZ 
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PAGE 0033 


>0020 * load X-Y Offset Register (lo byte 
*Q+ 
X1,15 
BFLAG * bit flag for even/odd bit (0/1) 
>0006,15 * mask for next 2.1sb’s of xl 
*0+ * load X-Y Offset Register (hi byte 
x1,13 * seperate 7 msb’s of x! 
a 
Y1,7 * add isb of yl to x! 
X1 
*0+ * load low byte of X-Y Address Reg 
Y1,15 * seperate 8 msb’s of y! 
i * load high byte of X-Y Address Reg 
OX 
ASIGN * sign of dx (+/-) (0/-1) 
VL INE 
POSDX 
OX 
OY * \dx! - dy or a - b 
ABSIGN * sign of {dx} - dy (+/-) (0/-1) 
OTEST 
DX * negative idx} - dy : swap dx & dy 
DY or swap a and b 
OY 
OX 
OY,1 * dy corresponds to b 
INCR1 *" fncrl = 2b> 0 
Ox * dx corresponds to a 
D *d= 2b-a 
Ox 
INCR2 * incr2 = 2b - 2a < 0 
ARO,DX * ARO = # of pixels in line = a 
ARI ,XYIND+XYNOP 
AR2 ,XY IND+Y INC 
ABSIGN 
O20R3 
ASIGN 
OCT4 
A LINE IN OCTANT #]) 
AR3,XYIND+XIY1 
AR4, XY IND+X INC 
BFLAG 
OCTILO 


GRAPHIC 
T™MS32020 - 


FO158 
FO1S9 
F0160 
F016) 
FO162 
F0163 
F0164 
FO165 
FO166 


FO!67 
FO168 
F0169 


FO!70 
FOI7] 
FO0172 
FO173 
FOI74 
FOI75 
FO176 


FOI77 
F0178 
FOI79 
F0180 
FO181 
F0182 
F0183 


F0I84 


FO0185 
F0186 
F0187 
F0188 
F0189 
F0190 
FOI91 
FO192 
F0193 


FO194 
FO195 
F0196 
F0197 
FO198 
F0199 
F0200 


F0201 


F0202 
F0203 
F0204 
F0205 
F0206 


0579 
O57A 
0578 
057C 
057D 
OS7E 
OS7F 
0580 
0581 

0582 
0583 
0584 
0585 
0586 
0587 
0588 
0589 
058A 
058B 
058c 
0580 
058E 
O58F 
0590 
0591 


0592 
0593 
0594 
0595 
0596 
0597 
0598 
0599 
059A 
059B 
059C 
059D 
059E 
OS9F 
OSA0 
O5A1 
O5A2 
05A3 
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* 
€f +----- +----- + 
sf '007!003 
* OCTANT |! (LEFT PIXEL) +----- +----- + 
* 1X07} 003 
* +----- +----- + 
* 
2007 OCTIHI LAC 8) 
F480 BGEZ OLINC2 
0588 
0008 ADD INCR1 
5588 MAR *,0 
FB9B BANZ OCTIHL,*-,3 
05A9 
5589 MAR *,! 
6007 SACL 0) *d=d+ incrl 
CAOF LACK >O00F * write left pixel only 
4€80 AND * 
4008 OR COLORH 
6080 SACL * 
FF80 B DONE 
0667 
0009 OI1INC2 ADD INCR2 
6007 SACL D *d=d+ incr2 
CAOF LACK >OO0F * write left pixel only 
4E8A AND *,2 
4008 OR COLORH 
6088 SACL *,0,0 
FB99 BANZ OCTILO,*-,1 
0592 
FF80 8 DONE 
0667 
* 
* +----- +----- + 
* 'o00o0! 0703 
* OCTANT ! (RIGHT PIXEL) +----- +----- + 
* ; OX } 070 } 
. +----- +----- + 
# 
2007 OCTILO LAC D 
F380 BLZ OLINC) 
059F 
0009 ADD INCR2 
6007 SACL 0 *"“d=d + incr2 
CAFO LACK >00FO * write right pixel only 
4E8B AND *,3 
AD0A OR COLORL 
6088 SACL *,0,0 
FB99 BANZ OCTIHI,*-,1! 
0579 
FF80 B DONE 
0667 
0008 O1INC! ADD INCR1 
6007 SACL D *"d=d+ incr! 
CAFO LACK >O0F0 * write right pixel only 
4E8C AND *,4 
ADOA OR COLORL 
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F0207 05A4 6088 SACL *,0,0 FO251 0502 6007 SACL 0 * dg =d + incr2 
FO0208 05A5 FB99 BANZ OCTIHI,*-,1 FO0252 0503 FB99 BANZ OCT2L0,*-,1 
O5A6 0579 0504 0507 
F0209 OSA7 FFS8O B DONE F0253 0505 FF80 B DONE 
O5A8 0667 0506 0667 
FO210 O5A9 F480 OCTIHL BGEZ OIXTYI F0254 ® 
O5AA 0584 F0255 * +----- +----- + 
FO211 OSAB 0008 ADD INCR1 F0256 . + 0 02! 020 } 
FO212 O5AC 6007 SACL D *d=d+ incrl F0257 * OCTANT 2 (RIGHT PIXEL) +----- $----- + 
F0213 O5A0 200C LAC COLOR2 * write both pixels of the pair F0258 * 'ox;r1oo! 
FO0214 O5AE 558C MAR "14 F0259 ® +----- $----=- + 
FO215 O5AF 6088 SACL *,0,0 F0260 * 
F0216 05B0 FB99 BANZ OCTIHI, *-,1 F0261 0507 2007 OCT2LO LAC 0 
0581 0579 FO0262 0508 F480 BGEZ O2Xx1Y1 
FO217 05B2 FF8O 8 DONE 0509 05E4 
05B3 0667 F0263 050A 0008 O2x0Y1 AOD INCRI 
F0218 05B4 0009 OIXIYI ADD INCR2 F0264 0508 6007 SACL D *d=d+ incri 
FO219 05B5 6007 SACL D *d=d + iner2 FO0265 O50C CAFO LACK >00F0 * write right pixel only 
F0220 0586 200C LAC COLOR2 * write both pixels of the pair F0266 O5DD 4E8A AND *,2 
FO221 0587 6088 SACL *,0,0 F0267 O5DE 4D0A OR COLORL 
F0222 0588 F899 BANZ OCTIHI,*-,1 F0268 OSDF 6088 SACL *,0,0 
05B9 0579 FO0269 OS5E0 FB99 BANZ OCT2LO,*-,1 
F0223 OSBA FF80 B DONE 05E1 0507 
05BB 0667 F0270 O5E2 FF8O B DONE 
F0224 * 05E3 0667 
FO0225 O58C 200E O20R3 LAC ASIGN F0271 O5E4 0009 O2XIYI ADD INCR2 
FO0226 O5BD F380 BLZ OcT3 FO0272 O5E5 6007 SACL D "“d=d + iner2 
O5BE O5EE F0273 OS5SE6 CAFO LACK >00F 0 * write right pixel only 
F0227 : F0274 O5€7 4E8B AND *,3 
F0228 : DRAW A LINE IN OCTANT #2 F0275 OSE8 4D0A OR COLORL 
F0229 . FO276 O5€9 6088 SACL *,0,0 
F0230 OSBF D300 OCT2 LRLK AR3,XYINO+X1Y1 F0277 OSEA FB99 BANZ OCT2HI,*-,1 
05CO0 0828 05SEB 05C4 
FO231 05C1 2000 LAC BFLAG F0278 OSEC FF8O B DONE 
F0232 O05C2 F380 BLZ OCT2L0 OSED 0667 
05C3 0507 F0279 7 
F0233 * F0280 . DRAW A LINE IN OCTANT #3 
F0234 * $+--~-- +----- + F0281 . 
FQ235 * ; 0702} OO } F0282 OSEE 0300 OCT3 LRLK AR3,XYIND+XDYI 
F0236 * OCTANT 2 (LEFT PIXEL) +----- +----- + OSEF 0830 
F0237 ‘ ,'xO!oo! F0283 OSFO 2000 LAC BFLAG 
F0238 : +----- +----- + F0284 O5F1 F380 BLZ OCT3LO 
F0239 . O5F2 O60A 
F0240 05C4 CAOF OCT2HI LACK >O00F * write left pixel only F0285 . 
FO0241 OS5SC5 4E8A AND *,2 F0286 * +----- +----- + 
FO0242 O05C6 4008 OR COLORH F0287 is | O 027} 020 } 
F0243 05C7 6088 SACL *,0,0 F0288 * OCTANT3 (LEFT PIXEL) 4+----- +----- + 
F0244 05C8 2007 LAC 8) F0289 . ,O0;1xO$} 
F0245 05C9 F480 BGEZ O2INC2 F0290 * +----- +----- + 
O5CA 0501 F029) by 
F0246 05CB 0008 O2INCi ADD INCR1 F0292 05F3 2007 OCT3HI LAC 8) 
F0247 O5CC 6007 SACL rt) *"d=d+ incr! F0293 05F4 F480 BGEZ O3XDY | 
F0248 05CD FB99 BANZ OCT2HI,*-,1 O5F5 0600 
OSCE 05C4 F0294 05F6 0008 O3X0Y!1 ADD INCRI 
FO0249 OSCF FF8O 3) OONE F0295 O5F7 6007 SACL D *"d=d+ incrl 
05D0 0667 F0296 OSF8 CAOF LACK >O000F * write left pixel only 


FO250 0501 0009 O2INC2 ADD INCR2 F0297 OSF9 4EBA AND *,2 


999 
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F0298 O5FA 4008 OR COLORH F0343 * +----- +----- + 
F0299 O5FB 6088 SACL *,0,0 F0344 ; 
F0300 OSFC FB99 BANZ OCT3HI,*-,1 F0345 0624 2007 OCT4HI LAC D 
O5FD 05F3 F0346 0625 F480 BGEZ O4XDY I 
FO301 OSFE FF8O B DONE 0626 0631 
OSFF 0667 F0347 0627 0008 O4xDYO ADD INCR1 
F0302 0600 0009 O3XDYI ADD INCR2 F0348 0628 6007 SACL D *d=d + inerl 
F0303 0601 6007 SACL D "d=d + incr2 F0349 0629 CAOF LACK >O00F * write left pixel only 
F0304 0602 CAOF LACK >000F * write left pixel only F0350 062A 4E8C AND *,4 
F0305 0603 4E8B AND *,3 F0351 062B 4008 OR COLORH 
F0306 0604 400B OR COLORH F0352 062C 6088 SACL *,0,0 
F0307 0605 6088 SACL *,0,0 F0353 062D FB99 BANZ OCT4LO,*-,1 
F0308 0606 FB99 BANZ OCT3LO,*-,1 062E 063B 
0607 060A F0354 062F FF8O B DONE 
F0309 0608 FF80 B DONE 0630 0667 
0609 0667 F0355 0631 0009 O4xXDYI ADD INCR2 
F0310 * F0356 0632 6007 SACL i) *d=d + incr2 
FO311 7 +----- $----- + F0357 0633 CAOF LACK >O00F * write left pixel only 
FO312 * ' 07027!00! F0358 0634 4E8B AND *53 
F0313 * OCTANT 3 (RIGHT PIXEL) +----- $----- + F0359 0635 4D0B OR COLORH 
FO314 * 'Ox!100! F0360 0636 6088 SACL *,0,0 
FO315 * a feta + F0361 0637 FB99 BANZ OCT4LO,*-,1 
F0316 * 0638 0638 
FOQ317 O60A CAFO OCT3LO LACK >O0FO * write right pixel only F0362 0639 FF80 B DONE 
F0318 060B 4E8A AND *,2 063A 0667 
FO0319 060C 4D0A OR COLORL F0363 . 
F0320 0600 6088 SACL *,0,0 F0364 * $oo--- oe + 
F032! O60E 2007 LAC 0 F0365 . ; 070! 003 
F0322 O60F F480 BGEZ O3INC2 F0366 * OCTANT 4 (RIGHT PIXEL) $----- +----- + 
0610 0617 F0367 : + 02x 100} 
F0323 0611 0008 O3INCI ADD INCRI F0368 . $----- +----- + 
F0324 0612 6007 SACL i) *"d=d + incrl F0369 : 
F0325 0613 FB99 BANZ OCT3L0,*-,1 F0370 063B 2007 OCT4LO LAC D 
0614 060A FO371 063C F480 BGEZ O4INC2 
F0326 0615 FF80 B DONE 063D 064A 
0616 0667 F0372 063E 0008 ADD INCRI 
F0327 0617 0009 O3INC2 ADD INCR2 F0373 063F 5588 MAR *, 
F0328 0618 6007 SACL 9) *d=d + incr2 FO0374 0640 FB9C BANZ OCT4HL ,*-,4 
F0329 0619 FB99 BANZ OCT3HI,*-,1 0641 0654 
O61A 05F3 F0375 0642 5589 MAR et 
F0330 061B FF80 B DONE F0376 0643 6007 SACL D *"d=d + incrl 
061C 0667 F0377 0644 CAFO LACK >00F0 * write right pixel only 
F033! * F0378 0645 4E80 AND * 
F0332 * DRAW A LINE IN OCTANT #4 F0379 0646 4D0A OR COLORL 
F0333 * F0380 0647 6080 SACL . 
F0334 0610 0300 OCT4 LRLK AR3,XYIND+XDYI F0381 0648 FF8O B DONE 
O61E 0830 0649 0667 
F0335 O61F 0400 LRLK AR4,XYIND+XDEC F0382 064A 0009 O4INC2 ADD INCR2 
0620 0810 F0383 064B 6007 SACL D *d=d+ incr2 
F0336 0621 2000 LAC BFLAG F0384 064C CAFO LACK >00F0 * write right pixel only 
F0337 0622 F380 BLZ OCT4LO F0385 0640 4E8A AND woe 
0623 0638 F0386 064E 4D0A OR COLORL 
F0338 * F0387 064F 6088 SACL *,0,0 
F0339 * a ee + F0388 0650 FB99 BANZ OCT4HI,*-,1 
F0340 * ' 007/00} 0651 0624 
F0341 * OCTANT 4 (LEFT PIXEL) +----- —— + F0389 0652 FF8O B DONE 
F0342 * ; 0027; x0} 0653 0667 
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F0390 


F0391 
F0392 
F0393 
F0394 
F0395 
F0396 


F0397 


F0398 
F0399 
F0400 
F0401 
F0402 


F0403 


F0404 
F0405 
F0406 
F0407 
F0408 
F0409 
FO410 
FO411 
F0412 
F0413 
F0414 


FO415 
F0416 
FO417 
F0418 
F0419 
F0420 
F0421 
F0422 
F0423 


F0424 
F0425 
F0426 
F0427 
F0428 
F0429 
F0430 
F0431 
F0432 
F0433 


F0434 


F0435 
F0436 
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TMS3406!1 DEMONSTRATION PAGE 0039 
F380 OCT4HL BLZ O4INCI 
O65F 
0009 AOD INCR2 
6007 SACL D *d=d + incr2 
200C LAC COLOR2 * write both pixels of the pair 
5588 MAR *,3 
6088 SACL *,0,0 
063B 
FF80 B DONE 
0667 
0008 O4INCi ADD INCR1 
6007 SACL 9) *“d=d + incr} 
200C LAC COLOR2 * write both pixels of the pair 
6088 SACL *,0,0 
FB99 BANZ OCT4LO,*-,1 
0638 
FF80 B DONE 
0667 

# 
C800 OONE LDPK 0 
5060 LST STsSO * restore entry data page pointer 
CE26 RET 
# 
. DRAW A HORIZONTAL LINE 
* 
2002 HLINE LAC X2 * dx = x2 - xl 

1000 SUB Xl 
6005 SACL OX 
F480 BGEZ DXPOS 
0673 
CE23 NEG 
6005 SACL OX * save ‘dx! 

2002 LAC X2 * swap xl & x2 : draw left to right 
6000 SACL XI 

CA20 DXPOS'~ LACK >0020 * load X-Y Offset Register (lo byte 
60E0 SACL "0+ 

2F00 LAC X1,15 

600D SACL BFLAG * bit flag for even/odd bit (0/1) 
OF 04 ANDK >0006,15 * mask for next 2 Isb’s of x] 

0006 

68E0 SACH *0+ * load X-Y Offset Register (hi byte 
2D00 LAC X1,13 * seperate 7 msb’s of xl 

6800 SACH X1 

2701 LAC Y1,7 * add |Isb of yl to x! 

0000 ADD a| 

60E0 SACL *0+ * load low byte of X-Y Address Reg 
2F01 LAC Y1,15 * seperate 8 msb’s of yl 

6880 SACH 7 * load high byte of X-Y Address Reg 
3005 LAR ARO,DX * ARO = # of pixels in line 

0100 LRLK AR1,XYIND+XYNOP 

0800 

0200 LRLK AR2,XYIND+XINC 

0808 

2000 LAC BFLAG 

F380 BLZ HOZLO 

0699 
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F0437 
F0438 


F0439 
F0440 
F044} 
F0442 
F0443 
F0444 


F0445 
F0446 
F0447 


F0448 


F0449 
F0450 
F0451 
F0452 
F0453 
F0454 


F0455 


F0456 
F0457 
F0458 
F0459 
F0460 


F0461 


F0462 
F0463 
F0464 


F0465 
F0466 
F0467 
FO0468 
F0469 


F0470 
F0471 
F0472 
F0473 
F0474 
F0475 
F0476 


0007 
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5588 
FB9A 
0693 
5589 
CAOF 
4€80 
4D08 
6080 
FF8O 
0667 
200C 
6088 
FB98 
068A 
FF80 
0667 


HOZHI 


FILL2 


HOZLO 


VL INE 


VRTHI 


VRTLO 


LARP 
BANZ 


LARP 


LACK 
ANO 


DRAW 


LAR 
LRLK 


LRLK 


LARP 
LAC 


15:17:30 
PAGE 0040 


ARO 
FILL2,*-,2 


ARI 
>000F 


COLORH 
# 


* write left pixel only 


DONE 


COLOR2 
*,0,0 
HOZHI,*-,0 


* write both pixels of the pair 


DONE 


* write right pixel only 


A VERTICAL LINE 


ARO,DY * ARO = # of pixels in line 
AR1,XYIND+XYNOP 


AR2,XYIND+Y INC 
ARI 
BFLAG 
VRTLO 
>000F * write left pixel 
* 

VRTHI,*-,1 

DONE 
>O0FO * write right pixel 


VRTLO,*-, 1 


FELIPS2.ASM 


12-05-85 


8S9 
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™S32020 - TMS34061 DEMONSTRATION PAGE 0041 

G0002 . 

G0003 HHSHHKHKKHKKLHAKHKKHSKKTFHCHKRKKHKKKHKHKHHKKHHHRHEHHRHHHKHHERHEH ES 

G0004 * ROUTINE: FELIPS 

G0005 * This routine is called with the following information 

G0006 * located fn 5 words on page 6 labeled as XC, YC, A, B, 

G0007 * COLOR. Using this information, a solid filled ellipse of 

G0008 * the specified color is drawn centered at (xc,yc) with 

Go009 * x-axis intercept a and y-axis intercept b. A four-way 

GO010 * stepping algorithm fs used to determine the boundaries of 

GO011 * the ellipse. The x coordinate value of the first quadrant 

GO012 * end point of each horizontal line used to construct the 

G0013 * ellfpse are stored in consecutive locations on internal 

G0014 * data ram pages 4 and 5. 

G0015 - 

G0016 AAAS REE RRR ES CRS ERS ESSE SEES ESSE SE ESSER SOLSESS SY SY 

G0017 * 

G0018 O6BB FELIPS EQU $ 

G0019 0688 7860 SST STso * save current data page pointer 

G0020 O6BC C809 LDOPK >0009 *'VSC register page 

G0021 O6BD C264 LARK AR2,SAVREG * fnternal mem on OP=0 

G0022 O6BE 558A LARP AR2 

G0023 O6BF 2030 LAC CTLRIL * save Control Register |! 

G0024 06CO 60A0 SACL #4 

G0025 06C1 2038 LAC CTLRIH 

G0026 06C2 60A0 SACL He 

G0027 06C3 2040 LAC CTLR2L * save Control Register 2 

G0028 06C4 60A0 SACL #4 

G0029 06C5 2048 LAC CTLR2H 

G0030 06C6 60A0 SACL + 

G0031 06C7 C806 LOPK 6 

G0032 06C8 D100 LRLK AR1,>0200 * set-up access to data stack 
06C9 0200 

G0033 O6CA 5589 LARP ARI 

G0034 O6CB 2004 LAC COLOR 

G0035 O6CC 600A SACL COLORL 

G0036 06CD 6408 SACL COLORH, 4 

G0037 O6CE 0404 ADD COLOR, 4 

G0038 O6CF 600C SACL COLOR2 

G0039 * 

G0040 * CALCULATE CURVE DRAWING PARAMETERS 

G0041 7 

G0042 0600 3212 LAR AR2,A * first stack value 

G0043 06D! 3C12 LT A * square horizontal axis value A 

G0044 0602 3812 MPY A 

G0045 06D3 CEI4 PAC 

G0046 0604 6814 SACH TI1H * Tl = A**2 

G0047 0605 6015 SACL TIL 

G0048 0606 CE15 APAC 

G0049 0607 6816 SACH T2H * T2 = 2*T! 

G0050 0608 6017 SACL T2t 

G0051 0609 3C13 LT B * square vertical axis value 8 

G0052 O6DA 3813 MPY B 

G0053 06DB CE14 PAC 

G0054 06D0C 6818 SACH T3H * 73 = B**2 

G0055 060D 6019 SACL T3L 

GN056 O6DE CE15 APAC 

G0057 O6DF 681A SACH T4H * T4 = 2*T3 
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G0058 
G0059 
G0060 
G0061 
G0062 
G0063 
G0064 
G0065 
G0066 
G0067 
G0068 
G0069 
G0070 
G0071 
G0072 
G0073 
G0074 
G0075 
60076 
G0077 
G0078 
G0079 
G0080 
G0081 
G0082 
G0083 
G0084 
G0085 
G0086 
G0087 
G0088 


G0089 
G0090 
G0091 
G0092 
G0093 
G0094 
G0095 
G0096 
G0097 
G0098 
G0099 
GO100 
G0101 


G0102 


G0103 
G0104 
GO105 
G0106 
G0107 
G0108 
G0109 
GO110 
GOI! 


06E0 
06E 1 
06E2 
06E3 
06E4 
06E5 
O6E6 
06E7 
O06E8 
06E9 
O6EA 
O6EB 
O6EC 
06ED 
O6EE 
O6EF 
O6F 0 
O6F | 
O6F 2 
O6F3 
O6F 4 
O6F5 
O6F6 
O6F7 
O6F8 


O6F9 
O6FA 
06FB 
O6FC 
O6FD 
O6FE 
O6FF 
0700 
0701 
0702 
0703 
0704 
0705 
0706 
0707 
0708 
0709 
O70A 
0708 
070C 
0700 
070E 
070F 
0710 
0711 
0712 
0713 
0714 
0715 
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TMS34061 DEMONSTRATION PAGE 0042 
6018 SACL T4t 
3C12 LT A 
3818 MPY T3H 
CE14 PAC 
CBOF RPTK 15 
CE18 SFL 
3819 MPY T3L 
CES APAC 
681C SACH T5H * 75 = T3*A 
6010 SACL T5L 
691E SACH T6H, 1 * T6 = 2*T5 
611F SACL TéL,1 
CA00 ZAC 
6020 SACL T7H *T17=0 
6021 SACL TTL 
4014 ZALH TIH 
4915 ADDS TIL 
4818 ADOH T3H 
4919 ADDS T3L 
CE19 SFR 
CE19 SFR 
AAIC SUBH T5H 
4510 SUBS T5L 
6826 SACH OH * OD) = (Tl + 73)/4 - T5 
6027 SACL OL 
* 
* CALCULATE ELLIPSE BOUNDARY FOR ONE QUADRANT 
* a 
72A0 SAR AR2,*+ 
2026 LAC OH 
F480 BGEZ STEPX 
0718 
4020 STEPY ZALH T7H 
4921 ADDS TIL 
4816 ADOH T2H 
4917 ADDS T2L 
6820 SACH TT7H * 77 = 77 + T2 
6021 SACL TIL 
4826 AOODH OH 
4927 ADDS OL 
6826 SACH OH *“D=0+T7 
6027 SACL OL 
7T2A0 SAR AR2, *+ * save x-value on data stack 
2026 LAC OH 
F380 BLZ STEPY 
O6FD 
FF80 B STEPX 
071B 
AQIE ‘LOOPX ZALH T6H 
ASIF ADDS T6L 
AGIA SUBH T4H 
451B SUBS T4L 
681E SACH T6H * 76 = 16 - T4 
601F SACL T6L 
CE23 NEG 
4826 ADOH OH 
4927 ADDS OL 
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GO112 
GO113 
GO114 
GO115 


GO116 
GO117 


GO118 
GO119 
G0120 
GO121 
G0122 
G0123 


GO0124 
G0125 
G0126 
G0127 
G0128 


G0129 
G0130 
G0131 
G0132 


G0133 
G0134 
G0135 
G0136 
G0137 


G0138 
G0139 
G0140 
GO14! 
G0142 


G0143 
GO144 
G0145 
G0146 
GO147 
G0148 
GO149 
G0150 


GO151 
G0152 
G0153 
GO0154 


GO0155 
G0156 
GO157 
G0158 


0716 
0717 
0718 
0719 
O7IA 
0718 
o71C 
0710 
O7I1€E 
O71F 
0720 
0721 
0722 
0723 
0724 
0725 
0726 
0727 
0728 
0729 
072A 


072B 
072C 
0720 
072E 
072F 
0730 
0731 
0732 
0733 
0734 
0735 
0736 
0737 
0738 
0739 
073A 
0738 
073C 
0730 
073€ 
073F 
0740 
0741 
0742 
0743 
0744 
0745 
0746 
0747 
0748 
0749 
O74A 
0748 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 15:17:30 12-05-85 


TMS34061 DEMONSTRATION PAGE 0043 
6826 SACH OH *O0=0 - T6 
6027 SACL OL 
2026 LAC OH 
F380 BLZ STEPY 
O6FO 
558A STEPX LARP 2 " . 
FB99 BANZ LOOPX, *-,1 * continue until X at origin 
0700 
712C PLOT! SAR ARI, PIXCNT 
2013 LAC B 
6046 SACL SAVEB 
092E ADD ONE6,9 
102Cc SUB PIXCNT 
F380 BLZ PLOT2 * plotting stack = b ? 
0728 
602C SACL PIXCNT 
322C LAR AR2,PIXCNT 
CA00 ZAC 
60AA NILPT SACL *+,0,2 * zero remainder of stack 
FB99 BANZ NILPT,*-,1 
0728 
* 
# DRAW “SHORT” HORIZONTAL LINES FOR UPPER HALF 
i 
0400 PLOT2 ~ LRLK AR4, VSCREG+>0080+XYOF FL 
04€0 
558C LARP AR4 
CA20 LACK >0020 
6089 SACL *,0,1 * load X-Y Offset Register(lo byte) 
3213 LAR AR2,B * number of values on data stack 
0300 LRLK AR3 , XY 1ND+XYNOP 
0800 
5590 MAR *. 
7213 LOOP! SAR AR2,B 
2010 LAC XC 
1080 SUB . 
F480 BGEZ OSCRX 1 
O73A 
CA00 ZAC 
6000 OSCRX! SACL X1 * Xl = XC - X >= 0 
2010 LAC XC 
OG9C ADO *-~,0,4 
6002 SACL X2 * X2 = XC + X 
2002 LAC X2,13 * seperate 7 MSBs from 3 LSBs (X2) 
682A SACH X2H 
0004 ANDK >FFFF 
FFFF 
6C2B SACH X2L,4 
2000 LAC X1,13 * seperate 7 MSBs from 3 LSBs (X1) 
6828 SACH X1H 
0004 ANOK >FFFF 
FFFF 
6C29 SACH XIL,4 
202A LAC X2H 
1028 SUB X1H 
F580 BNZ WIDE * endpoints in same 8 pixel group ? 
0797 
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G0159 
G0160 
GO16! 


G0162 
G0163 
G0164 
GO165 


G0 166 
G0167 
G0168 
G0169 
G0170 
GO17! 
GO172 
G0173 
G0174 
G0175 
GO176 
GO177 
G0178 


G0179 
G0180 


G018! 
0182 
G0183 
G0184 
G0185 
G0186 


G0187 


G0188 
G0189 
GO0T90 
G0191 


G0192 
G0193 


G0194 
G0195 
G0196 
G0197 
G0198 
G0199 


G0200 


G0201 
G0202 
G0203 
G0204 


074C 
074D 
074E 
O07 4F 
0750 
0751 
0752 
0753 
0754 
0755 
0756 
0757 
0758 
0759 
075A 
0758 
075C 
0750 
075€ 
O75F 
0760 
0761 
0762 
0763 
0764 
0765 
0766 
0767 
0768 
0769 
076A 
0768 
076C 
0760 
O76E 
O76F 
0770 
0771 
0772 
0773 
0774 
0775 
0776 
0777 
0778 
0779 
O77A 
0778 
077C 
0770 
O77E 
O77F 
0780 


0781 
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TMS34061 DEMONSTRATION PAGE 0044 
2011 LAC Yc * *short" line 
1013 SUB B 
F480 BGEZ OSCRY |! 
0751 
CA00 ZAC 
6001 OSCRY! SACL Yt * YI 2 YC - Y > 0 
co08 LARK ARO,8 ° * set-up access to VSC registers 
0400 LRLK AR4, VSCREG+>0080+XYOF FH 
04E8 
558C LARP AR4 
2E29 LAC X1L,14 * load XY coordinates for upper lin 
68E0 SACH 0+ 
2701 LAC Y1,7 
0028 AOD X1H 
60E0 SACL “0+ 
2F 01 LAC Y1,15 
6888 SACH *,0,0 
2F 2B LAC X2L,15 
1F29 SUB X1L,15 
682C SACH PIXCNT * #$ pixels = X2 - Xl 
302C LAR ARO, PIXCNT 
0400 LRLK AR4, XYIND+XINC 
0808 
9129 BIT X1L,1 * start on an odd pixel ? 
F980 BBNZ SLHIU 
0776 
558B LARP AR3 
CAFO SLLOU- LACK >00F0 * pixel in Isnibble only 
4E8C AND *,4 
400A OR COLORL 
6088 SACL *,0,0 
FB90 BANZ SLHIU 
0776 
FF80 B DONE 1U 
077C 
558C SLHLU LARP AR4 
200C LAC COLOR2 * pixel pair 
6088 SACL *,0,0 
FB90 BANZ SLHIU 
0776 
FF80 8B DONE 1U 
077C 
FB9B SLHIU BANZ SLHLU, ®-,3 
O76F 
CAOF LACK >000F * pixel fn msnibble only 
4E80 AND * 
4008 OR COLORH 
6080 SACL * 
558A DONEIU LARP AR2 
F899 BANZ LOOP1,*-,1 
0734 
FF80 8 PLOT3 
07CD 
e 
= DRAW "LONG" HORIZONTAL LINES FOR UPPER HALF 
e 
7213  LOOP2 SAR AR2,8 
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TMS32020 ~ 1TMS3406! DEMONSTRATION PAGE 0045 T™MS32020 - TMS34061 DEMONSTRATION PAGE 0046 
G0205 0782 2010 LAC XC G0253 0788 6080 SACL 7 
G0206 0783 1080 SUB # G0254 O7BC 5588 RIGHT! LARP ARO 
G0207 0784 F480 BGEZ OSCRX2 G0255 07BD CA00 ZAC 
0785 0787 G0256 07BE 6080 SACL * * load Control Register 2 
G0208 0786 CA0O0 ZAC G0257 O7BF 0000 LRLK ARO, VSCREG+>0080+XYOFFH 
GO0209 0787 6000 OSCRX2 SACL XI * Xl = XC - X >= 0 07CO0 04€8 
G0210 0788 2010 LAC XC G0258 07C1 6080 SACL ai 
GO211 0789 009C ADD *-,0,4 G0259 07C2 D000 LRLK ARO, VSCREG+>0080+XYADRL 
GO212 078A 6002 SACL X2 *" X2 = XC + X 07C3 04FO 
G0213 078B 2002 LAC X2,13 * seperate 7 MSBs from 3 LSBs (X2) G0260 07C4 2080 LAC = 
GO214 078C 682A SACH X2H G0261 07C5 608C SACL ",0,4 
G0215 0780 D004 ANDK >FFFF G0262 07C6 000! LALK RSBASE 
O78E FFFF 07C7 0888 
GO2!6 O78F 6C2B SACH X2L,4 G0263 07C8 0028 ADD X2L 
G0217 0790 2000 LAC X1,13 * seperate 7 MSBs from 3 LSBs (X1) G0264 07C9 CE24 CALA * write pixels right of blocks 
GO2'8 0791 6828 SACH X1H G0265 O7CA 558A LARP AR2 
G0219 0792 0004 ANDK >FFFF G0266 07CB FB99 BANZ LOOP2,*~,1 
0793 FFFF 07CC 0781 
G0220 0794 6C29 SACH XIL,4 G0267 j 
GO0221 0795 202A LAC X2H G0268 DRAW "LONG" HORIZONTAL LINES FOR LOWER HALF 
GO222 0796 1028 SUB X1H G0269 : 
G0223 0797 112E WIDE SUB ONEG, 1 * "long” line G0270 07CD 0400 PLOT3 LRLK AR4, VSCREG+>0080+XYOF FL 
G0224 0798 6020 SACL PAXCNT * #8 pixel groups O7CE 04€0 
G0225 0799 2011 LAC Yc G0271 O7CF 558C LARP AR4 
G0226 079A 1013 SUB 8 G0272 0700 CA20 LACK >0020 
G0227 0798 F480 BGEZ OSCRY2 G0273 0701 6089 SACL *,0,1 * load X-Y Offset Register(lo byte) 
079C 079€ GO274 07D2 3246 LAR AR2,SAVEB * # of values on data stack 
G0228 079D CA00 ZAC G0275 07D3 0300 LRLK AR3,XYIND+XYNOP 
G0229 079E 6001 OSCRY2 SACL Y! * yl = vO - Y >= 0 0704 0800 
G0230 079F C008 LARK ARO,8 * set-up access to VSC registers G0276 0705 55A0 MAR + 
G023i O7A0 0400 LRLK AR4, VSCREG+>0080+XYOF FH G0277 0706 55A0 MAR 4 
O7Al 04E€8 G0278 0707 2011 LAC yc 
G0232 O7A2 558C LARP AR4 G0279 0708 0046 AOD SAVEB 
G0233 O7A3 2E29 LAC XIL,14 ®* load XY coordinates for upper lin G0280 0709 6011 SACL yc 
G0234 O7A4 68E0 SACH "0+ G028!1 O7DA FF8O B LOOPE 
G0235 O7A5 2701 LAC Y1,7 0708 0824 
G0236 O7A6 0028 ADO XIH GO282 070C 7213 LOOP3 SAR AR2,8 
G0237 O7A7 60E0 SACL *0+ G0283 0700 2010 LAC XC 
G0238 O7A8 2F0} LAC y1,15 G0284 07DE 1080 SUB * 
G0239 07A9 6888 SACH *,0,0 GO0285 O7DF F480 BGEZ OSCRX3 
G0240 O7AA 0000 LRLK ARO, VSCREG+>0080+CTLR2L O7EO O7E2 
O7AB 04CO G0286 07E1 CA00 ZAC 
G0241 O7AC 0400 LRLK AR4,XY IND+X INC G0287 O7E2 6000 OSCRX3 SACL xX} * XI = XC - X d= 0 
07A0D 0808 G0288 07E3 2010 LAC XC 
G0242 O7AE CA00 ZAC G0289 07E4 OOAC ADD *+,0,4 
G0243 O7AF 6088 SACL *,0,3 * toad Control Register 2 G0290 O7E5 6002 SACL X2 * X2 = XC + X 
G0244 0780 D001 LALK LSBASE G029!1 O7E6 2002 LAC X2,13 * seperate 7 MSBs from 3 LSBs (X2) 
0781 0878 G0292 O7E7 682A SACH X2H 
G0245 07B2 0029 AOD XIL G0293 O7E8 0004 ANDK >FFFF 
G0246 07B3 CE24 CALA * write pixels left of blocks O7ES FFFF 
G0247 07B4 CABC LACK >008C * X-Y mode RAS override G0294 O7EA 6C28 SACH X2L,4 
G0248 07B5 608C SACL *,0,4 * load Control! Register 2 G0295 07EB 2000 LAC X1,13 * seperate 7 MSBs from 3 LSBs (XI) 
G0249 0786 202D LAC PAXCNT G0296 O7EC 6828 SACH X1H 
G0250 07B7 F380 BLZ RIGHT G0297 O7ED 0004 ANDK >FFFF 
0788 07BC O7VEE FFFF 
G0251 07B9 200C LAC COLOR2 G0298 O7EF 6C29 SACH X1L,4 
G0252 07BA 4820 RPT PAXCNT * fill 8 pixel blocks G0299 O7FO 202A LAC X2H 


G0300 
G0301 


G0302 
G0303 
G0304 
G0305 
G0306 
G0307 
G0308 


G0309 
G0310 
GO311 
G0312 
G0313 
G0314 
G0315 
G0316 
G0317 


G0318 


G0319 
G0320 
G0321 


G0322 
G0323 
G0324 
G0325 
G0326 
G0327 
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G0328 
G0329 
G0330 
G0331 
G0332 
G0333 
G0334 


G0335 
G0336 


G0337 
G0338 
G0339 


G0340 
G0341 
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O7F 1 
O7F2 
O7F3 
OTF 4 
O7F5 
O7F6 
O7F7 
O7F8 
O7F9 
O7FA 
O7FB 
O7FC 
O7FD 
O7FE 
O7FF 
0800 
0801 
0802 
0803 
0804 
0805 
0806 
0807 
0808 
0809 
080A 
0808 
080C 
0800 
O80E 
O80F 
0810 
0811 
0812 
0813 
0814 
0815 
0816 
0817 
0818 
0819 
O81A 
0818 
O8iICc 
0810 
O8IE 
O8 IF 
0820 
0821 
0822 
0823 
0824 
0825 
0826 
0827 
0828 
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1028 
F680 
0830 
112€ 
6020 
2011 
1013 
6003 
C008 
0400 
04E8 
558C 
2E29 
68E0 
2703 
0028 
60E0 
2F03 
6888 
0000 
04C0 
0400 
0808 
CA00 
6088 
0001 
0878 
0029 
CE24 
CABC 
608C 
202D 
F380 
0816 
200C 
482D 
6080 
5588 
CA00 
6080 
D000 
04€8 
6080 
0000 
04F 0 
2080 
608C 
0001 
0888 
0028 
CE24 
558A 
FB99 
070C 
FF80 
086A 


RIGHT2 


LOOPE 


SUB 
BZ 


SUB 
SACL 
LAC 
SUB 
SACL 
LARK 
LRLK 


LARP 
LAC 
SACH 
LAC 
ADD 
SACL 
LAC 
SACH 
LRLK 


LRLK 


ZAC 
SACL 
LALK 


ADD 
CALA 
LACK 
SACL 
LAC 
BLZ 


LAC 
RPT 
SACL 
LARP 
ZAC 
SACL 
LRLK 


SACL 
LRLK 


LAC 
SACL 
LALK 


AOD 

CALA 
LARP 
BANZ 


8 


PC 1.0 85.157 15:17:30 12-05-85 
PAGE 0047 
X1H 
NARROW * endpoints in same 8 pixel group ? 
ONEG, | * "long" line 
PAXCNT * # 8 pixel groups 
Yc 
B 
Y2 *Y2 = YC + Y 
ARO,8 * set-up access to VSC registers 


AR4, VSCREG+>0080+XYOFFH 


AR4 
X1L,14 
#0+ 
Y2,7 
X1H 
*0+ 
¥2,15 
*,0,0 
ARO, VSCREG+>0080+CTLR2L 


* load XY coordinates for lower lin 


AR4,XYIND+X INC 


*,0,3 * load Control! Register 2 
LSBASE 
XIL 

* write pixels left of the blocks 
>00BC * X-Y mode RAS override 
*,0,4 * load Control! Register 2 
PAXCNT 
RIGHT2 
COLOR2 
PAXCNT * f{1l 8 pixel blocks 
a 
ARO 
* * load Control Register 2 


ARO, VSCREG+>0080+XYOFFH 


ARO, VSCREG+>0080+XYADRL 


*,0,4 
RSBASE 


X2b 

* write pixels right of the blocks 
AR2 
LOOP3,*-, 1 


ALDONE 


GRAPHIC 
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G03 46 
G0347 
G0348 
G0349 
G0350 
G0351 


G0352 
G0353 
G0354 
G0355 
G0356 
G0357 
G0358 
G0359 


G0360 
G0361 
G0362 
G0363 


G0364 
G0365 
G0366 
G0367 
G0368 
G0369 


G0370 
G037! 
G0372 
G0373 
G0374 
G0375 
G0376 
G0377 
G0378 
G0379 


G0380 
G0381 


G0382 
G0383 
G0384 
G0385 
G0386 
G0387 


G0388 
G0389 
G0390 
G0391 
G0392 


G0393 
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7213 
2010 
1080 
F480 
082F 
CA00 
6000 
2010 
OOAC 
6002 
2002 
682A 
0004 
FFFF 
6C2B 
2000 
6828 


0004 


FFFF 
6C29 
2011 
1013 
6003 
c008 
0400 
04E8 
558C 
2E29 
68E0 
2703 
0028 
60E0 
2F03 
6888 
302C 
0400 
0808 
9129 
F980 
0861 
558B 
CAFO 
4E8C 
400A 
6088 
FB90 
0861 
FF80 
0867 
558C 
200C 
6088 
FB90 
0861 
FF80 


LOOP4 


OSCRX4 


NARROW 


SLLOL 


SLHLL 
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DRAW "SHORT" HORIZONTAL LINES FOR LOWER HALF 


AR2,B 
XC 


* 


OSCRX4 


X1 * Xl = XC - X >= O 


X2 * X2 = XC + X 
seperate 7 MSBs from 3 LSBs (X2) 


seperate 7 MSBs from 3 LSBs (X1) 


YC * "short" line 

Y2 *Y2 = YC + Y 

ARO,8 * set-up access to VSC registers 
AR4, VSCREG+>0080+XYOFFH 


AR4 
X1L,14 
*0+ 
Y2,7 
X1H 
*0+ 
Y2,15 
*,0,0 
ARO, PIXCNT 
AR4,XYIND+XINC 


* load XY coordinates for lower lin 


XiL,1 
SLHIL 


* start on an odd pixel ? 


* pixel in Isnibble only 


* pixel pair 


799 
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0860 0867 G0433 0896 FF80O B RE7 
G0394 0861 FB9B SLHIt BANZ SLHLL,*#-,3 0897 O8F6 
0862 085A G0434 * 
G0395 0863 CAOF LACK >000F * pixe! in msnibble only G0435 7 WRITE 8 PIXELS LEFT OF BLOCK 
G0396 0864 4E80 AND . G0436 * (0 UNWRITTEN PIXELS) 
G0397 0865 4008 OR COLORH G0437 * 
G0398 0866 6080 SACL 7 G0438 7 }* # # # # © # #1 B PIXEL BLOCKS! 
G0399 0867 558A DONEIL LARP AR2 G0439 : 
G0400 0868 FB99 BANZ LOOP4,*-,1! G0440 0898 558C LEO LARP AR4 
0869 0829 G0441 0899 200C LAC COLOR2 
G0401 . G0442 089A 6080 SACL * 
G0402 086A 558A ALDONE LARP AR2 G0443 089B 6080 SACL * 
G0403 086B C264 LARK AR2, SAVREG G0444 089C 6080 SACL # 
G0404 O086C C809 LOPK >0009 * VSC register page G0445 0890 6088 SACL *,0,0 
G0405 086D 20A0 LAC #4 G0446 O089E CE26 RET 
G0406 086E 6030 SACL CTLRIL * load Control Register | G0447 O89F CE26 RET 
G0407 O86F 20A0 LAC *+ G0448 * 
G0408 0870 6038 SACL CTLRIH G0449 * WRITE 7 PIXELS LEFT OF BLOCK 
G0409 0871 20A0 LAC #4 GO0450 as (1 UNWRITTEN PIXELS) 
G0410 0872 6040 SACL CTLR2L = * load Contro) Register 2 G0451 * 
G0411 0873 20A0 LAC + G0452 * 2" * # # *# * #1 B PIXEL BLOCKS! 
G0412 0874 6048 SACL CTLR2H G0453 . 
G0413 0875 C800 LOPK 0 G0454 O8AO CAFO LE! LACK >OOFO 
G0414 0876 5060 LST STSO * restore data page pointer G0455 O8Al 4E8C AND *,4 
G0415 0877 CE26 RET G0456 O8A2 400A OR COLORL 
G0416 . G0457 08A3 6080 SACL * 
G0417 0878 FF80 LSBASE B LEO G0458 O8A4 200C LAC COLOR2 
0879 0898 G0459 O8A5 6080 SACL $ 
G0418 087A FF80 B LEI G0460 O8A6 6080 SACL * 
087B O8A0 G0461 O8A7 6088 SACL *,0,0 
G0419 087C FF80 B LE2 G0462 O8A8 CE26 RET 
087D 08A9 G0463 bi 
G0420 O87E FF80 8 LE3 G0464 7 WRITE 6 PIXELS LEFT OF BLOCK 
O87F O8AF G0465 . (2 UNWRITTEN PIXELS) 
G0421 0880 FF80 B LE4 G0 466 * 
0881 08B7 G0467 * 27" * * * * #1 8 PIXEL BLOCKS! 
G0422 0882 FF80 8 LES G0468 * 
0883 O8BC G0469 O8A9 558C LE2 LARP AR4 
G0423 0884 FF80 B LE6 G0470 O8AA 200C LAC COLOR2 
0885 08C3 G0471 O8AB 6080 SACL * 
G0424 0886 FF8O 8 LE7 G0472 O8AC 6080 SACL * 
0887 08C7 G0473 O8AD 6088 SACL *,0,0 
G0425 Md G0474 O8AE CE26 RET 
G0426 0888 FF8O0 RSBASE B REO G0475 * 
0889 O8CC G0476 7 WRITE 5 PIXELS LEFT OF BLOCK 
G0427 088A FF8O B RE 1 G0477 # (3 UNWRITTEN PIXELS) 
088B 0802 G0478 . 
G0428 O88C FFSO B RE2 G0479 : ‘272 * * # # #! 8 PIXEL BLOCKS! 
088D 08D5 G0480 * 
G0429 088E FF8O B RE3 G0481 O8AF CAFO LE3 LACK >O0FO 
O88F O8DC G0482 O8B0 4E8C AND *,4 
G0430 0890 FF80 8 RE4 G0483 08B1 4D0A OR COLORL 
0891 O8E0 G0484 08B2 6080 SACL = 
G0431 0892 FF8O B RE5 G0485 08B3 200C LAC COLOR2 
0893 08E8 G0486 08B4 6080 . SACL 7 
G0432 0894 FF80 8 RE6 G0487 0885 6088 SACL *,0,0 
0895 O8ED G0488 08B6 CE26 RET 
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G0489 
G0490 
G0491 
G0492 
G0493 
G0494 
G0495 
G0496 
G0497 
G0498 
G0499 
G0500 
G0501 
G0502 
G0503 
G0504 
G0505 
G0506 
G0507 
G0508 
G0509 
G0510 
G051} 
G0512 
G0513 
G0514 
G0515 
G0516 
G0517 
G0518 
G0519 
G0520 
G0521 
G0522 
G0523 
G0524 
G0525 
G0526 
G0527 
G0528 
G0529 
G0530 
G0531 
G0532 
G0533 
G0534 
G0535 
G0536 
G0537 
G0538 
G0539 
G0540 
G0541 
G0542 
G0543 
G0544 
G0545 


0887 
osBs 
0889 
O8BA 
0888 


08BC 
0880 
O8BE 
O8BF 
08CO 
08C1 
0sc2 


08C3 
08C4 
oscs 
08C6 


08C7 
oscs 
0sc9 
O8CA 
0scB 


oscc 
oscop 
O8CE 
Oscr 
0800 
08D! 
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558C 
200C 
6080 
6088 
CE26 


CAFO 
4E8C 
400A 
6080 
200C 
6088 
CE26 


558C 
200C 
6088 
CE26 


CAFO 
4E8C 
400A 
6088 
CE26 


558B 
CAOF 
4€8C 
4008 
6088 
CE26 


LE4 


2 2 @ &@ & & 


LE6 


ze ¢ 2 8 @ 


LE7 


ss 8s 8 8 


REO 


WRITE 4 


LARP 
LAC 
SACL 
SACL 
RET 


WRITE 3 


WRITE 2 


LARP 
LAC 
SACL 
RET 


WRITE 1 


LACK 
AND 
OR 
SACL 
RET 


WRITE 1 


PIXELS LEFT OF BLOCK 
(4 UNWRITTEN PIXELS) 


12222" * * *| 8 PIXEL BLOCKS} 


AR4 
COLOR2 
* 


*,0,0 


PIXELS LEFT OF BLOCK 
(5 UNWRITTEN PIXELS) 


1222722" * "| 8 PIXEL BLOCKS} 


>00F 0 
"4 
COLORL 
* 
COLOR2 
*,0,0 


PIXELS LEFT OF BLOCK 
(6 UNWRITTEN PIXELS) 


12227222 * *{ 8 PIXEL BLOCKS} 


AR4 
COLOR2 
*,0,0 


PIXELS LEFT OF BLOCK 
(7 UNWRITTEN PIXELS) 


12222222 "| 8 PIXEL BLOCKS; 


PIXELS RIGHT OF BLOCK 
(7 UNWRITTEN PIXELS) 


18 PIXEL BLOCKS {* 722722 2 2} 


AR3 
>O00F 
“4 
COLORH 
*,0,0 
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G0546 
G0547 
G0548 
G0549 
G0550 
G055} 
G0552 
G0553 
G0554 
G0555 
G0556 
G0557 
G0558 
G0559 
G0560 
G0561 
G0562 
G0563 
G0564 
G0565 
G0566 
G0567 
G0568 
G0569 
G0570 
G0571 
G0572 
G0573 
G0574 
G0575 
G0576 
G0577 
G0578 
G0579 
G0580 
G058 1 
G0582 
G0583 
G0584 
G0585 
G0586 
G0587 
G0588 
G0589 
G0590 
G0591 
G0592 
G0593 
G0594 
G0595 
G0596 
G0597 
G0598 
G0599 
G0600 
G0601 
G0602 


0802 
08D3 
0804 


0805 
08D6 
0807 
0808 
0809 
O8DA 
0808 


0s0c 
080D 
O8DE 
O8DF 


08E0 
O8E | 
08E2 
08E3 
08E 4 
08E5 
O8E6 
08E7 


0ses 
08E9 
OB8EA 
08EB 
O8EC 
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200C 
6088 
CE26 


200C 
6088 
CAOF 
4E8C 
4008 
6088 
CE26 


200C 
6080 
6088 
CE26 


200C 
6080 
608B 
CAOF 
4E8C 
4008 
6088 
CE26 


200C 
6080 
6080 
6088 
CE26 


ses @ &@© 8 & 


RE3 


RE4 


RES 


WRITE 2 


LAC 
SACL 
RET 


WRITE 3 


WRITE 4 


LAC 
SACL 
SACL 
RET 


WRITE 5 


LAC 
SACL 
SACL 
LACK 
AND 
OR 
SACL 
RET 


WRITE 6 


LAC 
SACL 
SACL 
SACL 
RET 


15:17:30 
PAGE 0052 


PIXELS RIGHT OF BLOCK 
(6 UNWRITTEN PIXELS) 

'8 PIXEL BLOCKS !* * 277272727! 
COLOR2 

*,0,0 

PIXELS RIGHT OF BLOCK 

(S UNWRITTEN PIXELS) 

'8 PIXEL BLOCKS 1* * * 227272! 
COLOR2 

*,0,3 

>000F 

"4 


COLORH 
*,0,0 


PIXELS RIGHT OF BLOCK 

(4 UNWRITTEN PIXELS) 

18 PIXEL BLOCKS |}* * * * 72 2? 2} 
COLOR2 

* 

*,0,0 

PIXELS RIGHT OF BLOCK 

(3 UNWRITTEN PIXELS) 

{8 PIXEL BLOCKS |" * * * * 2 7 2} 
COLOR2 

* 

*,0,3 

>O00F 

*,4 


COLORH 
*,0,0 


PIXELS RIGHT OF BLOCK 
(2 UNWRITTEN PIXELS) 

18 PIXEL BLOCKS 1* * * * * * 2 2! 
COLOR2 

* 


*,0,0 


12-05-85 
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G0603 
G0604 
G0605 
G0606 
G0607 
G0608 
G0609 
G0610 
GO6I1 
G0612 
G0613 
G0614 
G0615 
G0616 
G0617 
G0618 
G0619 
G0620 
G0621 
G0622 
G0623 
G0624 
G0625 
G0626 
G0627 
G0628 
G0629 

0008 


O08ED 
O8EE 
O8EF 
O8F 0 
O8F | 
O8F 2 
08F3 
O8F 4 
O8F5 


O8F 6 
08F7 
08F8 
O8F9 
O8FA 
O8FB 
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200C 
6080 
6080 
6088 
CAOF 
4E8C 
4008 
6088 
CE26 


200C 
6080 
6080 
6080 
6088 
CE26 


ss 2B & & @ 


RE6 


RE7 


WRITE 7 


WRITE 8 


LAC 

SACL 
SACL 
SACL 
SACL 
RET 

COPY 


PIXELS RIGHT OF BLOCK 
(1 UNWRITTEN PIXELS) 


18 PIXEL BLOCKS }* © * * * # # 2: 


PIXELS RIGHT OF BLOCK 
(0 UNWRITTEN PIXELS) 


18 PIXEL BLOCKS }*# * © © © © # #: 


*,0,0 


CUBER.ASM 


GRAPHIC 


HO002 

HO003 

HO004 

HOOOS 

HOO06 

HOO07 

HO008 

HOOO9 

HOOLO 

HOO! ! 

HOO12 

HOO13 

HOOI4 

HOO15 

HOO16 

HOO17 

HOO18 

HOO19 

HOO20 

HOO2 1 

HO022 

HOO23 

HOO24 

HOO25 

HO026 

HO027 

H0028 

H0029 

H0030 

HOO31! 

H0032 

H0033 

HOO34 

HOO035 

H0036 

H0037 

HOO38 

H0039 

HO040 

HOOA! 

HOO42 

HO043 

HOO44 

H0045 

HO046 

H0047 

HO048 

H0049 

HOO50 0004 

HOO051 0400 

H0052 

HO053 

HOO54 

HOO55 O8FC 

HO056 O8FC D200 
O8FD 0347 

HOO57 O8FE 558A 
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LAOS eee ECE ER CPE ERR ESE EEE SESE RSE EEE eee Es 
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ROUTINE: CUBER 


This routine controls the rotation and perspective 
projection of a normalized cube. The cube is defined by 
the vertices (0.5,-0.5,-0.5), (0.5,0.5,-0.5), 
(-0.5,0.5,-0.5), (-0.5,-0.5,-0.5), (0.5,-0.5,0.5), 
(0.5,0.5,0.5), (-0.5,0.5,0.5), and (-0.5,-0.5,0.5). The 
cube is rotated about the z-axis at an angular rate equal 
to R/128 degrees. i.e. The angle of rotation per display 
time is any one of 128 integral quantities determined by 
dividing PI/2 into 128 equal parts. 


]----+------ 6 

/\ H Ji 

/ | ' / | 
/ 4} / 4 
4-------~--- 5} 
| ~7t--> 
1 3------- In--2 oy 
i / 1 f 
1 / / 1 / 
t/ / i/ 
0----------- 1 
x f 

V 


This routine loads (initializes) the matrices used in the 
rotation, scaling, and projection of the cube, initializes 
the vertices, and then controls the modification and 
display of the cube. The modification and display are 
actually two sections of a larger loop. In the first 
section, each point in normalized coordinates is rotated, 
scaled, and projected to determine the display 
coordinates. Once this has been completed for al} 
vertices in the cube, the processing begins in the second 
section. The second section processing consists of 
selecting vertex pairs constituting edges of the cube and 
calling the line drawing algorithm to plot the line 
defining the edge. 


(RRSRAESSLERR ESSERE RSE RE RRR ERE RE RE RE REESE SER ES EES ESE ERE RRS ESE ES SE 


RATE EQU 4 
RPTCNT EQU >0400 


INITIALIZATION OF MATRICES 


CUBER €QU $ 


LRLK AR2,NRMPTS+DP6 


LARP AR2 
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H0058 
H0059 


HOO60 
H0061 
H0062 
H0063 


HO064 
HO065 


HO066 
HO067 


H0068 
HO069 


H0070 
H0071 
HO072 
H0073 


HO074 
HOO75 


H0076 
H0077 
HO078 
HO0079 


HOO080 
HOO08 1. 
HO082 
H0083 
H0084 


HOO085 
HO086 


HOO87 
HO088 
H0089 
HO090 
H0091 
H0092 
H0093 
H0094 
HO095 
H0096 
HO097 
H0098 
H0099 
HO100 
HO1O1 
HO102 
HO103 


O8F F 
0900 
0901 


0902 
0903 
0904 
0905 
0906 
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C817 
FCAO 
0873 


0902 
558A 
0200 


* 


CUBER2 


CTLOOP 


RPTK 
BLKP 


EQU 
LARP 
LRLK 


RPTK 
BLKP 


LDPK 
LALK 


23 
VRTICS, *+ 


$ 
AR2 
AR2,C11+DP4 


11 
CMTRX, *+ 


4 
COSTBL 


cos 
SINTBL 


SIN 

RATE 

THETA 

AR2 ,VSX+DP4 


3 
SCALE , *+ 


6 

ONE6, 11 
NUMRBR 
ARO,RPTCNT 


ARO ,SAVARO 


AR2,NRMPTS+DP6 
AR3 ,DSPPTS+D0P6 


AR4,NRMPTS+DP6 


ARO,?7 
4 
cos 
Ril 
R22 


SIN 
R12 


R12 
R21 
6 

AR2 
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transfer the normalized 


points to data memory 


load the viewpoint matrix 


initialize the COS address 


initialize the SINE addres 


load the rate of rotation 


load the projection matrix 


numerator = >0800 


MODIFICATION AND DISPLAY CONTROL LOOP 


initialize indirect access 


update the rotation matrix 


MATRIX MODIFICATION OF CUBE VERTICES 


LAC 
SACL 


*+ 
X 


# 


fetch the x coordinate 
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H0104 
HO105 
HO106 
H0O107 
HO108 
H0109 


HO110 
HO111 


HO112 
HO113 
HO!14 
HO115 
HO116 
HO117 


HO118 
HO1I9 
HO120 
H0121 
HO122 


HO123 


H0124 
HO125 
H0126 
HO127 
H0128 
H0129 
HO130 
HO131 


H0132 
H0133 


HO134 
HO135 
H0136 
H0137 
H0138 


H0139 
H0140 


HO141 
HO142 
H0143 
HO144 
H0145 


HO 146 
0009 
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20A0 


DOCOS 


NXTROT 


LAC 
SACL 
LAC 
SACL 
SACL 
CALL 


CALL 
CALL 
LARP 
LAC 

SACL 
LAC 

SACL 
BANZ 
CUBE 


ZAC 
CALL 


CALL 


ROTATION MATRIX UPDATE 


#4 

Y 

#4 

Z 

ZR 
ROTZ 


MTRX4 
PRJCTN 


AR3 

XP 

+ 

yP 

*+,0,0 
CLOOP! »*~12 


DISPLAY 


CLEAR 


PLOT 


ENOTBL 
DOCOS 
SINTBL 


SIN 
cos 
THETA 
CoS 
ENDTBL 


NXTROT 
SINTBL 
cos 

6 
ARO , SAVARO 


ARO 
CTLOOP 
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fetch the y coordinate 
fetch the z coordinate 


Zz not affected by rotation 
rotate the vertex 


apply viewpoint matrix 


project the vertex 


store the X screen address 


store the Y screen address 
all vertices processed? 


clear the screen 


draw the cube 


update sine angle address 


reset to beginning of tab! 


update cosine angle addres 


reset to beginning of tab! 


rotation sequence complete 


999 
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10002 . 
10003 TERRES ESEE RE RSERESSEEES ERE RERERESESERSZESCE RSE REESE ERS EER ERE REE EES EE 
10004 * ROUTINE: ROTZ 
10005 - 
10006 * This routine fis performs the function of rotation about 
10007 * the z-axis. When the routine is called it expects that 
10008 * rotation matrix, 2 x 2, is located on page 4 in 
10009 * consecutive locations (RII, R2l, RI2, R22) and the data 
10010 * point matrices are on page 6 in locations labeled X, Y, 
10011 * XR, YR. 
10012 * 
10013 bs +- + 
10014 . {XR YR} =(€{XYJ*" } RII RI2 } 
10015 « ' R21 R22 } 
10016 . +- ~+ 
10017 * 
10018 * This shortcut can be taken since we are defining a simple 
10019 * rotation about the z-axis only and the object is assumed 
10020 * to be located about the axis of rotation. This permits 
10021 * reduction to a two-dimensional rotation and elimination of 
10022 * the homogenous coordinate system for this operation. 
10023 * 
10024 * 1t fis assummed that the points to be rotated are 
10025 * normalized on a unit basis, with values in the range of 
10026 * (-1,+1) and supplied in a Q1I5 format. The same assumption 
10027 * is placed on the rotational values of cos O and sin 0. The 
10028 * output points are therby also guaranteed to be in a QI5 
10029 * format. 
10030 - 
10031 * It {is also assummed that the data page pointer is equal to 
10032 * 6 and that the block B80 is configured as data memory 
10033 * before the routine fs called. 
10034 * 
10035 * This routine modifies the T-reg, P-reg, ACcumulator, and 
10036 * Auxiliary Register !. 
10037 . 
10038 PRE REREEEZEEEESERESE ERE EET ESE REESE EEE ERE EES ERE EERE 
10039 * 
10040 0962 ROTZ EQU $ 
10041 0962 0100 LRLK AR1,R11+0P4 

0963 0212 
10042 0964 5589 LARP ARI 
10043 0965 CA00 ZAC * zero the accumulator 
10044 0966 3CA0 LT #4 
10045 0967 3822 MPY x * X*R11 
10046 0968 3DA0 LTA + 
10047 0969 3823 MPY Y * Y*R21 
10048 096A 3DA0 LTA + * X*RI1 + Y*R21 
10049 0968 6925 SACH XR, | 
10050 096C CAO0 ZAC * zero the accumulator 
10051 096D 3822 MPY x * X*R1I2 
10052 096E 3080 LTA * 
10053 O096F 3823 MPY Y * Y*R22 
10054 0970 CE15 APAC * X*R1Z2 + Y*R22 
10055 0971 6926 SACH YR, 1 
10056 0972 CE26 RET 
0010 COPY MATRIX4.ASM 
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J0002 * 
J0003 PER R SESE SZESESVTSESEEESESRS ESE ESS ESAS EEE RE REZ ESE EEE RESET ES ESE SES 
J0004 * ROUTINE: MTRX4 
J0005 £ 
J0006 * This routine fis performs the function of a combination 
J0007 * (translations, rotations, scalings, etc.) of matrix 
J0008 * operations on a three-dimensional point in a homogenous 
J0009 * coordinate system. The computations executed by the 
J0010 * routine are the matrix multiplication of a [1x4] x [4x4]. 
J0011 * In actuality. since the tast point fis only the homogeneous 
J0012 * coordinate, the matrix multiplication is reduced to a 
J0013 “ {1x4] «x [4x3]. 
J0014 * 
J0015 * When the routine is called it expects that the combination 
J0016 * matrix, 4 x 3, is located on page 4 in consecutive 
J0017 * locations (Cli, C21, C31, C41, Cl2, C22, C32, C42, C13, 
J00!18 * C23, C33, C43) and the data point matrices are on page 6 
J0019 * in locations labeled XR, YR, ZR, XC, YC, ZC. 
J0020 " 
J0021 * +- -+ 
J0022 7 { xc vc 20 1} = { XR YR ZR 1) * } Cli Cl2 C!l3 O } 
J0023 . ! C21 C22 Cc23 Oo |} 
J0024 . ' C3! C32 C33 O } 
J0025 * 1 C41 C42 C43 } 3 
J0026 * +- + 
J0027 bd 
J0028 * It is assummed that the points to be manipulated are 
J0029 * normalized on a unit basis, with values in the range of 
J0030 * (-1,+1) and supplied in a QI5 format. The combination 
J0031 * matrix is assumed to be in a Ql! format, with values in 
J0032 * the range of (-16,+16). The output points are thereby 
J0033 * guaranteed to be in a QII format. 
J0034 * 
J0035 * It is also assummed that the data page pointer is equal to 
J0036 * 6 and that the block BO is configured as data memory 
J0037 * before the routine {fs called. 
J0038 . 
J0039 * This routine modifies the T-reg, P-reg, ACcumulator, and 
J0040 * Auxiliary Register 1. 
J0041 * 
J0042 HHHSRHSKRHSKHSHHHKHHHHKERHSSKKKHFHHKSSHKHCOKKKHSFCHKHKHHHKHKRHRREHRHHRBEA SD 
J0043 . 
J0044 0973 MTRX4 EQU $ 
J0045 0973 D100 LRLK ARI ,C11+0P4 

0974 0216 
J0046 0975 5589 LARP AR1 
J0047 0976 CAO00 ZAC * zero the accumulator 
J0048 0977 3CA0 LT 4 
J0049 0978 3825 MPY XR * XR*CII 
J0050 0979 3DA0 LTA 4 
JO05! O97A 3826 MPY YR * YR*C21 
J0052 097B 3DA0 LTA 4 * XR*®*C11 + YR*C21 
J0053 097C 3827 MPY ZR * ZR*C31 
J0054 097D OFAO ADD *+,15 * XR*®*C11 + YR*C21 + C4l 
J0055 097E 3DA0 LTA #4 * XR*®*C11 + YR*C2!1 + ZR*C31 + C41 
J0056 O97F 6910 SACH XC, 1 
J0057 0980 CA0O ZAC * zero the accumulator 
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J0058 
J0059 
J0060 
J0061 
"J0062 
J0063 
J0064 
J0065 
J0066 
J0067 
J0068 
J0069 
J0070 
J0071 
J0072 
J0073 
J0074 
J0075 

0011 


0981 
0982 
0983 
0984 
0985 
0986 
0987 
0988 
0989 
098A 
0988 
098C 
0980 
098E 
098F 
0990 
0991 
0992 
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3825 MPY 
3DA0 LTA 
3826 MPY 
3DA0 LTA 
3827 MPY 
OF AO ADD 
3DA0 LTA 
6911 SACH 
CA00 ZAC 
3825 MPY 
3DA0 LTA 
3826 MPY 
3DA0 LTA 
3827 MPY 
OF 80 ADD 
CEIS APAC 
6912 SACH 
CE26 RET 
COPY 


SEMBLER PC 1.0 85.157 15:17:30 12-05-85 
PAGE 0059 
XR * XR*C12 
ey 
YR * YR*C22 
*+ * XR®*C12 + YR*C22 
ZR * ZR*C32 
"4,15 * XR*C12 + YR*C22 + C42 
4+ * XR*C12 + YR*C22 + ZR*C32 + C42 
Yc, 1 
* zero the accumulator 
XR * XR*C13 
+ 
YR * YR*C23 
w+ * XR*®*C13 + YR*C23 
ZR * ZR*C33 
*,15 * XR*C13 + YR*C23 + C43 
* XR*®*C13 + YR*C23 + ZR*C33 + C43 
ZC, 1 
PRJCTN.ASM 
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K0002 
K0003 
K0004 
K0005 
K0006 
K0007 
K0008 
K0009 
K0010 
K0011 
K0012 
K0013 
K0014 
K0015 
K0016 
K0017 
K0018 
K0019 
k0020 
K0021 

K0022 
K0023 
K0024 
k0025 
k0026 
K0027 
K0028 
K0029 
K0030 
K0031 

K0032 
K0033 
K0034 
K0035 
K0036 
K0037 
K0038 
K0039 
K0040 
K0041 

K0042 
K0043 
K0044 
K0045 
K0046 
K0047 
K0048 
K0049 
K0050 
Kk0051 

K0052 
K0053 
K0054 
K0055 
K0056 
K0057 
K0058 
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ROUTINE: PRJCTN 

This routine is performs the function of a projecting a 
three-dimensional {mage onto a two-dimensional display 
map. In general this involves the matrix multiplication 
of a [1x4] x [4x4]. Since this is the final operation to 
displaying a point, the homogeneous coordinate column can 
be deleted from the matrix. Also this fis a scaling 
(diagonal) and translation matrix, allowing the 
computations to be further reduced by avoiding multiply 
accumulates fnvolving the known zeroes. 


When the routine is called it expects that the matrix 
parameters to be located on page 4 in consecutive 
locations (VSX, VCX, VSY, VCY) and the data point matrices 
to be on page 6 in locations labeled XC, YC, ZC, XP, YP. 


VSX’ 0 0 
0 vsy’ Q 
0 VS2’ 
vex vCY VCZ: 


{ XP YP ZP | } = [ XC YC ZC 1 } * 


+ -------- + 
So 
a > eo 
+ -------- + 


VCX, VCY, VCZ are the location of the center of object on 
the screen and, of course, VCZ = 0. VSX’, VSY’, and VSZ’ 
constitute the scaling of the object to meet the display 
size or resolution and to include the factor of the 
projection of the z-coordinate onto the x,y-plane. Again 
VSZ’ = 0 and VSX’ and VSY’ are given by the equations: 
VSX’ = VSX/ZC and VSY’ = VSY/2C 
It is assummed that the points to be scaled and projected 
have been provided in a Ql1 format as a result of a 
combination matrix operation. It is furthered assummed 
that the ZC parameter in particular has some integer 
portion (either positive or negative). This is important 
since we perform a division and expect to return the 
result for additional computations as a Q1I5 number. The 
scaling and centering values should be provided such that 
the multiplication and addition generate a QO value as 
output from the original QI! input value when retrieved 
from the accumulator with a SACH <dma>,1. With a distance 
to screen size ratio of 4 and a view point at (6,8,7.5), 
the scaling numbers are represented in Q6 and the 
centering values in a QO format. 


It is also assummed that the data page pointer is equal to 
6 and that the block BO is configured as data memory 
before the routine jis called. 


This routine modifies the T-reg, P-reg, ACcumulator, and 
Auxiliary Register !. 
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K0059 HHHHERAERHEKESAHEHHHEERHHHKRHRHKHEHHHHEEREHHRHENHEEEHE EERE EH OER LO0002 * 
K0060 * L0003 ESET ESIC SITUS ETIS ESI TESTE SIT TI LITT TIT TTTTriTtritttt 
KO0061 0993 PRJCTN EQU $ LO004 * ROUTINE: PLOT 
KO0062 0993 2012 LAC ZC LO005 * 
K0063 0994 6814 SACH ZCSIGN * save sign of ZC LO0006 * This routine controls the line drawing (plotting) required 
K0064 0995 CE1B ABS LO007 * to display a wire-frame cube. The vertices are expected 
K0065 0996 6012 SACL Zc * save absolute value of ZC L0008 * to be stored in a screen location table located on page 6 
K0066 0997 4071 ZALH NUMR8R * load a QI] value equal ! L0009 * beginning with the address DSPPTS. The line drawing 
K0067 0998 CBOE RPTK 14 * divide by ZC LOO10 * routine is called to draw each of the 12 lines connecting 
K0068 0999 4712 SUBC Zc LOOLI * the 8 vertices and defining the edges of cube as shown 
K0069 099A 6013 SACL ZCI * save {ZC! inverse (1/!ZC!) LOO12 * below. 
K0070 099B 2013 LAC ZCl * result is in Q15 format L0013 * 
K0071 099C 4C14 XOR ZCSIGN * sign adjustment LOO14 * or. 
K0072 0990 1014 SUB ZCSIGN L0015 * H 
K0073 099E 6013 SACL ZCI * save ZC inverse (1/ZC) LOO16 - J----+------ 6 
K0074 099F 3C13 LT ZCl L0017 * fi H fi 
K0075 O9A0 3810 MPY XC * XC/ZC L0018 * / 4 H /\ 
K0076 O9A1 CE14 PAC 10019 ig / 4 / 43 
K0077 O9A2 6910 SACH XC, 1 L0020 * 4-----~----- 5 H 
K0078 09A3 3811 MPY YC * YC/ZC L0021 is H H 1 ~-+#--> 
K0079 O9A4 CE14 PAC L0022 * H 32S s=5> inmn-2 oy 
K0080 O9A5 6911 SACH YC,1! L0023 * 1 of 1 f 
KQ081 O9A6 0100 LRLK AR1,VSX+DP4 L0024 * 1 / / i / 
O9A7 0222 L0025 ” / / i/ 
K0082 09A8 5589 LARP ARI L0026 * 0----------- 1 
K0083 O09A9 3CA0 LT + L0027 hs x / 
'S K0084 O9AA 2FA0 LAC *+,15 - VCX L0028 is v 
> K0085 O9AB 3810 MPY XC * XC’ *VSX L0029 * 
KO0086 O9AC 3DA0 LTA *+ * XC’ *VSX + VCX L0030 * The routine proceeds by first drawing lines from points 
9) K0087 O9AD 6900 SACH XP, 1 L0031 * 0-1-2-3-0. Next the lines using point pairs 0-4, 1-5, 
5 K0088 O9AE 2F80 LAC *,15 J VCY L0032 * 2-6, and 3-7 are drawn. Finally lines are drawn 
E. K0089 OSAF 3811 MPY yc * YC’ *VSY 10033 * connecting the points 4-5-6-7-4 in sequence. 
#» K0090 09860 CE15 APAC * YC’*VSY + VCY L0034 * 
=} KO0091I 09B! 6901 SACH YP,1 L0035 POTEET IES SETI SSE ESI STITT VSI C SIS SITES S STS TTT iS TTT Stir erry 
S, K0092 09B2 CE26 RET 10036 . 
g 0012 COPY OBJPLOT. ASM 10037 09B3 PLOT EQU $ 
@ L0038 09B3 CAO1 LACK 1 
ns L0039 09B4 6004 SACL COLOR * initialize color to RED 
e L0040 09B5 D200. LRLK AR2 , DSPPTS+DP6 
3 09B6 035F 
a L004! 09B7 20A0 LAC Hy 
= 10042 09B8 6000 SACL XI " fetch X0 
ga L0043 0989 20A0 LAC + 
> 10044 09BA 6001 SACL Y1 * fetch YO 
© L0045 09BB 20A0 LAC #4 
z L0046 09BC 6002 SACL —_X2 * fetch X1 
n LO047 09B8D 6024 SACL XB * save X!1 for next line 
ne L0048 O9BE 20A0 LAC 4 
S L0049 O9BF 6003 SACL Y2 * fetch Yi 
o LO0050 09C0 6025 SACL YB * save Yl for next line 
5 LOO51 09C1 726F SAR AR2, SAVAR2 
a. L00S2 09C2 FE80 CALL LINE * draw line connecting 0 & 1 
J 09C3 0524 
= L0053 09C4 558A LARP AR2 
os L0054 09C5 326F LAR AR2, SAVAR2 
g LO055 09C6 2024 LAC XB 


LO0056 09C7 6000 SACL x) * fetch X) 
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LO057 09C8 2025 LAC YB L0107 OAO1 326F LAR AR2,SAVAR2 
L0058 09C9 6001 SACL Yl * fetch YI! 10108 OA02 3370 LAR AR3,SAVAR3 
LO059 O9CA 20A0 LAC + L0109 OA03 558A LARP AR2 
10060 O09CB 6002 SACL X2 * fetch X2 LO110 0A04 20A0 LAC *+ 
LO061 O9CC 6024 SACL XB * save X2 for next line LOI11 OAOS 6000 SACL x1 * fetch X1 
L0062 O9CD 20A0 LAC + LO1i2 OA06 20AB LAC *4+,0,3 
10063 O9CE 6003 SACL Y2 * fetch Y2 LO!!3 OAO7 6001 SACL Y! * fetch Y1 
L0064 O9CF 6025 SACL YB * save Y2 for next line L0114 OA08 20A0 LAC ay 
10065 09D0 726F SAR AR2 , SAVAR2 L0115 0A09 6002 SACL X2 * fetch X5 
L0066 09D! FE8O CALL LINE * draw line connecting 1 & 2 LO116 OAOA 20A0 LAC + 
0902 0524 LO117 OAOB 6003 SACL Y2 * fetch Y5 
10067 09D3 558A LARP AR2 10118 OAOC 726F SAR AR2,SAVAR2 
L0068 09D4 326F LAR AR2,SAVAR2 L0119 OA0D 7370 SAR AR3 , SAVAR3 
L0069 09D5 2024 LAC XB L0120 OAOE FE80 CALL LINE * draw line connecting | & 5 
L0070 09D6 6000 SACL XI * fetch X2 OAOF 0524 
L0071 0907 2025 LAC YB LO121 OAI1O 326F LAR AR2 , SAVAR2 
L0072 0908 6001 SACL Yl * fetch Y2 L0122 OA11 3370 LAR AR3, SAVAR3 
10073 0909 20A0 LAC *+ L0123 OA12 558A LARP AR2 
10074 O9DA 6002 SACL X2 * fetch X3 10124 0A13 20A0 LAC #4 
10075 09DB 6024 SACL XB * save X3 for next line 10125 OA!4 6000 SACL x1 * fetch X2 
10076 O9DC 20A0 LAC "+ L0126 OAIS 20AB LAC *+,0,3 
L0077 0900 6003 SACL Y2 * fetch Y3 L0127 OA!I6 6001 SACL Y! * fetch Y2 
L0078 O9DE 6025 SACL YB * save Y3 for next line LO1!28 OA17 20A0 LAC a+ 
L0079 O9DF 726F SAR AR2, SAVAR2 LO!129 OAI8 6002 SACL X2 * fetch X6 
LO080 O9E0 FE80 CALL LINE * draw line connecting 2 & 3 L0130 OA19 20A0 LAC a+ 
O9Et 0524 LO131 OAIA 6003 SACL Y2 * fetch Y6 
L0081 O9E2 558A LARP AR2 LO132 OAIB 726F SAR AR2, SAVAR2 
L0082 09E3 0200 LRLK AR2 , DSPPTS+DP6 L0133 OAIC 7370 SAR AR3 , SAVAR3 
09F4 035F LO134 OAID FE8O CALL LINE * draw line connecting 2 & 6 
L0083 09E5 2024 LAC XB OAIE 0524 
10084 09E6 6000 SACL X1 * fetch X3 LOI35 OAIF 326F LAR AR2 , SAVAR2 
L0085 09E7 2025 LAC YB LO136 0A20 3370 LAR AR3 , SAVAR3 
L0086 O09E8 6001 SACL Y! * fetch Y3 L0137 OA21 558A LARP AR2 
LO087 O9E9 20A0 LAC + L0138 0A22 20A0 LAC #+ 
10088 O9EA 6002 SACL X2 * fetch X0 L0139 0A23 6000 SACL X1 * fetch X3 
L0089 O09EB 20A0 LAC + L0140 0A24 20AB LAC *+,0,3 
L0090 O9EC 6003 SACL Y2 * fetch YO L0141 OA25 6001 SACL Yi * fetch Y3 
L0091 O9ED 726F SAR AR2 , SAVAR2 L0142 OA26 20A0 LAC + 
L0092 O9EE FE80 CALL LINE * draw line connecting 3 & 0 L0143 0A27 6002 SACL X2 * fetch X7 
O9EF 0524 L0144 O0A28 20A0 LAC *+ 
10093 09FO0 D200 LRLK AR2 ,DSPPTS+DP6 L0145 0A29 6003 SACL Y2 * fetch Y7 
OOF! O35F L0146 OA2A 726F SAR AR2, SAVAR2 
10094 09F2 0300 LRLK AR3 ,DSPPTZ+DP6 L0147 0A2B 7370 SAR AR3,SAVAR3 
O9F3 0367 L0148 OA2C FE8O CALL LINE * draw line connecting 3 & 7 
L0095 09F4 558A LARP AR2 OA2D 0524 
L0096 O9F5 20A0 LAC + L0149 OA2ZE CA07 LACK 7 
L0097 O09F6 6000 SACL x1 * fetch X0 LOI50 OA2F 6004 SACL COLOR * initialize color to WHITE 
10098 O09F7 20AB LAC *+,0,3 L015! 0OA30 558A LARP AR2 
10099 O09F8 6001 SACL Yl * fetch YO L0152 0A31 0200 LRLK AR2 , OSPPTZ+DP6 
LOL!00 O9F9 20A0 LAC #+ 0A32 0367 
LO101 O9FA 6002 SACL X2 * fetch X4 10153 0A33 20A0 LAC ha 
L0102 O09FB 20A0 LAC + LOI54 0A34 6000 SACL x1 * fetch X4 
LO!103 O9FC 6003 SACL Y2 * fetch Y4 LO155 OA35 20A0 LAC H+ 
10104 O9FD 726F SAR AR2, SAVAR2 LOI56 0A36 6001 SACL Yl * fetch Y4 
LO105 OSFE 7370 SAR AR3 , SAVAR3 10157 OA37 20A0 LAC + 
10106 O9FF FE8O CALL LINE * draw line connecting 0 & 4 L0158 OA38 6002 SACL X2 * fetch X5 
OA00 0524 L0159 0A39 6024 SACL XB * save X5 for next line 


OL9 
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L0160 OA3A 20A0 LAC 4 M0002 * SINE COSINE 
L0161 OA3B 6003 SACL Y2 * fetch Y5 M6003 bd 
L0162 OA3C 6025 SACL YB * save Y5 for next line M0004 OA6D 0000 SINTBL DATA 0 * 0/360 degrees 270 degrees 
L0163 0A30 726F SAR AR2,SAVAR2 MOOOS OAGE 0324 DATA 804 
LO164 OA3E FE80 CALL LINE * draw line connecting 4 & 5 MOO06 OA6F 0648 DATA 1608 
OA3F 0524 M0007 0A70 096A DATA 2410 
L0165 OA40 558A LARP AR2 M0008 OA7! OC8C DATA 3212 
LO!66 0A41 326F LAR AR2, SAVAR2 M0009 0A72 OFAB DATA 4011 
L0167 OA42 2024 LAC xB M0010 0A73 12C8 DATA 4808 
L0168 0A43 6000 SACL Xl * fetch X5 M0011 OA74 15E2 DATA 5602 
LO0169 0A44 2025 LAC YB MOOI2 OA7S5 I8F9 DATA 6393 
L0170 OA45 6001 SACL Y! * fetch Y5 M0013 0OA76 1COB DATA 7179 
LO171 OA46 20A0 LAC bel 2 MOO14 OA77 IFIA DATA 71962 
L0172 OA47 6002 SACL X2 * fetch X6 M0015 OA78 2223 DATA 8739 
10173 O0A48 6024 SACL XB * save X6 for next line M0016 O0A79 2528 DATA 9512 
L0174 OA49 20A0 LAC *+ M0017 OA7A 2826 DATA 10278 
L0175 OA4A 6003 SACL Y2 * fetch Y6 M0018 OA7B 2BiF DATA 11039 
L0176 OA4B 6025 SACL YB * save Y6 for next line M0019 OA7C 2E11 DATA 11793 
LO177 OA4C 726F SAR AR2 , SAVAR2 M0020 OA7D 30FB DATA 12539 
L0178 OA4D FE80 CALL LINE * draw line connecting 5 & 6 MOO2Z! OA7E 33DF DATA 13279 
OA4E 0524 M0022 OA7F 36BA DATA 14010 
L0179 OA4F 558A LARP AR2 M0023 OA80 398C DATA 14732 
LO!80 OA5O 326F LAR AR2, SAVAR2 M0024 OA81 3C56 DATA 15446 
LOI81 OAS! 2024 LAC XB M0025 OA82 3F 17 DATA 1615] 
LO182 OA52 6000 SACL X1 * fetch X6 M0026 0A83 41CE DATA 16846 
L0183 0A53 2025 LAC YB M0027 OA84 447A DATA 17530 
10184 O0A54 6001 SACL YI * fetch Y6 M0028 0A85 471C DATA 18204 
L0185 OA55 20A0 LAC wy M0029 OA86 49B4 DATA 18868 
L0186 OAS56 6002 SACL X2 * fetch X7 M0030 O0A87 4C3F DATA 19519 
L0187 OA57 6024 SACL XB * save X7 for next line M0031 OA88 4EBF DATA 20159 
L0188 OA58 20A0 LAC ey M0032 O0A89 5133 DATA 20787 
10189 0A59 6003 SACL Y2 * fetch Y7 M0033 OA8A 539B DATA 21403 
LO190 OASA 6025 SACL YB * save Y7 for next line M0034 OA8B 55F5 DATA 22005 
LO!91 OA5SB 726F SAR AR2, SAVAR2 M0035 OA8C 5842 DATA 22594 
10192 OASC FE80 CALL —_LINE * draw line connecting 6 & 7 M0036 OA8D 5A82 DATA 23170 * 45 degrees 305 degrees, 
OA5D 0524 M0037 OASE 5CB3 DATA 23731 
L0193 OASE 558A LARP AR2 M0038 OA8F 5E07 DATA 24279 
L0194 OA5F 0200 LRLK AR2 ,DSPPTZ+DP6 M0039 0A90 60EB DATA 24811 
OA60 0367 M0040 OA91 62F 1 DATA 25329 
LO195 OA61 2024 LAC XB M0041 OA92 64E8 DATA 25832 
L0196 OA62 6000 SACL Xl * fetch X7 M0042 0A93 66CF DATA 26319 
L0197 OA63 2025 LAC YB M0043 0A94 68A6 DATA 26790 
LO198 OA64 6001 SACL Y} * fetch Y7 M0044 0A95 6A6D DATA 27245 
LO199 OA65 20A0 LAC e+ M0045 0A96 6C23 DATA 27683 
LO200 OA66 6002 SACL X2 * fetch X4 M0046 0A97 6DC9 DATA 28105 
L0201 OA67 20A0 LAC + M0047 OA98 6F5E DATA 28510 
LO202 OA68 6003 SACL Y2 * fetch Y4 M0048 0A99 70E2 DATA 28898 
L0203 OA69 726F SAR AR2, SAVAR2 M0049 OA9SA 7254 DATA 29268 
L0204 OA6A FE8O CALL LINE * draw line connecting 7 & 4 M0050 OA9B 73B5 DATA 29621 
OA6B 0524 M0051 OA9SC 7504 DATA 29956 
L0205 OA6C CE26 RET M0052 OA9D 7641 DATA 30273 
0013 COPY SINTBL.ASM M0053 OA9SE 776B DATA 30571 
M0054 OASF 7884 DATA 30852 
M0055 OAAO 7989 DATA 311413 
M0056 OAA! 7A7C DATA 31356 
MQ057 OAA2 7B5C DATA 31580 
M0058 OAA3 7C29 DATA 31785 
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40060 
M006! 
M0062 
M0063 
M0064 
M0065 
M0066 
M0067 
M0068 
M0069 
M0070 
M0071 
M0072 
M0073 
M0074 
M0075 
M0076 
M0077 
M0078 
M0079 
M0080 
M0081 
M0082 
M0083 
M0084 
M0085 
M0086 
M0087 
M0088 
M0089 
M0090 
M0091 
M0092 
M0093 
M0094 
M0095 
M0096 
M0097 
40098 
M0099 
M0100 
M0101 
M0102 
M0103 
M0104 
M0105 
M0106 
MO107 
M0108 
M0109 
M0110 
MO111 
MO112 
M0113 
MOI14 
M0115 
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TCE3 
7089 
7E10 
7E9C 
7F09 
7F61 
7F A6 
7FD8 
TFF5 
TFFF 
TFF5 
7FD8 
7F A6 
7F61 
7F 09 
TE9C 
7E10 
7089 
7CE3 
7¢29 


COSTBL 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


31971 
32137 
32285 
32412 
32521 
32609 
32678 
32728 
32757 
32767 
32757 
32728 
32678 
32609 
32521 
32412 
32285 
32137 
31971 
31785 
31580 
31356 
31113 
30852 
30571 
30273 
29956 
29621! 
29268 
28898 
28510 
28105 
27683 
27245 
26790 
26319 
25832 
25329 
24811 
24279 
23731 
23170 
22594 
22005 
21403 
20787 
20159 
19519 
18868 
18204 
17530 
16846 
16151 
15446 
14732 
14010 
13279 


* 90 degrees 


* 135 degrees 
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M0116 
MOLI? 
M0118 
M0119 
M0120 
M0121 

MO122 
M0123 
M0124 
M0125 
M0126 
M0127 
M0128 
M0129 
M0130 
M0131 

M0132 
M0133 
M0134 
M0135 
M0136 
40137 
M0138 
M0139 
M0140 
MOI41 

M0142 
M0143 
M0144 
MO145 
M0146 
M0147 
M0148 
M0149 
M0150 
M0151 

M0152 
M0153 

M0154 
MOI5S 
M0156 
M0157 
M0158 
M0159 
M0160 
M0161 

M0162 
M0163 

M0164 
M0165 
M0166 
M0167 
M0168 
MOI69 
M0170 
MOI71 

M0172 


OADD 
OADE 
OADF 
OAEO 
OAE! 
OAE2 
OAE3 
OAE4 
OAES 
OAE6 
OAE7 
OAES 
OAE9 
OAEA 
OAEB 
OAEC 
OAED 
OAEE 
OAEF 
OAFO 
OAF | 
OAF2 
OAF3 
OAF 4 
OAF5 
OAF6 
OAF7 
OAF8 
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30FB 
2E11 
2BI1F 
2826 
2528 
2223 
IFIA 
1COB 
18F9 
15E2 
12C8 
OF AB 
ocec 
096A 
0648 
0324 
0000 
FCOC 
F9B8 
F696 
F374 
F055 
E038 
EAIE 
E707 
E3F5 
EOE6 
DDOD 
DADS 


DATA 
DATA 
OATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


12539 
11793 
11039 
10278 
9512 
8739 
71962 
7179 
6393 
5602 
4808 
4011 
3212 
2410 
1608 
804 
0 
-804 


* 180 degrees 


* 215 degrees 
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T™MS32020 -— TMS34061 DEMONSTRATION PAGE 0069 T™MS32020 - TMS34061 DEMONSTRATION PAGE 0070 
M0173 0B16 93DD DATA -27683 M0230 OB4F AAOB DATA -22005 
M0174 OBI7 9237 DATA ~-28105 M0231 OB50 AC66 DATA -21402 
M0175 0B18 90A2 DATA -28510 MOZ32 0B51 AECD DATA -20787 
M0176 OBI9 SFIE DATA -28898 M0233 OB52 B141 DATA -20159 
M0177 OBIA 8DAC OATA -29268 M0234 0B53 83C1 DATA ~19519 
M0178 OBIB 8C4B DATA -29621 M0235 0B54 B64D DATA -18867 
M0179 OBIC BAFC DATA -29956 M0236 0855 B8E4 DATA -18204 
M0180 OBI0D 89BF DATA -30273 M0237 OB56 BB86 DATA ~17530 
M0181 OBIE 8895 DATA -30571 M0238 0857 BE32 DATA -16846 
M0182 OBIF 877C DATA -30852 M0239 0B58 COE9 DATA -16151 
M0183 O0B20 8677 DATA -31113 M0240 0B59 C3AA DATA -15446 
M0184 0821 8584 DATA ~31356 M0241 OBSA C674 DATA -14732 
M0185 OB22 84A4 DATA -31580 M0242 OBSB C946 DATA -14010 
M0186 0823 8307 DATA -31785 M0243 OBSC CC21 DATA -13279 
M0187 0824 831D OATA -31971 M0244 0B5D CF05 DATA -12539 
M0188 0B25 8277 DATA -32137 M0245 OBSE DIEF DATA -11793 
M0189 0826 BIE3 DATA ~32285 M0246 OBSF D4E1 DATA -11039 
M0190 OB27 8164 DATA -32412 M0247 0B60 D7DA DATA -~10278 
M0191 O0B28 80F7 DATA -32521 M0248 0B6!1 DADS DATA -9512 
M0192 0B29 809F DATA ~-32609 M0249 0B62 DDDD DATA -8739 
M0193 OB2A 805A DATA -32678 M0250 0B63 EOE6 DATA -7962 
M0194 0B2B 8028 DATA ~-32728 M025! 0B64 E3F5 DATA -7179 
M0195 OB2C 80UB DATA ~32757 MO252 0B65 E707 DATA -6393 
M0196 OB2D 8001 DATA -32767 M0253 OB66 EAIE DATA -5602 
M0197 OB2E 8008 DATA -32757 M0254 0867 E038 DATA ~-4808 
M0198 OB2F 8028 DATA -32728 M0255 0B68 F055 DATA -4011 
M0199 0830 805A DATA -32678 * 270 degrees 180 degrees M0256 0B69 F374 DATA -3212 
M0200 0831 809F DATA -~32609 M0257 OB6A F696 DATA -2410 
M020! 0832 80F7 DATA -32521 M0258 OB6B F9B8 DATA -1608 
M0202 0833 8164 DATA -32412 M0259 OB6C FCDC ENDTBL DATA -804 
M0203 0B34 81E3 DATA -32285 0014 COPY ROM. ASM 
M0204 0835 8277 DATA -32137 

M0205 0836 8310 DATA -31971 

M0206 0B37 83D7 DATA ~31785 

M0207 0B38 84A4 DATA -31580 

M0208 0B39 8584 DATA -31356 

M0209 OB3A 8677 DATA -31113 

M0210 083B 877C DATA -30852 

M0211 0B3C 8895 DATA -30571 

M0212 0830 89BF DATA ~30273 

M0213 OB3E SAFC DATA ~-29956 

M0214 OB3F 8C4B DATA ~-29621 

M0215 0B40 8DAC DATA ~-29268 

M0216 OB41 8FIE DATA -28898 

M0217 0B42 90A2 DATA ~-28510 

M0218 0843 9237 DATA -28105 

M0219 0844 9300 DATA -27683 

M0220 0B45 9593 DATA -27245 

M0221 0B46 975A DATA -26790 

M0222 0B47 9931 DATA -26319 

M0223 0848 9818 DATA -25832 

M0224 0B49 SDOF DATA -25329 

M0225 OB4A 9F15 DATA ~24811 

M0226 OB4B A129 DATA -24279 

M0227 OB4C A340 DATA -23731 

M0228 OB4D AS57E DATA ~23170 * 305 degrees 215 degrees 

M0229 OB4E A7BE DATA -22594 


IOOPESWLL PUR OZOZESWL 2 Buis~, UONeaWZ]dwY soIydeIn Y “¢Z 


€L9 


GRAPHIC 32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 15:17:30 12-05-85 GRAPHIC 32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 15:17:30 12-05-85 


TS32020 - TMS34061 DEMONSTRATION PAGE 0071 T™S32020 - TMS34061 DEMONSTRATION PAGE 0072 
NO002 * 00002 * 
NO003 TRAE RERASESE SESE REESE RES SE ESE EE SESE ERE SESE ESSE ESE EEE EES SEES EY 00003 HAKEKHHHEHERAAHHEHKHHHEKHKHHHHHKRKRHHHKHKEHEEHHHEHRRERERKRHREHHEREREEE 
NO004 . ROM - RAM DATA INITIALIZATION 00004 * 
NOO05 PRE RRSP EERSTE ESSEC ERC CSE ESSEC ESE CECE ESE TE eSEEES | 00005 * DEMONSTRATION COMMAND STRING 
NO0006 . 00006 * 
NO00O7 0860 O2CF ROMDAT DATA 719 XMAX 00007 “ This file provides a series of data statements which are 
NOO08 OB6E 0128 DATA 299 YMAX 00008 * used to control the graphics demonstration for the 
NOO009 OB6F 0095 DATA 149 AMAX 00009 * TMS32020-VSC system. The data statements are used to 
NOO!10 OB70 002A DATA 42 AMIN 00010 * arbitrarily select the various fundamental algorithms 
NOO!! OB71 006! DATA 97 BMAX 00011 * (pixel, lfnes, ellipses, fills, etc.) along with the 
NOO12 OB72 0017 DATA 23 BMIN 00012 * appropriate data to generate a video graphics 
N0013 * 00013 * demonstration. The data word 32020 in decimal is used as 
NOO14 0B73 4000 VRTICS DATA >4000 X0,Y0,Z0 00014 * a synchronization value at the start and between commands 
NOO15 0874 C000 DATA >C000 00015 * to allow recovery from errors in entering the data 
NO016 0B75 C000 DATA >C000 00016 * sequence. The synchronization word is followed by a 
NO017 0876 4000 DATA >4000 Xi,Y1,Z1 00017 * single command word selecting the appropriate graphics or 
NOO18 0877 4000 DATA >4000 00018 * control function and as many words as necessary to provide 
NO019 0878 C000 DATA >C000 00019 * all of the required information to the selected function. 
NO0020 0B79 C000 DATA >C000 X2,Y¥2,Z2 00020 * 
NOO021 OB7A 4000 DATA >4000 0002 1 SHRHKARHHSKHHSHHHHHHEHHHHHSHHEHHHHHHKHKENRHSHREKRHEEHHEERHREHREHHREEE 
NOO022 0B7B C000 DATA >c000 00022 " 
N0023 OB7C C000 DATA >C000 X3,Y3,2Z3 00023 0898 7D14 BEGIN DATA 32020 
NO024 0B70 C000 DATA >C000 00024 OBSC 0002 DATA >0002 CLEAR SCREEN 
NO025 OB7E C000 DATA >C000 00025 0890 0000 DATA >0000 
NO026 OB7F 4000 DATA >4000 X4,Y4,24 00026 7 
NOO27 OB80 C000 DATA >C000 00027 OB9E 7014 DATA 32020 ROTATING CUBE 
N0028 OBS! 4000 DATA >4000 00028 OB9SF 0016 DATA >0016 
N0029 OB82 4000 DATA >4000 X5,Y5,Z5 00029 * 
N0030 0883 4000 DATA >4000 00030 OBAS 7014 DATA 32020 
NO031 0B84 4000 DATA >4000 00031 OBA! 0002 DATA >0002 CLEAR SCREEN 
N0032 OB85 C000 DATA >C000 X6,Y6,Z6 00032 OBA2 0000 DATA >0000 
NO033 O0B86 4000 DATA >4000 00033 © 
N0034 0887 4000 DATA >4000 00034 O0BA3 7014 DATA 32020 
NO035 OB88 C000 DATA >c000 X7,Y7,Z7 00035 OBA4 000A DATA >000A SPIN FILLED ELLIPSE 
NO036 0B89 C000 DATA >»c000 00036 OBAS 0168 DATA 360 
NO037 OB8A 4000 DATA >4000 00037 OBA6 O0A0 DATA 160 
N0038 * 00038 OBA7 006C DATA 108 
NO039 OB8B E666 CMTRX DATA »E666 C11,C21,C31,C41 00039 OBAB 003C DATA 60 
NO040 OBSC 1333 DATA >1333 00040 OBA9 0201 DATA >0201 
NO041 OB8D 0000 DATA 0 00041 OBAA 0010 DATA >0010 
NO042 OBBE 0000 DATA 0 00042 - 
NO043 OB8F 0885 DATA >0B85 €12,C22,C32,C42 00043 OBAB 7014 DATA 32020 
NO044 0890 OFSC DATA >OF SC 00044 OBAC 0009 DATA >»0009 FILLED ELLIPSE 
NO045 0891 £666 DATA »E666 00045 OBAD 01F3 DATA 499 
NO046 0B92 0000 DATA 0 00046 OBAE 0090 DATA 144 
N0047 0B93 FC29 DATA »FC29 €13,C23,C33,C43 00047 OBAF 0O00C DATA 220 
NO048 0B94 FAE1 DATA >FAE1 00048 OBBO 0073 DATA 115 
NO049 0895 FB33 DATA >»FB33 00049 OBB! 0005 DATA >0005 
NO050 0B96 6400 DATA >6400 00050 . 
NOO51 * 00051 O08B2 7014 DATA 32020 
NO052 0897 2880 SCALE DATA 162*64 VSX 00052 0BB3 0009 DATA >0009 FILLED ELLIPSE 
NO0S3 0B98 0167 DATA 359 VCX 00053 OBB4 OIFD DATA 509 
NO054 0B99 1680 DATA 90*64 VSY 00054 OBBS 0103 DATA 259 
NO055 OB9A 0090 DATA 157 VCY 00055 OBB6 00C8 DATA 200 
0015 COPY CONTROL . ASM 00056 OBB7 OOIE DATA 30 
00057 OBBS 0006. DATA >0006 
00058 is 


plo 
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00059 
00060 
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00075 
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00077 
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00079 
00080 
00081 

00082 
00083 
00084 
00085 
00086 
00087 
00088 
00089 
00090 
00091 

00092 
00093 
00094 
00095 
00096 
00097 
00098 
00099 
00100 
00101 

00102 
00103 
00104 
00105 
00106 
00107 
00108 
00109 
00110 
OO1L 

00112 
00113 

00114 
00115 


0889 
OBBA 
0668 
OBBC 
0BBD 
OBBE 
OBBF 


0BCO 
OBC! 
O0BC2 
0BC3 
O0BC4 
OBC5 
OBC6 


0BC7 
oBCc8 
O0BC9 
OBCA 
06CB 
oBcCc 
0BCD 


OBCE 
OBCF 
0800 
0801 
OBD2 
0803 
0604 


0BD5 
OBD6 
0807 
0BD8 
0809 
OBDA 
0B0B 


O0B0C 
0B0D 
OBDE 
OBOF 
OBEO 
OBE | 
OBE2 


OBE3 
OBE4 
OBES 
OBE6 
OBE7 
OBES 
OBES 


OBEA 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 
T™MS34061 DEMONSTRATION 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


DATA 
DATA 


FILLED ELLIPSE 


FILLED ELLIPSE 
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00122 
00123 
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00134 
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00140 
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00143 
00144 
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00146 
00147 
00148 
00149 
00150 
00151 

00152 
00153 
00154 
00155 
00156 
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00158 
00159 
00160 
00161 

00162 
00163 
00164 
00165 
00166 
00167 
00168 
00169 
00170 
00171 
00172 


OBEB 
OBEC 
OBED 
OBEE 
OBEF 
OBFO 


OBF 1 
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0009 
029E 
OOA5 
0014 
000c 
0003 


7014 
0009 
02C6 
0089 
000A 
0006 
0003 


7014 
0009 
O06E 
OOFA 
0050 
0020 
0004 


7014 
0001 
012C 


7014 
0002 
0000 


7014 
0008 
0167 
0095 
0018 
0096 
0006 
0018 
FFF6 
OOOF 


7014 
0001 
o12Cc 


7014 
0002 
0000 


7014 
000Cc 
O16F 
006] 
0055 
OO4A 
0601 
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>0001 
60*5 


32020 
>0002 
>0000 


32020 
>000C 
367 
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FILLED ELLIPSE 
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00204 
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00206 
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00213 
00214 
00215 
00216 
00217 
00218 
00219 
00220 
00221 
00222 
00223 
00224 
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00226 
00227 
00228 
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Ocic 
0C10 
OCIE 
OCIF 
0c20 


0C21 
0C22 
0C23 


0c24 
0C25 
0C26 


0C27 
0C28 
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0013 DATA 
FFF5 DATA 
0028 DATA 
FFES DATA 
0960 DATA 
* 
7014 DATA 
0001 DATA 
012C DATA 
* 
7014 DATA 
0002 DATA 
0000 DATA 
# 
7014 DATA 
001! DATA 
7014 DATA 
0001 DATA 
012C DATA 
* 
7014 DATA 
0002 DATA 
0000 DATA 
* 
7014 DATA 
0012 DATA 
* 
T1014 DATA 
0001 DATA 
012C DATA 
Ld 
7014 DATA 
0002 DATA 
0000 DATA 
# 
71014 DATA 
0013 DATA 
* 
7014 DATA 
0001 DATA 
o12c DATA 
* 
7014 DATA 
0002 DATA 
0000 DATA 
# 
7014 DATA 
0014 DATA 
# 
7014 DATA 
0001 DATA 
012C DATA 
; * 
7014 DATA 
0002 DATA 
0000 DATA 


19 
-hi 
40 
-72 


300*E . 
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>0001 
60*5 


32020 
>0002 
>0000 


32020 
0011 


32020 
>0001 
60*5 
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>0001 
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>0013 


32020 
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>0000 


32020 
>»0014 


32020 
>0001 
60*5 
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>0002 
>0000 
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CLEAR SCREEN 


LINE DEMO | 


DELAY: 


CLEAR SCREEN 


LINE DEMO 2 


DELAY 
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LINE DEMO 3 


DELAY 
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LINE DEMO 4 


DELAY 
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00256 
00257 
00258 
00259 
00260 

0016 


0C47 
0C48 


0c49 
OC4A 
0C4B 


0Cc4C 
0c 4D 


OC4E 
OC 4F 


0c50 
0c51 


0c52 
0C53 


0C54 
0C55 


0C56 
0C57 


0C58 
0c59 


OC5A 
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7014 DATA 

0015 DATA 
# 

71014 DATA 

0001 DATA 

012C DATA 
* 

7014 DATA 

0000 DATA 
* 

7D14 DATA 

0000 DATA 
* 

7014 DATA 

0000 DATA 
* 

7014 DATA 

0000 DATA 
* 

71014 DATA 

0000 DATA 
# 

7014 DATA 

0000 DATA 
* 

7014 DATA 

0000 DATA 
* 

7014 DATA 

0000 DATA 

COPY 


32020 
>0015 


32020 
>0001 
60*5 


32020 
>0000 


32020 
>0000 


32020 
>0000 


32020 
>0000 


32020 
>0000 


32020 
>0000 


32020 
>0000 


32020 
>0000 
RAMB2 . ASM 
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PAGE 0076 


LINE DEMO 5 


DELAY 


RESET 


RESET 


RESET 


RESET 


RESET 


RESET 


RESET 


RESET 


9L9 
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GRAPHIC 


P0002 
P0003 
P0004 
P0005 
P0006 
P0007 
P0008 
P0009 
P0010 
POOL! 
P0012 
P0013 
P0014 
P0015 
POOI6 
POO!7 
P0018 
POOIS 
P0020 
P0021 
P0022 
P0023 
P0024 
P0025 
P0026 
P0027 
P0028 
P0029 
P0030 
P0031 
P0032 
P0033 
P0034 
P0035 
P0036 
P0037 
P0038 
P0039 
P0040 
P0041 
P0042 
P0043 
P0044 
P0045 
P0046 
P0047 
P0048 
P0049 
P0050 
PO005! 
P0052 
P0053 
P0054 
P0055 
P0056 
P0057 
P0058 


0000 


0000 


0001 


0002 


0003 


0004 


0005 


0060 


0060 


0061 


0062 


0063 


0064 


0065 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 
TMS32020 - TMS34061 DEMONSTRATION 


ocsc 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 
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* 

(ARR SRESSEEEE SEE ES SEE ESE SSE ERE ESET ER RSE ESAS EERE REESE SY 
" RAM - MEMORY-MAPPED REGISTERS AND BLOCK B2 

(ARR SESESEERE REESE RSE SE SES REET ERR ERS EES ERS ERE EEE SE SERS SE SY 
# 


. LEGEND FOR INTERNAL UTILIZATION 
* 
* I = Initialization 
* P = Parser and demo setup routines 
* C = Clear screen routines 
* L = Line drawing routines 
¥ E = Ellipse drawing routines 
bs R = Rotating cube routines 
* 
OPO EQU $ I P Cet i€E R 
* 
DORG >0000 


* Memory-Mapped Register #00 
DRR DATA 0 * Serial port data receive register 


* 


* Memory-Mapped Register #01 
DXR DATA 0 * Serial port data transmit registe 
* 


* Memory-Mapped Register #02 
TIM DATA 0 * Timer register 
2 


* Memory~Mapped Register #03 
PRD DATA 0 * Period register 
* 


* Memory-Mapped Register #04 
IMR OATA 0 * Interrupt mask register 
* 


* Memory-Mapped Register #05 


GREG DATA 0 * Global memory allocation register 
* 
* 
DORG >0060 
# 
* RAM Location #0060 C L OE 
STSO DATA 0 
* 
* RAM Location #006! 
STS] DATA 0 
* 
* RAM Location #0062 P C 
ONE DATA 0 
* 
* RAM Location #0063 C 
FILLC DATA 0 
* 
* RAM Location #0064 C E 
SAVREG DATA 0 
# 
* RAM Location #0065 C E 
DATA 0 


GRAPHIC 
TMS32020 - 


P0059 
P0060 
P0061 
P0062 
P0063 
P0064 
P0065 
P0066 
P0067 
P0068 
P0069 
P0070 
P0071! 
P0072 
P0073 
P0074 
POO75 
P0076 
P0077 
P0078 

0017 


0066 


0067 


0068 


0069 


006A 


0068 


006C 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 


TMS34061 DEMONSTRATION 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


* RAM Location 
DATA 

* 

* RAM Location 
DATA 

* 

* RAM Location 

TMP 1 DATA 

* 


* RAM Location 
SYNCH DATA 
# 


* RAM Location 
TIMDLY DATA 

* 

* RAM Location 
CMND DATA 

* 


* RAM Location 
CMDADR DATA 
COPY 


#0066 
0 


#0067 
0 


#0068 
0 


#0069 
0 

#O06A 
0 


#006B 
0 


#006C 
0 
RAMBO .ASM 


15:17:30 
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LL9 


GRAPHIC 
TMS32020 - 


Q0002 
Q0003 
Qo0004 
Q0005 
Q0006 
Q0007 
Qo008 
Q0009 
Q0010 
Q0067 
Q0068 
Q0069 
Q0070 
Q0071 
Q0072 
Q0073 
Q0074 
Q0075 
Q0076 
Q0077 
Q0078 
Q0079 
Q0080 
Q0081 
Q0082 
Q0083 
Q0084 
Q0085 
Q0086 
Q0087 
Q0088 
Q0089 
Q0090 
Q009! 
Q0092 
Q0093 
Q0094 
Q0095 
Q0096 
Q0097 
Q0098 
Q0099 
Q0100 
QO101 
Q0102 
Q0103 
Q0104 
Q0105 
Q0106 
Q0107 
Q0108 
Q0109 
Q0110 
Q0111 
QO112 
Q0113 
Q0114 


0200 


0000 


0012 


0013 


0014 


0015 


0016 


0017 


0018 


0012 


OO1A 


0018 


OOIC 


001D 


OOIE 


OOIF 


0020 


0021 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 15:17:30 12-05-85 
T#S3406!1 DEMONSTRATION PAGE 0079 
* 
SHPHEHHHHTHHHHTHEHERHPEHERHRHREEE HERRERO REHERER EERE R ERE 
* RAM - BLOCK 80 (PAGES 4 AND 5) 


0200 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000" 


0000 


0000 


0000 


0000 


0000 


AHSHHHSHHHKESHHHEHHKEHHHHHHHEHKHHKAHKEHHEHHERHHRHEHEREREEH EEE EHH 
* 


DORG >80*4 


OP4 EQU $ J P cC lL E R 
# 
DORG >0000 E 

tt 

* RAM Location #0212 R 
R11 DATA 0 

* 

* RAM Location #0213 R 
R21 DATA 0 

# 

* RAM Location #0214 R 
R12 DATA 0 

* 

* RAM Location #0215 R 
R22 DATA 0 

* 

* RAM Location #0216 R 
Cil DATA 0 

* 

* RAM Location #0217 R 
C21 DATA 0 

* 

* RAM Location #0218 R 
C31 DATA 0 

* 

* RAM Location #0219 R 
C4) DATA 0 

# 

* RAM Location #021A R 
Cl2 DATA 0 

« 

* RAM Location #021B R 
C22 DATA 0 

* 

* RAM Location #021C R 
C32 DATA 0 

Ad 

* RAM Location #0210 R 
C42 DATA 0 

* 

* RAM Location #021E R 
C13 DATA 0 

# 

* RAM Location #021F R 
C23 DATA 0 

* 

* RAM Location #0220 R 
C33 DATA 0 

# 

* RAM Location #0221 R 
C43 DATA 0 


GRAPHIC 
TMS32020 - 


QO115 
Q0116 
QOo1i7 
Qos 
QO119 
Q0120 
Qo12! 
Q0d122 
Q0123 
Q0124 
Q0125 
Q0126 
Q0127 
Q0128 
Q0129 
Q0130 
Q0131 
Q0132 
Q0133 
Q0134 
Q0135 

0018 


0022 


0023 


0024 


0025 


0026 


0027 


0028 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 


T™™S3406! DEMONSTRATION 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


* RAM Location 
VSX DATA 

* 

* RAM Location 
VCX DATA 

* 

* RAM Location 
VSY DATA 

* 

* RAM Location 
vCcY DATA 

* 


* RAM Location 
SIN DATA 

* 

* RAM Location 
cos DATA 

* 


* RAM Location 
THETA DATA 
COPY 


#0222 
0 


#0223 
0 


#0224 
0 


#0225 
0 


#0226 
0 


#0227 
0 


#0228 
0 
RAMB1 .ASM 
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GRAPHIC 
TMS32020 - 


RO002 
RO003 
ROOO4 
ROOO5 
RO006 
R0007 
R0008 
RO0009 
ROO10 
ROOL! 
ROOI2 
ROO13 
ROO14 
ROO15 
ROO16 
ROO!7 
RO018 
ROO19 
R0O020 
R002 1 
ROO22 
R0023 
RO0024 
R0025 
R0026 
RO0027 
RO028 
R0029 
RO030 
ROO31 
RO032 
R0033 
R0034 
ROO35 
RO036 
RO037 
RO038 
RO039 
ROO40 
RO041 
RO042 
R0043 
R0044 
RO045 
R0046 
R0O047 
R0048 
RO049 
ROO050 
ROO51 
ROOS2 
R0053 
ROO0S4 
RO055 
RO056 
RO0S7 
ROO58 


0300 


0000 


0000 
0000 


0001 
0001 


0002 


0003 


0004 


0005 


0006 


0007 


0008 


0009 


000A 


0008 


000C 


0000 


OO0E 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 


TMS34061 DEMONSTRATION 


0300 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


SHSKHHSKHKHKHSHEAHHEKHREKAKAHHHEHEHHKEHRHHHEKEHRHHHEHEHREHEHEE EE EE OD 


. RAM - BLOCK Bl (PAGES 6 AND 7) 


HHHHHHHSAHHHHHHSTHHHSHEHHERHHHEKHHTHEHHEEHEHEHHREHHEEHHEHHE EES 


DORG >80"6 
DP6 EQU $ 
* 


DORG >0000 


* RAM Location #0300 


XP BSS 0 

X] DATA 0 

# 

* RAM Location #0301 
YP BSS 0 

Yl DATA 0 


* RAM Location #0302 
X2 DATA 0 


* RAM Location #0303 
Y2 DATA 0 


* RAM Location #0304 
COLOR DATA 0 


* RAM Location #0305 
Ox DATA 0 


RAM Location #0306 
Y DATA 0 


RAM Location #0307 
DATA 0 


RAM Location #0308 
INCRI DATA 0 


* RAM Location #0309 
INCR2 DATA 0 
* 


* RAM Location #030A 
COLORL DATA 0 
* 


* RAM Location #0308 
COLORH DATA 0 
# 


* RAM Location #030C 
COLOR2 DATA 0 
* 


* RAM Location #0300 
BFLAG DATA 0 

# 

* RAM Location #030€ 
ASIGN DATA 0 

# 


GRAPHIC 
T™MS32020 - 


ROO59 
RO060 
R006! 
RO062 
R0063 
R0064 
R0065 
R0066 
ROO67 
R0068 
RO069 
R0070 
ROO71 
ROO72 
RO073 
ROO74 
RO0075 
RO076 
ROO77 
R0078 
R0079 
R0080 
R008} 
RO082 
RO083 
RO0084 
RO085 
ROO86 
R0087 
R0088 
ROO089 
ROO090 
R009] 
RO092 
RO093 
RO094 
ROO95 
R0096 
R0097 
R0098 
RO099 
RO100 
ROO! 
RO102 
RO103 
RO104 
RO105 
RO106 
RO107 
RO108 
RO109 
ROLIO 
RO111 
RO112 
RO113 
RO114 
ROIS 


OOOF 


0010 


0011 


0012 
0012 


0013 
0013 


0014 
0014 


00!5 


0016 


0017 


0018 


0019 


OOIA 


0018 


OOIC 


0010 


OO1E 


OOIF 


0020 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 


T™MS34061 DEMONSTRATION 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


* RAM Location #030F 
ABSIGN DATA 0 
# 


* RAM Location #0310 


XC DATA 0 

* 

* RAM Location #0311 
YC DATA 0 

* 

* RAM Location #0312 
zc BSS 0 

A DATA 0 

* 

* RAM Location #0313 
ZCI BSS 0 

B DATA 0 

* 

* RAM Location #0314 
ZCSIGN BSS 0 
TIH DATA 0 

* 


* RAM Location #0315 
TIL DATA 0 
* 


* RAM Location #0316 
T2H DATA 0 
* 


* RAM Location #0317 
T2e DATA 0 
* 


* RAM Location #0318 
T3H DATA 0 

+ 

* RAM Location #0319 
T3L DATA 0 

* 


* RAM Location #031A 
T4H DATA 0 
* 


* RAM Location #031B 
T4L DATA 0 
* 


* RAM Location #031C 
TSH DATA 0 

* 

* RAM Location #031D 
TSL DATA 0 

hl 

* RAM Location #031E 
T6H DATA 0 

* 


* RAM Location #031F 
T6L DATA 0 
* 


* RAM Location #0320 
T7H DATA 0 
* 
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GRAPHIC 32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 15:17:30 12-05-85 GRAPHIC 32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 15:17:30 12-05-85 
™S32020 -— TMS34061 DEMONSTRATION PAGE 0083 ™S32020 - TMS34061 DEMONSTRATION PAGE 0084 
RO116 * RAM Location #0321 RO173 * RAM Location #033! 
RO117 0021 0000 TTL DATA 0 RO174 0031 0000 SAVCLR DATA 0 
RO118 bs RO175 « 

RO1L19 ® RAM Location #0322 R RO176 * RAM Location #0332 
RO120 0022 x BSS 0 RO!77 0032 0000 SAVEA DATA 0 
RO121 0022 0000 XA DATA 0 RO178 * 

RO122 * RO179 * RAM Location #0333 
RO123 * RAM Location #0323 R RO180 0033 0000 CLRSAV DATA 0 
RO124 0023 Y BSS 0 RO181 « 

RO125 0023 0000 YA DATA 0 RO182 * RAM Location #0334 
RO126 * RO183 0034 0000 SPEED DATA 0 
RO127 * RAM Location #0324 R R0184 * 

RO128 0024 Z BSS 0 RO185 * RAM Location #0335 
RO129 0024 0000 XB OATA 0 RO186 6035 0000 DA DATA 0 
RO130 . RO187 * 

RO131 * RAM Location #0325 R RO188 * RAM Location #0336 
RO132 0025 XR BSS 0 RO189 0036 0000 OB DATA 0 
RO133 0025 Xm BSS 0 RO190 * 

RO134 0025 0000 YB DATA 0 RO19! * RAM Location #0337 
RO135 * RO192 0037 0000 XMAX DATA 0 
RO136 * RAM Location #0326 R RO193 * 

RO137 0026 YR BSS 0 RO194 * RAM Location #0338 
RO138 0026 YM 6SS 0 RO195 0038 0000 XMIN DATA 0 
RO139 0026 0000 DH DATA 0 RO196 * 

RO140 * RO197 * RAM Location #0339 
RO141 * RAM Location $0327 R RO198 0039 0000 YMAX DATA 0 
RO142 0027 ZR BSS 0 RO199 * 

RO143 0027 4, BSS 0 RO200 * RAM Location #033A 
RO144 0027 0000 OL DATA 0 RO201 003A 0000 YMIN DATA 0 
RO145 id RO202 ® 

RO146 * RAM Location #0328 RO203 * RAM Location #0338 
RO147 0028 0000 XIH DATA 0 R0204 0038 0000 XMAXA DATA 0 
RO148 7 RO205 ° 

RO149 * RAM Location #0329 RO206 * RAM Location #033C 
RO150 0029 0000 XIL DATA 0 RO207 003C 0000 XMINA OATA 0 
RO151 * RO208 sd 

RO152 * RAM Location #032A R0209 * RAM Location #0330 
RO153 002A 0000 X2H DATA 0 RO210 0030 0000 YMAXB - DATA 0 
RO154 . RO2Z11 bd 

RO155 * RAM Location #0328 RO212 * RAM Location #033E 
RO156 002B 0000 X2t DATA 0 RO213 O0O03E 0000 YMINB- DATA 0 
RO157 2 RO214 * 

RO158 * RAM Location #032C RO215 * RAM Location #033F 
ROI59 002C 0000 PIXCNT DATA 0 RO216 003F 0000 AMAX DATA 0 
RO160 . RO217 bd 

RO161 * RAM Location #032D RO218 * RAM Location #0340 
RO162 002D 0000 PAXCNT DATA 0 RO219 0040 0000 AMIN DATA 0 
RO163 * RO220 * 

RO164 * RAM Location #032E R RO221 * RAM Location #0341 
RO165 O002E 0000 ONE6 DATA 0 RO222 0041 0000 BMAX DATA 0 
RO166 * R0223 * 

RO167 * RAM Location #032F R RO224 * RAM Location #0342 
ROI68 002F 0000 SAVARO DATA 0 RO225 0042 0000 BMIN DATA 0 
RO169 a RO226 * 

RO170 * RAM Location #0330 RO227 * RAM Location #0343 
RO171 0030 0000 NPOINT DATA 0 RO228 0043 0000 THREE6 DATA 0 
RO172 * RO229 * 


089 
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GRAPHIC 
TMS32020 - 


RO230 
RO231 
RO232 
RO233 
R0234 
RO235 
RO236 
R0237 
RO238 
RO239 
RO240 
RO241 
RO242 
R0243 
R0244 
R0245 
RO246 
R0247 
RO248 
RO249 
RO250 
RO0251 
RO252 
RO253 
RO254 
RO255 
RO256 
RO257 
R0258 
R0259 
RO260 
R026! 
RO262 
RO263 
R0264 
RO265 
R0266 
RO267 
RO268 
R0269 
RO270 
R027! 
RO272 
RO273 
RO274 
RO275 
RO276 
RO277 
R0278 
RO279 
RO280 
RO281 
RO282 
R0283 
R0284 
R0285 
R0286 


0044 
0045 
0046 


0047 
0047 


0048 
0049 
004A 
0048 
004C 
004D 
004E 
O004F 
0050 
0051 
0052 
0053 
0054 
0055 


0056 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 


TMS34061 DEMONSTRATION 


0000 
0000 


0000 


0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 
0000 


0000 


* RAM 
FIVE6 
e 


* RAM 
NINE6 
2 

* RAM 
SAVEB 


* RAM 


Location #0344 
DATA 0 


Location #0345 
DATA 0 


Location #0346 
DATA 0 


Location #0347 


NRMPTS BSS 0 


XNO00 
# 


* RAM 
YNOOO 
* 


* RAM 
ZNO00 
5 
* RAM 
XNOO] 
* 


* RAM 
YNOO! 
# 


* RAM 
ZNOOI 
* 


* RAM 
XNO002 


# 


.* RAM 


YNOO2 
# 


* RAM 
ZNO02 
i 


* RAM 
XN003 
2 
* RAM 
YNO03 
* 


* RAM 
ZN003 
* 


* RAM 
XN004 
# 


* RAM 
YNO04 
# 
* RAM 
ZN004 
* 


* RAM 
XNO005 


DATA 0 


Location #0348 
DATA 0 


Location #0349 
DATA 0 


Location #034A 
DATA 0 


Location #034B 
DATA 0 


Location #034C 
DATA 0 


Location #0340 
DATA 0 


Location #034E 
DATA 0 


Location #034F 
DATA 0 


Location #0350 
DATA 0 


Location #0351 
DATA 0 


Location #0352 
DATA 0 


Location #0353 
DATA 0 


Location #0354 
DATA 0 


Location #0355 
DATA 0 


Location #0356 
DATA 0 


GRAPHIC 
T™MS32020 - 


R0287 
RO288 
RO0289 
RO290 
RO291 
RO292 
R0293 
R0294 
RO0295 
RO296 
RO297 
R0298 
R0299 
RO300 
RO301 
RO302 
R0303 
R0304 
RO305 
RO306 
R0307 
R0308 
RO309 
RO310 
RO311 
RO312 
RO313 
RO314 
RO315 
RO316 
RO317 
RO318 
RO319 
R0320 
R032! 
RO322 
R0323 
R0324 
R0325 
R0326 
RO327 
R0328 
RO329 
RO330 
R0331 
RO332 
R0333 
R0334 
RO0335 
R0336 
RO337 
RO338 
RO339 
RO340 
RO341 
RO342 
R0343 


0057 


0058 


0059 


OO5A 


0058 


005Cc 


005D 


005E 


O05F 
OO5F 


0060 


0061 


0062 


0063 


0064 


0065 


0066 


0067 
0067 


0068 


32020 FAMILY MACRO ASSEMBLER PC !.0 85.157 


T™MS34061 DEMONSTRATION 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


* RAM Location 
YNOO5 DATA 
* 


* RAM Location 
ZNO05 DATA 

* 

* RAM Location 
XNO06 DATA 

* 


* RAM Location 
YNO06 DATA 
* 


* RAM Location 
ZNO06 DATA 
* 


* RAM Location 
XNO07 DATA 
* 


* RAM Location 
YNO07 DATA 
* 


* RAM Location 
ZNO07 DATA 
* 


* RAM Location 
DSPPTS BSS 
xD000 DATA 

* 


* RAM Location 
yDOO00 DATA 
* 


* RAM Location 
XD001 DATA 

* 

* RAM Location 
yooo! DATA 


* RAM Location 
xD002 DATA 
* 


* RAM Location 
YyD002 DATA 

* 

* RAM Location 
xD003 DATA 

* 


* RAM Location 
YD003 DATA 
* 


* RAM Location 
DSPPTZ BSS 
X0004 DATA 

* 

* RAM Location 
YDO04 DATA 


#0357 
0 


#0358 
0 


#0359 
0 


#035A 
0 


#035B 
0 


#035C 
0 


#035D 
0 


#035E 
0 


#035F 
0 
0 


#0360 
0 


#0361 
0 


#0362 
0 

#0363 
0 


#0364 
0 


#0365 
0 


#0366 
0 


#0367 
0 
0 


#0368 
0 
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189 


GRAPHIC 
TMS32020 - 


RO344 
RO345 
RO346 
R0347 
RO348 
RO349 
RO350 
RO351 
RO352 
RO353 
RO354 
RO355 
RO356 
RO357 
RO358 
RO359 
RO360 
RO361 
RO362 
R0363 
RO364 
RO365 
RO366 
RO367 
RO368 
RO369 

0019 


0069 


006A 


0068 


006C 


0060 


O006E 


OO6F 


0070 


0071 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 


T™MS3406!1 DEMONSTRATION 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


0000 


* RAM Location #0369 
XD005 DATA 0 

* 

* RAM Location #036A 
YD005 DATA 0 

* 


* RAM Location #036B 
XD006 DATA 0 
* 


* RAM Location #036C 
YDO06 DATA 0 
# 


* RAM Location #0360 
XD007 DATA 0 

* 

* RAM Location #036E 
YDOO7 DATA 0 

* 


* RAM Location #036F 
SAVAR2 DATA 0 
* 


* RAM Location #0370 
SAVAR3 DATA 0 

od 

* RAM Location #0371 
NUMR8R DATA 0 


COPY DATAMEM. ASM 
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™S32020 - TMS34061 DEMONSTRATION PAGE 0088 
$0002 , 

$0003 TRE RE REER EEE RSE EER REE ESR ES ESE RSE SESE SEE ERE ESSERE ESE RS SY 
$0004 * External Data-Memory Allocations 
$0005 IRE RRR ESERZSE EERE EEE E REESE SESS REESE AR EEE EERE REE REESE SEE ERE SS 
$0006 : 

$0007 0400 DORG >0400 

$0008 . 

$0009 0400 VSCREG 8SS 0 

$0010 . 

S0011i 0000 DORG 0 * data page 08 
$0012 , 

$0013 0000 HESYL BSS 8 

$0014 : 

$0015 0008 HESYH BSS 8 

$0016 7 

S0017 0010 HEBLL BSS 8 

$0018 : 

$0019 0018 HEBLH BSS 8 

$0020 . 

$0021 0020 HSBLL BSS 8 

so022 bs 

$0023 0028 HSBLH = 8SS 8 

$0024 * 

$0025 0030 HTOTL BSS 8 

$0026 * 

$0027 0038 HTOTH BSS 8 

$0028 * 

$0029 0040 VESYL BSS 8 

$0030 J 

$0031 0048 VESYH BSS 8 

$0032 * 

$0033 0050 VEBLL BSS 8 

$0034 * 

$0035 0058 VEBLH BSS 8 

$0036 . 

$0037 0060 VSBLL BSS 8 

$0038 . 

$0039 0068 VSBLH BSS 8 

$0040 : 

$0041 0070 VTOTL BSS 8 

$0042 * 

$0043 0078 VTOTH BSS 8 

$0044 : 

$0045 0000 DORG 0 * data page 09 
$0046 * 

$0047 0000 DAOUPL 8SS 8 

$0048 ic 

$0049 0008 DADUPH BSS 8 

$0050 a. 

$0051 0010 DADSL BSS 8 

$0052 * 

$0053 0018 DADSH BSS 8 

$0054 7 

$0055 0020 VINTL BSS 8 

$0056 . 

$0057 0028 VINTH BSS 8 

$0058 * 


c89 
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GRAPHIC 


TMS32020 - TMS34061 DEMONSTRATION 


$0059 
$0060 
$0061 
$0062 
$0063 
$0064 
$0065 
$0066 
$0067 
$0068 
$0069 
$0070 
$0071 
$0072 
$0073 
$0074 
$0075 
$0076 
$0077 
$0078 
$0079 
$0080 
$0081 
$0082 
$0083 
$0084 
$0085 
$0086 
$0087 
$0088 


0030 
0038 
0040 
0048 
0050 
0058 
0060 
0068 
0070 
0078 
0000 
0000 
0008 
0010 


0018 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 


CTLRIL 


CTLRIH 


CTLR2L 


CTLR2H 


STATL 


STATH 
# 

XYOF FL 
# 
XYOFFH 
* 
XYADRL 
# 


XYADRH 
# 


BSS 
BSS 
BSS 
BSS 
BSS 
BSS 
BSS 
BSS 
BSS 
BSS 
DORG 
BSS 
BSS 
BSS 
BSS 


o & @ 


* data page OA 


15:17:30 12-05-85 
PAGE 0089 


GRAPHIC 


$0090 
S009! 
$0092 
$0093 
$0094 
$0095 
$0096 
$0097 
$0098 
$0099 
$0100 
$0101 
$0102 
$0103 
$0104 
$0105 


0800 


0800 


0000 


0000 


0008 


0010 


0018 


0020 


0028 


0030 


0038 


0040 


0048 


0050 


0058 


0060 


0068 


0070 


0078 


* 


XYNOP 
* 
XINC 
* 
XDEC 
t 
XCLR 
J 
YINC 


XIYI 


XDYI 


XCY1 
* 
YDEC 
* 
X1YO 
* 
XDYD 
# 
XCYO 
* 
YCLR 
* 
XIYC 
* 
XDYC 
* 


XCYC 


NO ERRORS, NO WARNINGS 


DORG 
BSS 
DORG 
BSS 
BSS 
BSS 
BSS 
BSS 
BSS 
BSS 
BSS 
8SS 
BSS 
BSS 
BSS 
BSS 
BSS 
BSS 


BSS 
END 


>0800 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 
TMS32020 - TMS34061 DEMONSTRATION 


* data page 10 
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£89 


GRAPHIC 
LABEL 


A 
ABSIGN 
ALDONE 


CLEAR 

CLOOP | 
CLRSAV 
CLRSCR 
CMDADR 


CMDLEN 
CMDTBL 
CMND 
CMTRX 
COLOR 


COLOR2 


COLORH 
COLORL 


cos 
COSTBL 
CTLOOP 
CTLRIH 
CTLRIL 
CTLR2H 
CTLR2L 
CUBER 
CUBER2 


32020 FAMILY MACRO ASSEMBLER 
REFERENCES 


VALUE 


0012 
0O0F 
086A 
003F 
0040 
OO0E 
0013 


0B9B 
0000 
0041 
0042 
0016 


DEFN 


ROO70 
RO060 
G0402 
RO216 
RO219 
R0057 
ROO74 


00023 
ROOS4 
RO222 
RO225 
Q008 1 
Q0093 
Qo105 
Q0084 
Q0096 
Q0108 
Q0087 
Q0099 
Qolil 
Q0090 
Q0102 
QO1i4 
£0043 
HO102 
RO180 
C0107 
P0078 


C0072 
C0049 
P0075 
N0039 
RO0O027 


ROO5! 


R0048 
ROO45 


Q0132 
M0068 
H0083 
$0061 
$0059 
$0065 
$0063 
HO055 
HO06! 


C0217 
F0132 
G0344 
B0104 
B0106 
F0126 
C0219 
G0139 
c0023 
FO115 
B0108 
BOL10 
H0063 


B0083 
HO117 
co251 
co051 
c0024 
C0135 
C0230 
C0038 
C0042 
C0035 
H0065 
C0126 
D0033 
00300 
F0090 
F0094 
G0328 
G0552 
F0092 
F0467 
F009! 
F0473 
H0068 
HO067 
HO145 
£005! 
£0049 
£0055 
£0053 
c0071 


C0269 
FO147 


C044) 
C0444 
F0149 
C0273 
G0160 


FO1S6 
C0453 
C0456 
J0045 


C0113 
C0317 


c0028 
C0148 
C0246 


co072 
C0039 


C0145 
D0060 
00337 
F0093 
F0213 
G0390 
G0561 
FO174 
G0036 
FO0198 
G0035 
H0089 


E0101 
£0081 
E0084 
£0058 


C0312 


C0443 


FO0225 
C0316 
G0204 


F0231 
C0455 


c0281 
C0330 


C0037 
C0154 
C0341 


C0043 


C0158 
00085 
00371 
G0034 
F0220 
G0441 
G0574 
FOL8I 
G0196 
F0206 
G0184 
HO135 


G0025 
£0083 
£0086 
£0103 


PC 1.0 85.157 


15:17:30 


12-05-85 


PAGE 0091 


C0372 C0438 G0042 G0043 GO0044 G0059 


C0376 C0450 G005! G0052 GOI!I9 G0136 
G0226 G0282 G0305 G0348 G0366 


F0283 F0336 F0422 F0435 F0463 


C0301 
C0331 


coos! 
C0173 
C0360 


C0221 
DOLI2 
00403 
G0037 
F0393 
G0458 
G0584 
F0242 
G0397 
F0267 
G0385 
HO137 


G0408 
£0099 
E0105 
G0027 


C0324 H0122 


C0084 C0107 
C0178 C0187 
C0395 C0418 


c0275 C0295 
DO153 00180 
00437 00480 
L0039 L0150 
FO0400 F0445 
G0470 G0485 
G0598 G0609 
F0298 F0306 
G0543 G0565 
FO0275 F0319 
G0456 G0483 
H0O141 


G0023 G0406 
GO0029 G0412 
G0240 GO317 


C0110 
C0206 


c0318 
Dd022! 
00511 


G0038 
G0496 
G0624 
F0351 
G0589 
F0379 
G0508 


G0410 


co118 
C0211 


C0380 
00248 
00540 


G0189 
G0510 


F0359 
G0615 
F0386 
G0531 


C0129 
C0224 


C0486 
00273 
00571 


G0251 
G0520 


F0442 


F0452 


GRAPHIC 
LABEL 


32020 FAMILY MACRO ASSEMBLER 
REFERENCES 


VALUE 


0007 


028C 
02A2 
02A9 
02BC 
02C4 
O2DA 
02E1 
O2F 4 
0309 
O31F 
0326 
033C 
034F 
0365 
036C 
037F 
0387 
039D 
O3A4 
0387 
03C9 
O30F 
O3EE 
040! 
0410 
0426 
0435 
0448 
045C 
0476 
047D 
0494 
049C 
04B6 
0480 
0404 
0035 
0008 
0000 
0018 
0010 
0008 
0000 
0036 
OOF 4 
oodCc 
O0E0 
052F 
0280 
0281 
O2FC 
0343 
O3BF 


DEFN 


RO0036 


DOOI9 
00040 
00046 
D0064 
00071 
00092 
00098 
DO116 
D0139 
D0160 
D0166 
D0187 
00207 
00228 
D0234 
00252 
00259 
D0280 
00286 
D0304 
00323 
00344 
00357 
00375 
00389 
D0410 
00423 
D044! 
D0462 
00487 
00493 
00515 
00522 
00547 
00553 
DO575 
RO'86 
$0083 
$0081 
$0053 
$0051 
$0049 
$0047 
RO189 
C0102 
coos 1 
C0085 
FO0095 
00003 
00007 
00125 
DO195 
00313 


FOL41 FOI65 FOI7I 
F0244 F0247 FO0251 
F0321 F0324 FO0328 


F0392 
00043 
D0037 
00068 
D0062 
00095 
00089 
D0120 
DO1I4 
00163 
DO157 
00190 
DO184 
00231 
00225 
D0256 
00250 
D0283 
00277 
00308 
00302 
D0347 
00341 
00379 
00373 
D0413 
D0407 
00445 
D0439 
00490 
D0484 
00519 
D0513 
00550 
00544 
00579 
D0573 
C0353 


C0355 
C0088 
C0050 
C0278 
FO0106 
C0065 
C0066 
C0067 
C0068 
C0069 


F0399 


PC 1.0 85.157 15:17:30 12-05-85 


PAGE 0092 


FOI78 FO192 FOI95 FO203 FO212 FO0219 
FO261 FO264 F0272 FO292 F0295 F0303 
F0345 F0348 F0356 F0370 F0376 F0383 


C0373 CO407 C0439 C0446 C0448 


C0377 C0409 C0451 C0458 C0460 


C0298 C032! 


v89 
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GRAPHIC 
LABEL 


ELPSET 
ENDTBL 
FELIPS 
FELPSC 
FILL2 
FILLC 
FIVE6 


GREG 
HEBLH 


32020 FAMILY MACRO ASSEMBLER 
REFERENCES 


VALUE 


0450 
0026 
0027 
0952 
0667 


0867 
077C 
0cS5C 
0200 
0300 


OBA9 


DEFN 


00450 
RO139 
RO144 
HO135 
F0405 


G0399 
G0198 
P0016 
Qo008 
RO0o8 


P0021 
RO313 
R0338 
A0042 
C0077 
RO030 


FO419 
P0024 
R0033 


C0341 
M0259 
G0018 
C0211 
F0445 
P0051 
RO231 


P0036 
$0019 
$0017 
soots5 
$0013 
FO411 
F0438 
F0449 
$0023 
$0021 
$0027 
$0025 
P0033 
R0039 


R0042 


Bool! 
A0026 
A0028 
A0030 
G0440 
G0454 
G0469 


C0070 
G008 1 
G0082 
H0132 
F0176 
F0278 
F0397 
G0388 
GO187 


H0063 
HO056 
L0194 


40085 
L0094 
A0026 
C0054 
C0411 
FO140 
FQ414 


C0413 
F0459 
C0060 
H0131 
C0225 
c0058 
F0438 
£0046 
BOLI4 
00398 
00568 
B0025 


F0108 
F0447 
F0436 


B0024 
F0139 
F0372 
FO143 
F0382 
A0024 


G0417 
G0418 
G0419 


G0087 
G0096 


FOI84 
F0301 
F0403 
G0393 
G0192 


H0073 
H0084 


L0040 
LOLS2 
A0028 
C0056 
C0463 
FOL42 


C0475 


HO138 
C0276 


E0062 
00173 
00432 


F0454 


FOL67 
F0398 
FOI177 
FO391 
C0049 


G0095 
G0098 


F020! 
FO0309 
F0444 


10041 
H0085 


L0082 
L0194 
A0030 
C0057 
C0470 
F0144 


C0482 


C0319 


00177 
D0473 


PC 1.0 85.157 


G0097 
GOITI 


F0209 


F0326 
F0448 


J0045 
HO086 
L0093 
A0034 
C0062 


C0472 
F0413 


C0484 


C0387 


00216 
00477 


GO100 
GO113 


FO217 
F0330 
F0455 


K0081 
L0040 


A0036 
C0063 
FOIL! 
F0416 


FOLO7 


C0490 


00243 
00504 


15:17:30 


12- 


05-85 


PAGE 0093 


GO1I10 GO112 GO114 


F0223 
F0354 
F0470 


L0082 


A0038 


F0125 
F0432 


FOL31 


00268 
00508 


F0249 
F0362 
F0476 


10093 


A0040 


F0130 


FO135 


00295 
D0533 


F0253 
F038) 


L0094 


A0042 


FO134 


F0136 


00332 
D0537 


F0270 
F0389 


10152 


F0137 


FO138 


00366 
00564 


FO202 FO2Z!1 F0246 F0263 FO0294 F0323 FO0347 


FO194 FO218 FO250 FO27!1 FO302 F0327 F0355 


GRAPHIC 


LABEL 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 


VALUE 


DEFN 


G0481 
G0495 
G0506 
G0519 
G0529 
F0084 


C0135 
G0139 
G0204 
G0282 
G0348 
G0342 
G0103 
G0417 
J0044 
G0365 
GO127 
RO234 
ROITI 
RO240 
RO369 
H0O142 
F0202 
FO!77 
FQ149 
FO218 
F0246 
F0250 
FQ225 
F0263 
F0271 
F0323 
F0327 
F0294 
F0302 
F0398 
F0382 
FO0347 
FQ355 
FO154 
FO165 
FO210 
FO192 
F0230 
F0240 
FO261 
FO282 
FO0292 
FO317 
F0334 
F0345 
F0390 
F0370 


REFERENCES 


G0420 
G042) 
G0422 
G0423 
G0424 
C0130 
D0160 
00442 
L0106 
C0053 
G0199 
G0266 
G0343 
G0400 
G028 1 
GO117 
GO0244 
HO110 
G0301 
G0128 
BO116 
C0156 
HO056 
H0078 
H0139 
F0193 
FO166 


FO210 


F0245 
F0148 


F0262 


F0322 


F0293 
F0390 
FQ371 


F0346 


F0200 
FO169 
FOI5S7 


F0248 
F0232 
F0226 
F0300 
F0284 
FO150 
F0388 
FQ374 
F0337 


C0149 
00187 
D0487 
10120 


G0321 


DO142 
co160 
H0084 
KO0066 


F0208 
F0183 


FO277 
F0252 


F03293 
F0308 


F0353 


15:17:30 12-05-85 
PAGE 0094 


C0188 C0201 C0296 D0040 DO065 D0092 00117 
00228 00253 00280 DO305 D0344 00376 D04I0 
DO0516 00547 00576 L0052 L0066 L0080 L0092 
L0134 LO148 L0164 LO178 LO192 10204 


DO146 
C0165 C0176 C0195 
HO086 


FO216 FO222 


FO0269 


F0325 


FO361 F0396 FO0402 


ISOVESILL PUP OZOZESWLL 94) BuIs—E) UONEaW|dwW] so1ydeIn y “¢Z 


$89 


GRAPHIC 


LABEL 
ONE 


ONE6 


OSCRX1 
OSCRX2 
OSCRX3 
OSCRX4 
OSCRY1 
OSCRY2 
OTEST 
PARSER 
PAXCNT 
PIXCNT 
PIXELC 
PLOOP 
PLOT 
PLOTI 


~ PLOT2 


PLOT3 
POLY 1 
POLYC 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 

VALUE ODEFN REFERENCES 

0062 P0048 80027 C0030 C0036 C0083 C0109 C0128 
C0205 C0223 C0245 C0247 C0359 C0417 

OO2E ROI65 B0098 BO101 BO103 BOI05 B0107 BO109 
C0140 C0142 C0144 CO157 C0159 CO168 
C0218 C0220 C0233 C0235 C0238 C0240 
C0346 C0348 C0350 C0352 C0354 C0356 
C0398 C0400 C0402 C0404 C0406 CuL408 
C0487 00022 00034 00036 00038 00053 
00088 DC090 00105 00113 00130 00133 
00158 0018! 00183 00185 00222 00224 
D0276 00278 00301 00338 D0340 00342 
D0408 00438 00481 00483 00485 00512 
00572 GO121 G0223 G0302 HO0077 

073A G0144 G0142 

0787 G0209 G0207 

O7E2 G0287 G0285 

082F G0353  G0351 

O75! GO163 GOI61 

O79E G0229 G0227 

0561 FOI38 = FO133 

OosF C002! 80117 

002D ROI62 G0224 G0249 GO252 GO303 G0326 G0329 

002C ROIS9 GO118 GOI22 GOI24 GOI25 GO176 GO177 

0100 C0118 C0052 

0093 c0025 C0040 C0045 

0983 0037 # H0123 

O71E GO118 

0728 GO0I32 8 G0123 

07CD G0270 G0200 

O12C )§=696C0165) = C0162 

0120 C0154 C0055 

0154 C0204 C0163 

0137 COl76 C0196 

0145 C0190 COI74 

0558 FOI30 FO0I28 

0530 F0107 FO0097 

0003 P0030 80022 

0993 K0061 HOtt! 

0012 Q0069 H0090 10041 

0014 Q0075 H0093 H0095 

0013 Q0072 H0096 

0015 Q0078 HO009! 

0004 H0050 # 4H0071 

oscc G0540 G0426 

0802 G0552 G0427 

0805 G0561 G0428 

080C G0574 G0429 

O8EO G0584 G0430 

O8ES G0598 G0431 

O8ED G0609 G0432 

O8F6 G0624 G0433 

0000 A0024 

O7BC G0254 G0250 

0816 GO0331 G0327 

OO1A A0036 

0B60 NQ007 80099 

0962 10040 HO0!09 


15:17:30 


C0147 


C0122 
C0182 
C0242 
C038! 
C0410 
00061 
00135 
00226 
00372 
00541 


G0378 


€0172 


C0125 
C0214 
C0334 
C0383 
C0412 
D0074 
00154 
D0249 
D0404 
D0543 


12- 
PAGE 0095 


05-85 


C0186 


C0138 
C0216 
C0344 
C0385 
Co4i4 
00086 
D0156 
0274 
D0406 
00545 


GRAPHIC 
LABEL 


RPTCNT 
RSBASE 
SAVARO 


SAVAR2 


SAVAR3 
SAVCLR 


SAVEA 
SAVEB 
SAVREG 
SCALE 
SETEO 
SETE! 
SETEIE 
SETE2 
SIN 
SINTBL 
SLHIL 
SLHIU 
SLHLL 
SLHLU 
SLLOL 
SLLOU 
SPEED 
SPIN 
SPINO 
SPINI 
SPINIE 
SPIN2 
SPINZE 
STATH 
STATL 
STEPX 
STEPY 
STOP 
STSO 
STS! 
SYNC 
SYNCH 
TIH 
TIL 
T2H 
T2L 
T3H 


32020 FAMILY MACRO ASSEMBLER 
REFERENCES 


VALUE 
0400 


0888 
002F 


OO6F 


0070 
0031 


DEFN 


HO0051 
G0426 
RO168 


RO0363 


RO366 
RO174 


RO177 
RO237 
P0054 
N0052 
C0363 
C0366 
C0389 
C0387 
Q0129 
M0004 
G0394 
G0193 
G0389 
G0188 
G0383 
G0182 
RO183 
C0230 


HO0079 
G0262 
C0263 
D0041 
00166 
DO0306 
00462 
H0143 
L005! 
LO121 
L0203 
L0105 
C024) 
C0379 
00039 
00152 
00227 
00336 
00441 
00546 
C0237 
G0120 
G0021 
HOO75 


C0390 
C0364 
C0384 
H0070 
HO069 
G0381 
GO180 
G0394 
GO193 


C0243 
C00s9 
C0336 
C0284 
C0261 
C0327 
C0304 


c0098 
G0088 
GO101 
C0064 
£0044 


C0033 
c0029 
G0046 
G0047 
G0049 
GO0050 
G0054 


G0339 
C0282 
D0046 
00188 
D0323 
00488 


L0054 
10132 


L0108 
C0250 
C0382 
00059 
00155 
00247 
00339 
00459 
00570 
C0255 
G0274 
G0403 


H0092 
H0133 
G0387 
G0186 


C0257 


C0099 
G0102 
GO115 
C0076 
£0107 


C0031 
G0073 
G0074 
G0091 
G0092 
G0060- 


C0306 
DO066 
D0207 
D0345 
00493 


LO065 
LO135 


LOIS 
C0253 
C0386 
DO064 
D0159 
00252 
00343 
00479 
D0575 
C0258 
G0279 


H0128 
HO140 
G0392 
GO191 


C0270 


£0077 


FO085 


G0075 


PC 1.0 85.157 


C0325 
DOO071 
00229 
DO357 
00517 


L0068 
LO146 


LO122 
C0254 
C0405 
DO084 
00179 
DO272 
00370 
D0482 


C0260 


H0130 


C0313 


E0078 


F0406 


C0366 C0388 
DO0093 D0098 
D0234 D0254 
D0377 D0389 
00522 00548 


L0079 L0091 
L0163 10166 


LO133 LO136 
C0274 C0294 
C0485 C0489 
D0087 D009! 
00182 D0186 
00275 00279 
00375 00402 
00486 00510 


C0303 C0357 


H0134 


GO019 G0414 


15:17:30 


C0436 
BO118 
00259 
D041! 
00553 


LO104 
LO177 


LO147 
C0329 
DO016 
DOI!1 
00204 
D0299 
00405 
DO515 


C0363 


C0491 
DOL39 
D028 1 
00423 
00577 


LO107 
L0180 


C0332 
00032 
DOI16 
00220 
00304 
DO0409 
00539 


C0415 


12- 
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05-85 


00019 
D016! 
00286 
D0443 
H0083 


LO118 
LOI91 


C035! 
00035 
00136. 
00223 
00321 
00436 
D0542 


C0433 


989 
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GRAPHIC 
LABEL 


32020 FAMILY MACRO ASSEMBLER PC 1.0 85.157 


VALUE 


0019 
QOIA 
001B 
OOIC 
001D 
OOIE 
OOIF 
0020 
0021 
0028 
0043 


0002 
O06A 
0018 
0068 
OOIE 
0018 
0010 
0023 
0025 
0058 
0050 
0048 
0040 
0028 
0020 
06A2 
06AB 
0B73 
06B3 
0068 
0060 
0033 
0400 


0022 
0024 
0078 
0070 
O0ES 
OOEF 
O4FF 
0237 
023A 
O27C 
0246 
0249 
0254 
0257 
0262 
0265 
0270 
0273 
0212 
0797 


DEFN 


RO093 
RO0096 
R0099 
RO102 
RO105 
RO108 
RO1L11 
ROL14 
ROLI7 
Q0135 
R0228 


P0027 
P0072 
A0034 
P0066 
A0040 
$0087 
$0085 
Q0120 
Q0126 
$0035 
$0033 
$0031 
$0029 
$0057 
$0055 
F0459 
F0465 
NOO14 
F0471 
$0039 
$0037 
B0029 
$0009 


QO117 
Q0123 
$0043 
$0041 
C0089 
C0099 
£0078 
C0433 
C0436 
C0492 
C0446 
C0449 
C0458 
C0461 
C0470 
C0473 
C0482 
C0485 
C0395 
G0223 


REFERENCES 


G0055 
G0057 
G0058 
G0066 
G0067 
G0068 
G0069 
G0071 
G0072 
H0072 
BO112 
00328 
D0529 


C0082 


C0086 


c0097 
C0095 
FO127 
F0469 
H005S9 
F0464 
c0092 
co091 


B0032 
G0308 
H0073 


c0102 
corot 
E0080 


C0493 
C0434 
C0442 
C0445 
C0454 
C0457 
C0466 
C0469 
C0478 
C0481 
C006! 
G0158 


G0064 
GO105 
G0106 
G0079 
G0080 
GO103 
G0104 
G0089 
G0090 
HOL29 
p0026 
00362 
D0556 


c0085 


C0108 


£0076 
E0074 


F0475 
£0070 
E0068 


F0088 
G0317 
K008! 
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G0076 


GO107 
G0108 
G0093 
G0094 
HO136 
00057 00078 00109 00212 00239 00264 00291 
00394 D0428 00465 00469 00496 00500 00525 
00560 


C0248 


cotle 


GO0132 GO165 GO231 GO240 G0257 G0259 GO0270 
G0334 G0336 G0369 
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X 
XI 


X1H 
XIL 


X2 


X2H 
X2L 
XA 


XB 


XC 


XCLR 
XCYC 
XCYD 
XCY! 
XD000 
xD001 
X0002 
x0003 
XD004 
XD005 
XD006 
X0007 
XDEC 
XDYC 
XDYD 
X0YI 
XINC 
XINT 
XTYC 
XIYD 
XIYI 
XM 
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VALUE 


0022 
0000 


0028 


0029 


0002 


002A 
002B 
0022 


0024 


DEFN 


RO120 
ROOI4 


RO147 


RO150 


RO0021 


RO153 
RO156 
ROI2) 


RO129 


ROO63 


$0103 
$0127 
$0119 
SOrl1 
RO314 
RO320 
R0326 
RO332 
R0339 
RO345 
RO351 
RO357 
$0101 
$0125 
$0117 
$0109 
$0099 
A0038 
$0123 
So1trs 
$0107 
RO133 


REFERENCES 


HO103 10045 
C0120 C0137 
DO168 00209 
00464 00495 
FOII9 FOL2t 
GOI52 G0209 
L0070 L0084 
L0196 

GO153 GO157 
G0362 G0374 
GO155 GO167 
GO322 G0364 
C0121 C0141 
00145 DO!70 
00427 00468 
FO417 GO!147 
L0046 L0060 
LO172 L0186 
GO0149 GO156 
GO151 GO174 
C0166 C0197 
C0461 C0464 
DO127 00140 
00315 D0324 
00463 00466 
C0167 C0181 
C0371 C0374 
D0027 00049 
00198 DO210 
D0316 D0326 
00418 00426 
DO0530 00558 
L0159 LOI167 
C0213 C0265 
G0283 G0288 


F0335 


F0282 F0334 


10051 
C0192 
D0236 
D0524 
F0412 
G0217 
L0097 


G01 70 


G0175 
G0371 
C0180 
00211 
00499 
G0148 
L0074 
L0200 
GO214 
GO216 
C0232 
00009 
00143 
00350 
00494 
C0191 
C0401 
00076 
00213 
D0329 
D0429 
D0561 
LO173 
C0308 
G0349 
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C0287 
D0261 
D0555 
F0418 
G0287 
LOI] 


G0218 


GO179 
G0380 
C0198 
D0238 
00528 
G0212 
L0088 


G0221 
G0263 
C0264 
D0020 
00167 
00358 
00497 
C0236 
C0421 
00079 
00237 
D0352 
DO0455 
L0047 
L0181 
C0368 
G0354 


D0021 
D0288 
F0102 
F0421 
G0295 
LO125 


G0222 
G0220 


C0288 
D0263 
00559 
G0213 
L0101 


G0292 
G0294 
C0286 
00023 
00197 
D0386 
00523 
C0268 
C0424 
DOI01 
00240 
D0360 
00467 
L0055 
L0187 
C0462 
J0056 


D0048 
00325 
FO105 
F0425 
G0353 
LO139 


G0236 
G0233 


00025 
00290 
FO0103 
G0290 
L0115 


G0299 
G0340 
C0307 
DO047 
00208 
D0390 
00526 
C0271 
C0437 
D013! 
00262 
00363 
00470 
L006) 
LO195 
G0140 
K0075 
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D0073 
00359 
FO!10 
F0426 
G0361 
LO154 


G0296 


G0245 


00050 
00327 
FO104 
G0291 
LO129 


G0358 
G0360 
C0343 
D0072 
D0235 
00416 
D0554 
CO311 
C0440 
00144 
00265 
00387 
00498 
L0069 


GO145 
K0077 


E0066 F0155 F0434 GO178 G0241 GO318 GO379 


FO0154 FO0230 


00100 
00391 
FOI14 
F0428 
L0042 
L0168 


G0300 


G0298 


DOO77 
00361 
FOLO9 
G0356 
L0143 


C0367 
DO075 
00260 
00424 
00557 
C0314 
00013 
00147 
00289 
00392 
00501 
L0075 


G0205 
K0085 
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00141 
00425 
FOLI8 
G0144 
L0056 
L0182 


G0313 
G0310 


00102 
00393 
FO411 
G0357 
L0158 


C0397 
00099 
D0287 
00452 


C0347 
00024 
DO0169 
00292 
00395 
D0527 
0083 


GO210 
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XMAX 
XMAXA 
XMIN 
XMINA 


XYOF FH 
XYOFFL 
Y 
Y! 


Y2 


YA 


YB 


Yc 


YCLR 

YOO00 
YO00! 
yo002 
Y0003 
yO004 
Yy0005 
YD006 
YD007 
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VALUE 


0037 
0038 


0003 


0023 


0025 


0011 


0060 
0060 
0062 
0064 
0066 
0068 
OO06A 
006C 
006E 


DEFN 


RO192 
RO0204 
RO195 
R0207 
RO241 
RO250 
R0259 
RO268 
RO277 
RO286 
RO295 
RO304 
ROO13 
RO132 
$0077 
$0075 
$0093 


$0097 
$0073 
$0071 
RO124 
ROO18 


R0024 


RO125 


RO134 


ROO66 


s0t21 
RO317 
RO323 
RO329 
RO335 
RO342 
RO348 
R0354 
RO360 


BO100 
C0422 
C0423 
C0425 


HO1!3 
10049 
£0061 
E0060 
FOL45 
F0434 
FO145 
G0165 
F0088 
H0105 
C0123 
00172 
00472 
F0427 
L0058 
L0184 
C0124 
DOIS} 
00435 
GO312 
L0103 
C0169 
C0473 
00128 
00266 
00367 
00474 
C0170 
C0335 
00014 
00175 
00355 
00506 
L007! 
co215 
G0304 


C0420 
C0465 


C0468 


K0087 
J0049 


G0259 
F0146 
F0460 
F0433 
G0231 
GO132 
10047 
C0139 
00215 
D0503 
F0430 
10072 
L0198 
C0143 
00176 
00476 
G0315 
LOLI7 
C0199 
C0476 
00148 
00269 
00384 
00502 
C0184 
C0349 
00030 
00178 
00368 
D0509 
L0078 
C0267 
G0365 


00129 00349 00385 


C0467 


J0058 


G0336 
FO1S4 
F046! 
F0460 
G0257 
G0270 
10053 
C0194 
00242 
00532 
G0163 
L0086 


C0183 
00219 
00507 
G0367 
L0131 
C0234 
00011 
00171 
00293 
00396 
00505 
C0193 
C0375 
00055 
00202 
00382 
00535 
L0085 
C0310 
J0065 


J0067 


FOLSS 
G0137 
G0137 
G0308 


co291 
00267 
00563 
G0169 
L0099 


C0200 
00246 
00536 
G0373 
LO145 
C0266 
00028 
00174 
00296 
00399 
00531 
C0239 
C0378 
00058 
00218 
00400 
00538 
L0162 
C0370 
K0078 


F0230 
GO178 
G0275 
G0334 


00029 
00294 
F0096 
GO172 
L0113 


C0293 
0027! 
00567 
G0376 
LO16t 
C0289 
00051 
00200 
00319 
100420 
00534 
C0272 
C0403 
00082 
00245 
00421 
00566 
LO169 
C0474 
K0080 


F0282 
G0241 


G0369 


00052 
00331 
F0098 
G0229 
L0127 


00031 
00298 
F0095 
L0049 
LOI75 
C0309 
00054 
00214 
00330 
00430 
00562 
C0290 
C0427 
00107 
00270 
00434 
00569 
LO176 
G0159 
K0089 
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F0334 
G0275 


00081 
00365 
FO101 
G0235 
LO141 


00056 
00335 
F0099 
L0063 
LO189 
C0345 
00080 
00217 
00333 
00433 
00565 
C0292 
C0430 
00110 
00297 
00457 
L0050 
L0183 
G0225 


F0335 
G0318 


00104 
D0397 
FO1L20 
G0238 
10156 


00083 
D0369 
FO100 
L0077 
LO0202 
C0369 
00103 
00241 
00354 
00453 


C0315 
C0449 
00134 
00317 
00475 
L0057 
L0190 
G0278 


F0433 
G0379 


DO149 
D0431 
F0123 
L0044 
L0170 


00108 
D040! 
G0306 
L0090 


C0399 
00106 
D0244 
00364 
00471 


C0333 
C0452 
00150 
00334 


00478 . 


L0064 
L0197 
G0280 
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VALUE 


DEFN 


$0113 
$0105 
RO138 
RO198 
ROZ10 
RO201 
RO213 
RO244 
RO253 
RO262 
RO0271 
RO280 
RO289 
RO298 
RO307 
ROO17 
RO137 
RO128 
RO069 
ROO73 
ROO77 
RO143 
RO247 
RO256 
RO265 
RO274 
RO0283 
RO292 
RO301 
RO310 
RO142 


F0146 


B0102 
C0428 
C0429 
C0431 


HOII5 
10055 
HO107 
J0074 
K0069 
K0063 


40108 


PAGE 0100 


F0461 


C0426 00132 0038! 00419 
C0477 C0479 


C0480 


K0091 
J0051 J0060 J0069 


KO0062 K0065 K0068 


KO070 K0073 K0074 
K0071 K0072 


J0053 J0062 J0071 
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INTRODUCTION 


Algorithms, software, and hardware for designing and 
implementing a digital control system using the Texas 
Instruments TMS32010 signal-processing microprocessor are 
presented in this application report. Microprocessors, such 
as the TMS32010, are increasingly being used to implement 
algorithms for the control of feedback systems. The major 
factors contributing to this trend are increased availability 
and lower cost of suitable digital hardware. 

Current and potential applications include servo motor 
control, process control, robot arm and disk head controllers, 
and temperature and pressure controllers. Military and 
aerospace applications include stabilized platforms, flight 
control and autopilot systems, inertial reference systems, and 
general servomechanisms. 

In meeting control system requirements, designers face 
many alternatives. Cost, size, weight, power, and reliability 
decisions are typically application dependent. This report 
highlights the tradeoffs in developing algorithms, software, 
and hardware for a digital control system. 


DIGITAL CONTROL SYSTEMS 


General Considerations 

A digital controller is a signal processing system that 
executes algebraic algorithms inherent to the control of 
feedback systems (i.e., compensator and filter algorithms). 
Together with the plant (system to be controlled) and signal- 
acquisition circuitry, the digital controller makes up a digital 
control system such as the one shown in Figure 1. 

Note that the system requires analog-to-digital (A/D) 
converters for the external (command) inputs and for the 
state-variable feedback inputs to the digital controller. The 
system also requires a digital-to-analog (D/A) converter for 
the control outputs to the plant. 


INTERNAL 


INPUTS DIGITAL 


CONTROLLER 


The advantages of the digital control approach over the 
analog approach are: 

1. Ability to implement advanced control 
algorithms with software rather than special- 
purpose hardware 

2. Ability to change the design without changing 
the hardware 

3. Reduced size, weight, and power, along with low 
cost 

4. Greater reliability, maintainability, and testability 

5. Increased noise immunity. 


Microprocessor Selection and System 
Development Cycle 

Choosing an appropriate microprocessor is an 
important factor in efficiently implementing a digital control 
design. A class of special-purpose (as opposed to general- 
purpose) digital signal-processing microprocessors has been 
developed to enable fast execution of digital control 
algorithms. The Texas Instruments TMS32010 provides 
several beneficial features for implementing digital control 
system elements through its architecture, speed, and 
instruction set. 

A prominent feature of the TMS32010 is the on-chip, 
16 x 16-bit multiplier that performs two’s-complement 
multiplication and produces a 32-bit product in a single 
200-ns instruction cycle. The TMS32010 instruction set 
includes special instructions necessary for fast 
implementation of sum-of-products computations 
encountered in digital filtering/compensation and Fourier 
transform calculations. Most of the instructions critical to 
signal processing execute in one instruction cycle. References 
[1,2] give full details of the TMS32010 hardware and 
software considerations. 

Many system development tools are available and may 
be used for digital control system design.3 Figure 2 outlines 


PLANT OR OUTPUTS 


SYSTEM TO BE 
CONTROLLED 


Figure 1. Digital Control System 
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IDEA/SYSTEM SPECIFICATIONS 


DIGITAL CONTROL SYSTEM DESIGN PHASE, DISCRETE TRANSFER FUNCTIONS, 
STABILITY ANALYSIS, SIMULATION, ETC. 
(DIGITAL FILTER DESIGN AIDS FOR THE TMS32010) 


SYSTEM ARCHITECTURE DESIGN AND 
EVALUATION OF PROCESSOR PERFORMANCE 
(TMS32010 EVALUATION MODULE AND 
ANALOG INTERFACE BOARD) 


SOFTWARE CONFIGURATION 
AND DEBUGGING 
(TMS32010 ASSEMBLER/LINKER, 
SOFTWARE LIBRARIES, 
TMS32010 SIMULATOR) 


SYSTEM BINDING PHASE. HARDWARE/SOFTWARE INTEGRATION 
(TMS32010 EMULATOR (XDS)) 


Figure 2. Digital Control System Development. Phases 


HARDWARE CONFIGURATION | 
AND DEBUGGING 
(TMS32010 EMULATOR (XDS)) 


the system development cycle and ties the development tools 
to different project phases. 

Among the non-TI development aids, an interactive 
program called the Digital Filter Design Package developed 
by Atlanta Signal Processors Incorporated, may be useful 
in digital control system design. This program designs 
various types of filters, compensators, and other structures. 
It can be used when, for example, there is a need for several 
notch filters to filter out unwanted frequencies in a digital 
feedback loop. 


DIGITAL COMPENSATOR DESIGN 


Alternative methods exist for designing digital 
compensators. This section outlines several approaches to 
digital controller design and points out the analytical tools 
useful in the design process. 


Design Based on Analog Prototype 

A commonly used method of designing a digital control 
system is to first design an equivalent analog control system 
using one of the well-known design procedures. The resulting 
analog controller (analog prototype) is then transformed to 
a digital controller by the use of one the transformations 
described below. 

The design of the analog controller may be carried out 
in the s-plane using design methods such as root-locus 
techniques, Bode plots, the Routh-Hurwitz criterion, state- 
variable techniques, and other graphic or algebraic methods. 
The purpose is to devise a suitable analog compensator 
transfer function which is transformed to a digital transfer 
function. This digital transfer function is then inverse z- 
transformed to produce a difference equation that can be 
implemented as an algorithm to be executed on a digital 
computer. Two of the analog-to-digital transformation 


methods, the matched pole-zero and the bilinear 
transformation, are described as follows: 

1. The matched pole-zero (matched Z-transform) 
method maps all poles and zeroes of the 
compensator transfer function from the s-plane 
to the z-plane according to the relation: 


= esT 
where T is the sampling period. 


If more poles than zeroes exist, additional 
zeroes are added at z = — 1, and the gain of the 
digital filter is adjusted to match the gain of the 
analog filter at some critical frequency (e.g., 
at DC for a lowpass filter). This method is 
somewhat heuristic and may or may not 
produce a suitable compensator. 

2. The bilinear (Tustin) transformation method 
approximates the s-domain transfer function 
with a z-domain transfer function by use of the 
substitution: 


2 z—-1 
T zt+l 


5s = 


As in the matched pole-zero method, the 
bilinear transformation method requires 
substitution for s. Compensators in parallel or 
in cascade maintain their respective structures 
when transformed to their digital counterparts. 
This substitution maps low analog frequencies 
into approximately the same digital frequencies, 
but produces a highly nonlinear mapping for 
the high frequencies. 


To correct this distortion, a frequency 
prewarping scheme is used before the bilinear 
transformation. The frequency prewarping 
operation results in matching the single critical 
frequency between the analog domain and the 
digital domain. To achieve this result, the 
prewarping operation replaces each s in the 
analog transfer function with (wo/wp)s where 
Wo is the frequency to be matched in the digital 
transfer function and 


2 Wot 
Wp = +r tan a 


Bilinear transformation with frequency prewarping 
provides a close approximation to the analog compensator> 
and is the most commonly used technique. Other methods 
for converting a transfer function from the analog to the 
digital domain are: the method of mapping differentials/8, 
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the impulse-invariance method®, the step-invariance 
method!8, and the zero-order hold technique.§ 

The basic disadvantage of design based on an analog 
prototype is that the discrete compensator is only an 
approximation to the analog prototype. This analog prototype 
is an upper bound on the effectiveness of the closed-loop 
response of the digital compensator. 


Direct Digital Design 

The direct digital design technique is a design method 
where a digital control system design is carried out from the 
very beginning in the z-domain to produce a digital control 
algorithm. Various pole-placement and relocation techniques 
are used to position the poles and zeroes of the system’s z- 
domain transfer function to yield the required system 
performance. 

The root-locus method in the z-plane is similar to the 
root-locus design in the s-plane in that both are based on 
observing the position of the closed-loop poles as a function 
of the system gain. However, the effect of the locations of 
the poles and zeroes on system performance is not the same 
as in the s-plane. Knowledge of such correspondences in the 
z-plane allows those familiar with continuous system design 
to design digital compensators. For example, in stabilizing 
an unstable system, the adjustable gain is used to move the 
poles inside the unit circle instead of inside the left-half plane. 
A phase-lead controller may be employed to shift the root- 
locus to the left, which results in a system that responds 
faster. A phase-lag controller may be designed to allow for 
higher loop gain to produce smaller steady-state errors and 
improved disturbance rejection. 10 

The z-domain designer may also use pole-zero 
cancellation. In this technique, some of the poles and zeroes 
of the digital transfer function of the plant may be cancelled 
by zeroes and poles of the digital compensator. The 
compensator then introduces additional poles and zeroes at 
locations that enable the designer to achieve the desired 
performance characteristics. This method may affect system 
stability due to ‘‘inexact cancellation’’. If the poles of the 
plant that are to be cancelled lie close to the unit-circle, 
inexact cancellation may cause the system root-locus to go 
outside the unit circle at some point. This makes the system 


conditionally stable or unstable. 
In a system where fast response to the control input 


is required, a ‘‘deadbeat’’ approach may be taken. The 
deadbeat controller cancels all the zeroes and poles of the 
plant and introduces a pole at z = 1. The result is that the 
system output reaches its steady-state value in one sampling 
period with no overshoot. In practice, an ideal deadbeat 
controller is difficult to implement because of inexact 
cancellation. Although the output does not experience 
overshoot, it may oscillate between sampling instants. The 
design is ‘‘tuned’’ in the sense that the system response may 
be acceptable for a step input but not acceptable for other 
inputs. 

With z-plane design, conventional design techniques 
can be used to place the closed-loop system poles exactly 
where desired. The approximations associated with digitizing 


an analog prototype are thereby eliminated. The disadvantage 
of this technique is the relative difficulty of visualizing the 
effect of pole-zero locations in the z-plane on system 
performance. To overcome the disadvantage of the z-plane 
technique, the designer can use the w-plane or, better yet, 
the w’-plane design technique.> Both techniques transform 
the design to a plane similar to the s-plane by means of the 
same kind of substitution as in the bilinear transformation 
described earlier. This procedure thereby allows the use of 
the familiar s-plane and frequency-domain methods of 
continuous-system design. The designer proceeds by first 
transforming the continuous plant to the z-domain and thence 
to the w-plane or w’-plane. The appropriate compensator is 
then devised and transformed back to the z-plane, where it 
is used to specify the corresponding computational algorithm. 


State-Variable Design Methods 

State-variable design methods can also be used, 
including state-variable feedback and optimal control based 
on quadratic synthesis. 

In the state-variable feedback technique, all of the states 
are measured and fed back through constant gains. This 
allows all of the closed-loop poles to be positioned at any 
desired locations in the z-plane, but does not affect the 
positions of the system zeroes. 

In a design based on quadratic synthesis, a performance 
index or cost function is minimized by proper choices of the 
control law or feedback compensator. In the most practical 
design, the resulting compensator has the same form as that 
resulting from the application of direct digital design 
techniques.’ 


DIGITAL COMPENSATOR IMPLEMENTATION 


Digital compensator algorithms execute on processors 
that use finite-precision arithmetic. The signal-quantization 
errors associated with finite-precision computations and the 
methods for the handling of these errors are presented in this 
section. 


Fixed-Point Arithmetic and Scaling 

Computation with the TMS32010 is based on the fixed- 
point two’s-complement representation of numbers. Each 
16-bit number has a sign bit, i integer bits, and 15-i fractional 
bits. For example, the decimal fraction +0.5 may be 
represented in binary as 


0.100 0000 0000 0000 


This is Q15 format since it has 15 fractional bits, one 
sign bit, and no integer bits. The decimal fraction + 0.5 may 
equivalently be represented in Q12 format as: 


0000. 1000 0000 0000 


This number is in the Q12 format because it has 12 
fractional bits, one sign bit, and three integer bits. Note that 
the Q15 notation allows higher precision while the Q12 
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notation allows direct’ representation of larger numbers. 

For implementing signal-processing algorithms, the 
Q15 representation is advantageous because the basic 
operation is multiply-accumulate and the product of two 
fractions remains a fraction with no possible overflows during 
multiplication. When the Q12 format is used, a software 
check for overflow is necessary. The subsection, 
OVERFLOW AND UNDERFLOW HANDLING, provides 
a detailed analysis of overflow handling. 

In the case where two numbers in Q15 are multiplied, 
the resulting product has 30 fractional bits, two sign bits, 
and (as expected) no integer bits. To store this product as 
a 16-bit result in Q15, the product must be shifted left by 
one bit and the most-significant 16 bits stored. The 
TMS32010 instruction SACH allows for this one-bit shift. 

In the case where a Q15 number is to be multiplied 
by a 13-bit fractional signed constant represented as a Q12 
number, the result (to correspond with Q15) must be left- 
shifted four bits to maintain full precision. The TMS32010 
instruction SACH allows for the appropriate shift. The Q15 
and Q12 representations are used in the example in the 
section, DESIGN EXAMPLE: RATE-INTEGRATING 
GYRO STABILIZATION LOOP. 

When fixed-point representations are used, the control 
system designer must determine the largest magnitudes that 
can occur for all variables involved in the computations 
required by the digital compensator. (Floating-point 
representations allow larger magnitudes, but take more time 
for the microprocessor to perform the required 
computations.) Once these largest magnitudes are known, 
scaling constants can be used to attenuate the compensator 
input as much as necessary to ensure that all variables stay 
within the range that can be expressed in the given 
representation. 

Several methods are used to determine bounds on the 
magnitudes of the variables. One method, called upper-bound 
scaling, provides a useful, although sometimes too 
conservative, bound on the magnitude, yet it is 
straightforward to calculate. Consider a variable y(n) that 
is obtained as the output of a digital compensator H(z) when 
the input is the sequence x(n). The bound on y(n) is given by 


co 


| Xmax| x 
n=1 


| h(n) | 


Ymax = 


where Xmax is the maximum value in x(n) and the sequence 
h(n) is the unit-sample response sequence for the digital 
compensator H(z). 

Other methods for estimating the upper bound are Lp- 
norm scaling, unit-step scaling, and the averaging 
method.9.10 

After ymax is determined, the scale factor can be 
chosen as the multiplier that is applied to x(n) prior to the 
compensator computations to ensure that y(n) remains within 
the required bounds. In addition, the control system designer 
may have knowledge concerning the bounds on the 
compensator variables based on prior experience, the 
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characteristics of the corresponding variables of analog 
prototypes, and simulation results. 


Finite-Wordlength Effects 

All variables involved in the digital compensator — the 
input, the compensator coefficients, the intermediate 
variables, and the output — are represented as finite- 
wordlength numbers. This restriction gives rise to errors. 
Another source of errors is the truncation or rounding that 
takes place when the 32-bit product of two 16-bit numbers 
is stored as a 16-bit number. Both of these errors give rise 
to the finite-wordlength effects discussed in this section. 

The representation of the compensator input as a finite- 
precision (quantized) number produces an input-quantization 
error. The size of this error for a rounding scheme can be 
anywhere from —(2~—8)/2 to (2—8)/2 where B is the 
number of bits in a word. The input-quantization error is 
usefully modeled as a zero-mean random variable uniformly 
distributed between its positive and negative bounds. A 
technique5.10 is available to calculate the variance of the 
corresponding error at the compensator output (its mean is 
zero). In this manner, the designer can determine the effect 
of input quantization on the compensator output. 

Similar quantization errors are associated with the 
multiplication process. Each multiplication is assumed to 
produce the ‘“true’’ product with an error that is a zero-mean, 
uniformly distributed random variable. The variance of the 
corresponding error at the compensator output can be 
calculated in the same manner as for the error due to input 
quantization. These individual variances are then added to 
measure the total effect at the compensator output for each 
truncation or rounding. — 

Another way to describe the effects of truncation or 
rounding is in terms of ‘‘limit cycles’’ which are sustained 
oscillations in the closed-loop system. These oscillations are 
caused by nonlinearities within the loop. In this case, the 
nonlinear quantizations are associated with the 
multiplications. Limit cycles persist even when the system 
input goes to zero, and their amplitude can be sizeable. No 
general theory is available to treat this nonlinear 
phenomenon. Bit-level simulations which model the 
compensator and the complete closed-loop system are used 
to ascertain their presence and effect on the closed-loop 
performance. 

When a digital compensator is implemented as an 
algorithm to be executed on finite-precision hardware, a 
problem arises with implementing the coefficients present 
in the corresponding transfer function (see section, 
TMS32010 IMPLEMENTATION OF COMPENSATORS 
AND FILTERS). The infinite-precision compensator 
coefficients must be rounded and stored using a finite-length, 
fixed-point binary representation. Due to this coefficient- 
quantization effect, the performance of the implemented filter 
will deviate from the performance of the designed digital 
filter. 

The deviation in performance can be estimated by 
computing the filter’s pole and zero locations and the 
corresponding frequency response magnitude and phase for 
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the compensator with the quantized coefficients. Coefficient 
quantization forces the filter’s poles and zeroes into a finite 
number of possible locations in the z-plane and is of most 
concern for filters with stringent specifications, such as 
narrow transition regions. 

The designer must choose the filter structure least 
sensitive to inaccurate coefficient representation. The choice 
should be of a modular rather than a direct filter structure. 
For example, a higher-order filter should be implemented 
as a cascade or parallel combination of first-order and second- 
order blocks. The reason for this choice is the lesser 
sensitivity to coefficient variations of the roots of low-degree 
polynomials in comparison with high-degree polynomials. 
Several methods for selecting the filter structures least 
affected by coefficient quantization are available.? 

To quantitatively evaluate the effect of coefficient 
quantization on the position of the poles or zeroes of a digital 
transfer function, a ‘‘root sensitivity function’’ can be 
computed .6 


Overflow and Underflow Handling 

Digital control system algorithms are usually 
implemented using two’s-complement, fixed-point 
arithmetic. This convention designates a certain number of 
integer and fractional bits. The fixed-point arithmetic 
computations may, at some point, produce a result that is 
too large to be represented in a chosen form of fixed-point 
notation (e.g., Q12). The resulting overflow, if untreated, 
may cause degraded performance such as limit cycles and 
large noise spikes at the filter’s output which may contribute 
to the system’s instability. The system must be able to recover 
from the overflow condition, i.e., return to its normal, 
nonoverflow state. 

Consider an example of the Q12 representation. The 
number 7.5 multiplied by itself gives the result of 56.25, an 
overflow in Q12. However, no hardware overflow occurs 
in the accumulator; i.e., 


7.5 0111.1000 0000 0000 
x 7.5 __0111.1000 0000 0000 
56.25 0011 1000.0100 0000 0000 0000 0000 0000 


For the Q12 representation, the above 32-bit product 
is shifted left four bits and the left-most 16 bits are retained: 


1000.0100 0000 0000 


The correct answer is 56.25, but the number stored in 
the Q12 representation is —7.75. 

The TMS32010 has a built-in overflow mode of 
operation that, if enabled, causes the accumulator to saturate 
upon detection of an overflow during addition when the 
accumulator register overflows. During multiplication, an 
overflow of the fixed-point notation may also occur even 
though the hardware overflow of the accumulator register 
does not occur. This is because the 32-bit result of a 
multiplication of two 16-bit numbers must be stored in a 


16-bit memory word in the form consistent with the chosen 
fixed-point notation (see above example). To adjust the 
location of the binary point, the storing operation requires 
that the number in the accumulator be shifted left and 
truncated on the right before storing. If the most significant 
bits shifted out contain magnitude information in addition 
to sign information, an overflow in the chosen fixed-point 
notation results. 

To track overflows associated with the number 
representation, the control system software should contain 
an appropriate overflow-checking routine in those places 
where multiplications and additions occur. This routine 
should not rely exclusively on the TMS32010’s overflow 
mode to intercept and correct the overflow occurrences. 

Two approaches may be used to handle overflows. The 
first is to prevent the overflow from occurring by choosing 
conservative scaling factors for the numbers used in 
computations, as described in the subsection, FIXED-POINT 
ARITHMETIC AND SCALING. These scaling factors are 
used to limit the range of inputs to each of the basic building 
blocks of the compensator, namely, the first- and second- 
order filter sections. The scaling factor chosen reduces the 
input magnitude and consequently all other signal levels, 
thereby enabling the compensator coefficients, the expected 
inputs, and their products and sums, all to be represented 
without overflow. The scaling must also maintain the signal 
levels well above the quantization noisé. 

The second approach for handling overflow is to adjust 
the sum or product each time an overflow occurs. To 
accomplish this, an overflow checking routine must be 
written and executed at certain points along the computational 
path. The routine must check whether the number just 
computed and residing in the 32-bit accumulator can be stored 
without overflow in a 16-bit memory location in accord with 
the chosen fixed-point notation. Once the routine detects an 
overflow condition, it should replace the computed number 
with the maximum or minimum representable two’s- 
complement number. This scheme simulates a saturation 
condition present in analog control systems. To prevent 
overflow limit cycles, the saturation overflow characteristic 
is preferred to the two’s-complement, ‘‘wrap-around’’ 
characteristic.9 


An example of the overflow checking and correcting 
technique for a first- and second-order filter subroutine is 
provided in the section, TMS32010 IMPLEMENTATION 
OF COMPENSATORS AND FILTERS. This Direct-Form 
II implementation subroutine checks for overflow 
occurrences upon computation of the filter’s intermediate 
state variable and again upon computation of the filter’s 
output. 

In a digital control system, the first- and second-order 
building blocks are either cascaded or connected in parallel 
to compute a series of control algorithms. The first- and 
second-order filter subroutine, called to compute each of the 
control system’s elements, uses 16-bit memory locations as 
storage media for its intermediate values, in which case it 
is appropriate to check for overflow in each block. 
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At the end of a computational chain — before the final, 
computed digital output is ready for transfer to the analog 
domain — it is necessary to check that the number being sent 
to the digital-to-analog converter is within the range based 
on the manner in which the converter is interfaced to the 
processor data bus. For example, if a 12-bit converter is 
wired to the 12 least significant bits (LSBs) of the 16-bit 
processor data bus, then the 12 LSBs must contain both 
magnitude and sign information, which may require that the 
original 16-bit number be adjusted or limited before being 
sent to the converter. 

Underflow conditions, which can also appear during 
digital contro] algorithms, are conceptually similar to 
overflows in that the computed value contained in the 32-bit 
accumulator is too small to be accurately represented in a 
16-bit memory word in the chosen fixed-point notation. One 
possible solution to this problem is to multiply the small result 
by a gain constant to raise its value to a representable level. 
The appropriately chosen gain constant may come as a result 
of gain distribution throughout the digital control system, 
whereby large gains from some of the building blocks get 
uniformly distributed over a range of the system’s sections. 


DIGITAL CONTROL SYSTEM 
SOFTWARE DESIGN PHILOSOPHY 


To maximize the manageability and portability of the 
system software, a modular or top-down design technique 
should be used. This section shows how the modular software 
structure and the proper layout of system memory contribute 
to the efficient implementation of a digital control design. 


Modular Software Structure 

The concept of modular software design is a technique 
developed to make system software more manageable and 
portable. Top-down design is used to break up a large task 
into a series of smaller tasks or building blocks, which in 
turn are used for structuring a total system in a level-by-level 
form. At the end of a top-down design process, a number 
of modules are linked together which, under the control of 
a main program, perform as a complete system. 

In addition to making the software-development and 
software-modification processes more manageable, modular 
design also enhances software portability. Digital control 
systems use a number of standard functional blocks such as 
compensators, notch filters, and demodulators. It is therefore 
likely that a designer who already has access to one digital 
control system will want to ‘‘borrow’’ some of its functional 
building blocks to quickly implement a new, different control 
unit or reconfigure the existing one. The designer who has 
access to these functional blocks or modules needs only 
modify the main program by providing a different sequence 
of subroutine calls. An initialization routine, a first- and 
second-order filter routine, a roundoff routine, and an 
overflow checking routine are examples of functional 
building blocks. 


Each software module is written as a subroutine with 
a Clear and efficient interface (for parameter passing, stack 
use, etc.) with the main program. In order to maintain the 
general-purpose function of the module, the data used in 
computations within a module (i.e., filter coefficients, state- 
variable values, etc.) should be accessed using indirect 
addressing rather than direct addressing. Only those variables 
whose values remain unchanged should be addressed directly. 


Layout of TMS32010 Data Memory 

The layout of the TMS32010 data memory in a digital 
control system implementation should be defined in 
accordance with the requirements of the software modules 
used in the implementation of the system. The procedure is 
illustrated by the first- and second-order filter subroutine of 
the section, TMS32010 IMPLEMENTATION OF 
COMPENSATORS AND FILTERS. This subroutine 
manipulates its pointer registers so that upon completion of 
the computations in one filter section, the registers 
automatically point to the set of coefficients and state 
variables of the next filter section. 

If the software designer arranges his filter coefficient 
and state-variable sections in the order of execution of the 
control-system algorithms, a sequence of compensators and 
filters may be executed with a single subroutine call for each 
element. This scheme enables faster execution of the control 
algorithms since there is no need to explicitly reload the 
pointers in order to match the requirements of the current 
software module being called. 

A designer must define all of the data memory locations 
(set up a system memory map) at the beginning of the 
program. An efficient way to accomplish this is to use the 
TMS32010 assembler’s DORG (dummy origin) directive. 
This directive does not cause code generation. DORG defines 
a data structure to be used by the system; i.e., it generates 
values corresponding to the labels of consecutive data 
memory locations. Using the DORG directive, as opposed 
to equating labels with data memory locations through the 
EQU directive, provides flexibility when the data structure 
needs to be modified. For example, when defining a number 
of new data memory locations, the labels are inserted in the 
middle of the ‘‘dummy’’ block and the assembler assigns 
the values automatically. This function would have to be 
performed manually if the EQU directive were used. 

The software designer must also build a table in the 
TMS32010 program memory that corresponds to the 
previously defined data memory map. The table is then 
loaded into the data memory by the initialization routine 
during system startup. 

These techniques are illustrated in the next section, 
TMS32010 IMPLEMENTATION OF COMPENSATORS 
AND FILTERS. Note that in the example program in 
Appendix B, location ONE has to be the last location in the 
table. Note also that the states and the coefficients of the 
filters are defined in reverse order to the order in which the 
filters execute. This is due to the way the initialization and 
filter routines are written. 
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TMS32010 IMPLEMENTATION OF 
COMPENSATORS AND FILTERS 


Design procedure and error handling for the standard 
first- and second-order compensator and filter subroutine are 
described in this section. Methods for implementing higher- 
order structures, implementation tradeoffs, and examples of 
typical compensators and filters are also given. 


Standard First- and Second-Order Block 
as a Subroutine 

A standard first- and second-order compensator section 
is a prime example of the building block philosophy discussed 
earlier. The routine presented here computes first- and 
second-order IIR filter sections using the Direct-Form II 
network structure!2 and performs roundoffs and overflow 
checking. The Direct-Form II, although it somewhat obscures 
the definition of the variables, is chosen over the Direct-Form 
I because it requires fewer ‘‘delays’’, i.e., data storage 
locations, in its computational algorithm. 

Consider the second-order transfer function: 


D(z) _NO + Niz—1 + N2z-2 — U@) 
~ 1 + Diz-!l + D2z-2 — E@ 


For Direct-Form II, the corresponding difference 
equations are: 


x(n) = e(n) — DI x(n-1) — D2 x(n-2) 
u(n) = NO x(n) + NI x(n—-1) + N2 x(n—2) 


The signal flowgraph for this transfer function is shown 
in Figure 3. 
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Figure 3. Direct-Form II Compensator/Filter 


The filter routine accommodates a scaling scheme as 
defined on the main program level; i.e., the values can be 
scaled by 215 or 212. The routine is written so that chain 
implementation of a number of compensators and filters is 
possible if the data structure, namely, the coefficient and 
state-variable tables, are properly arranged (see the previous 
section, CONTROL SYSTEM SOFTWARE DESIGN 
PHILOSOPHY). The routine takes its input from the 
accumulator and outputs the result to the accumulator so that 
the main program can efficiently call for the successive 
execution of the filter routine with a different set of 
parameters each time. 
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Two checks for overflow are made within the filter 
routine. One is made upon computing the value of the 
intermediate (state) variable and the other upon computing 
the filter’s output. Each overflow check determines whether 
the 32-bit computed result can be stored in a 16-bit memory 
location under the adopted scaling scheme. If an overflow 
condition occurs, the routine ‘‘saturates’’ the output; i.e., 
it returns the maximum or minimum representable value. 

A drawback of overflow checking upon computing the 
output is the loss of precision of the least-significant bits of 
the accumulator, which are truncated during the accumulator 
storing operation. This loss of precision is insignificant, 
however, in comparison with the loss of precision due to an 
overflow condition. 

The first-order filter section is computed exactly like 
the second-order section. The two coefficients N2 and — D2 
that multiply the “‘oldest’’ value of the intermediate (state) 
variable, i.e., the bottom branch of the filter in Figure 3, 
are equal to zero. This scheme reduces the second-order 
digital filter to the first-order filter. An example program 
that uses the first- and second-order filter routine to compute 
several elements of a digital control system is given in 
Appendix B. 


Higher-Order Filters: Cascade Versus 
Parallel Tradeoffs 

A higher-order filter or compensator in a digital control 
system can be implemented either as a single section or as 
a combination of first- and second-order sections. The single 
section or direct implementation form is easier to implement 
and executes faster, but it generates a larger numerical error. 
The larger error occurs because the long filter computation 
process involves a substantial accumulation of errors resulting 
from multiplications by quantized coefficients and because 
the roots of high-order polynomials are increasingly sensitive 
to changes in their (quantized) coefficients. For this reason, 
the direct realization form is not recommended except for 
a very low-order controller. 

The suggested method of implementing a high-order 
transfer function is to decompose it into first-order blocks 
(to accommodate single real poles) and second-order blocks 
(to accommodate complex conjugate poles or pairs of real 
poles), and connect these blocks either in a cascade or a 
parallel configuration. 

For the cascade realization (see Figure 4), the transfer 
function must be decomposed into a product of first-order 
and second-order functions of the form: 


D(z) = K D,(z) Doz). . . Daz) 
Each second-order block has the form: 


NO + Ni z—! + N2z~-2 


BiG) = 1 + Diz-! + D22z-2 


Each first-order block is obtained by equating the 
coefficients of (z—2), i.e., N2 and D2, to zero. 
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Figure 4. Cascade Implementation of a High-Order 
Transfer Function 


The designer must decide how to pair the poles and 
zeroes in forming the D;(z). A pole-zero pairing algorithm 
that minimizes the output noise is available.!9 The ordering 
of the D;(z) also affects output noise due to quantization and 
whether or not limit cycles are present.10 

For the parallel realization (see Figure 5), the transfer 
function is expanded as a sum of first-order and second-order 
expressions of the form: 


D(z) =K + D,(z) + Do(z) +... + Dalz) 


where the first-order blocks have the form: 


NO 


Di@) = TDi gt 


and the second-order blocks have the form: 


NO + Ni z—! 
1+ Diz-1 + D2z-2 


D\(z) = 


Figure 5. Parallel Implementation of a High-Order 
Transfer Function 


The concept of ordering of D;(z) does not apply to the 
parallel configuration. 

Pole-zero pairing is fixed by the constraints imposed 
by the partial-fraction expansion. 

The parallel realization has an obvious advantage over 
the cascade form when an algorithm executes on a 
multiprocessor system where the filter algorithm can be split 
up among the processors and run concurrently. 

No significant difference is apparent between the 
parallel and cascade realizations in performance factors such 
as execution speed and program/data memory use when the 
algorithms execute on a single processor.!3 The parallel 
algorithm could possibly provide more precision in 
computing the filter’s output if the designer decided to save 
the double-precision (32-bit) results from each of the first- 
and second-order sections and perform a double-precision 
addition to calculate the final filter output. 


Examples of Typical Compensators 
and Filters 

Examples of structures used in digital control systems: 
compensators (lowpass filters), and notch filters are shown 
in Table 1. The analog and digital versions of the transfer 
functions, along with the scaled form of the transfer functions 
and their coefficient values, are given. A complete example 
of a transformation from the analog to the digital domain 
using bilinear transformation with frequency prewarping is 
presented in Appendix A. 

The gain constant that appears in some transfer 
functions can be implemented either by integrating it into 
its transfer function or by distributing it over a number of 
filter sections in a cascade implementation scheme. 


PROCESSOR INTERFACE CONSIDERATIONS 


Alternatives should be considered when designing the 
data-acquisition portion of the digital controller hardware. 
This section addresses the A/D and D/A converter selection, 
different analog sensor interface methods, and 
communication with the host processor. 


Table 1. Examples of Analog and Digital Versions of Common Transfer Functions 


1.041.027! 


= 0.0124 
Diz) = 0.012436 = 59. 99975 2-1 


S2 + 68.26 + 3943 


G m1 —— Or 
(9) = 1000 32+ 25128 +6.31 x 108 


D(z) = 870.77 


100-Hz Notch Filter: 


$2 +3.9478 x 106 


Gis) = 52+ 126.6648 + 3.9478 x 108 


1.0- 1.9824 z~1+0.9826 z-2 
1.0-1.7548 z~1+0.6474 2-2 


0.98467 — 1.94534 2- 1+0.98467 2-2 
1.0- 1.94534 z-14+0.96935 z—2 


Scaled Digital Transfer Function 


Scaling Factor = 215 
Diz) = _ 408+408 27" z7! 
32768 — 32760 32768 — 32760 z-! 
Scaling Factor = 212 
N1 = ~8120 
4096 - 8120 2-1+4025 2-2 N2 = 4025 
4096 - 5140 z-1+2242 2-2 Di = §140 
O2 = ~2242 


Diz) = 870.77 


Scaling Factor = 212 NO = 4033 
N1 = -7968 
N2 = 4033 
D1 = 7968 


02 = -3970 


4033 - 7968 z~'+4033 2-2 


Diz) = “4096 - 7968 7-1+3970 2-2 
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A/D and D/A Conversions and 
Integrated Circuits 

The A/D and D/A converter selection for a control 
system design may be based on several factors. Among the 
most crucial factors are the maximum conversion speed of 
the converter and the wordlength of the device. 

The A/D conversion speed relates directly to the 
required sampling rate of the specific application. This rate 
is determined by the need to sample fast enough to prevent 
aliasing and excessive phase lag and to sample slow enough 
to avoid the unnecessary expense and accuracy of high data 
rates. 

The A/D wordlength should be chosen based on a 
worst-case analysis using the following two criteria: 

1. The dynamic range of the continuous input 
signal, and 
2. The quantization noise of the A/D converter. 

For dynamic range, the designer should determine the 
minimum and maximum values of the continuous input that 
need to be accurately represented and select the A/D 
wordlength in bits based on the resolution required within 
this range. 

Quantization noise is due to the quantization effect of 
the A/D. The value of this noise during a single conversion 
can be represented by the difference between the exact analog 
value and the value allowable with the finite resolution of 
the A/D. This quantization noise may assume any value in 
the range —q/2 to +q/2 for a rounding converter or 0 to q 
for a truncating A/D converter where q is the quantization 
level. The quantization level q is equal to the full-scale 
voltage range divided by 2B where B is the number of bits 
in the converter. The quantization noise may be modeled as 
uniformly distributed noise. The designer should make his 


choice based on the maximum acceptable quantization level. 

The D/A converter wordlength should be chosen in a 
similar manner to choosing the A/D wordlength by 
considering the dynamic range of the output signal. 

The effects of A/D and D/A converter wordlength on 
the performance of a high-speed control system are detailed 
in the University of Arkansas study (see the section on the 
design example of the TMS32010-based rate-integrating gyro 
positioning system). The study analyzed the time-domain 
performance of the system (unit impulse; step, ramp, and 
torque-disturbance response) as a function of A/D and D/A 
wordlengths. Twelve-, fourteen-, and sixteen-bit converters 
were used. The only significant difference found between 
them was the steady-state error. Twelve-bit converters were 
found to be adequate. 

In a multi-input digital control system, the signal 
acquisition portion of the digital controller must provide for 
the multiplexing of several analog inputs into a single A/D 
converter. Consequently, some external devices are needed 
to pre-filter (antialiasing filters), sample and hold the analog 
signals from each channel (S/H circuits), and multiplex the 
signals onto the A/D converter (analog multiplexer). 
Multiplexing and filtering may also be necessary at the output 
in cases where the digital control system computes multiple 
outputs for the control of the plant. 

Two configurations of a cost-effective, multichannel 
data acquisition system for a digital controller are shown in 
Figure 6. The first accommodates up to eight inputs; the 
second can accommodate up to 32 inputs. Note that in these 
two systems, only one S/H per eight inputs exists, and the 
variables are sampled in sequence with the same sampling 
interval between successive samples of a given signal. There 
will be a ‘‘skew’’ in time between the samples of the various 
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Figure 6. Cost-Effective Data Acquisition Systems 
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inputs with a possible unwanted effect on the system 
performance. In this case, the use of a fast A/D converter 
may be justified to minimize this effect. 

If truly simultaneous sampling is required, an array of 
S/H circuits may be used to capture the values of all the inputs 
concurrently. This solution, shown in Figure 7, is more 
expensive due to the cost of S/Hs. In such simultaneous 
sampling systems, a fast conversion must be performed 
before the signal values present on the S/Hs start to droop. 
Therefore, the maximum conversion rate must be fast enough 
to accommodate this constraint. 


S/H 


Figure 7. Data Acquisition System with 
Simultaneous Sampling 


In some cases, high-speed A/D converters (100- to 
500-kHz conversion rates) are required. Two 12-bit A/D 
devices that will accommodate these speed requirements are 
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the ADC85 and the AD5240 from Analog Devices. The 
ADC85 allows a conversion rate of up to 100 kHz and the 
ADS5240 up to 200 kHz. There are other converters available 
from several manufacturers. 

D/A converters, on the other hand, are inherently 
faster, the selection is much broader, and the cost is less. 

When high-speed converters are not needed (when only 
a few input channels exist or a single converter per channel 
is justified), devices such as the TCM2913 and TCM2914 
codecs may be useful in digital control applications. Although 
telecommunications-oriented, these devices are low in cost 
and provide on-chip antialiasing and smoothing filters. The 
TCM2913 or TCM2914 both contain A/D and D/A 
converters. The 8-bit digital output of the A/D and the 8-bit 
digital input to the D/A are both arranged in a companded 
(compressed/expanded) form using yp-law or A-law 
companding techniques. The p-law and A-law companding 
techniques allow small numbers to be represented with 
maximum accuracy, but require a conversion routine before 
the companded samples can be used in two’s-complement 
computations. Such conversion routines are based on lookup 
tables and need only a few TMS32010 instruction cycles to 
execute. !4 The devices interface to the processor in a serial 
form and convert the data at a maximum rate of 8 kHz. 

All of these data acquisition systems can accommodate 
differential inputs from analog transducers, such as pressure 
sensors, strain gauges, and others. To maintain accuracy in 
the case of a low-level input signal and to minimize noise 
effects, twisted-pair leads can be used to connect the 
transducer output to an instrumentation amplifier 
(differential-to-single-ended conversion circuit) that in turn 
is connected to the analog multiplexer. Alternatively, 
balanced twisted-pair leads can be connected to a differential 
analog multiplexer which drives an instrumentation amplifier 
of the same kind. The amplifier rejects the common-mode 
noise and presents the single-ended output to the S/H circuit 
and the A/D converter.!5.16 These two configurations are 
shown in Figure 8. 


INSTRUMENTATION 
AMPLIFIER 


> S/H AND A/D 


Figure 8. Differential Input Configurations 
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Synchronization of the Processor 
and External Devices 

In a multichannel data acquisition system with one A/D 
converter, a designer must generate a sequence of timing 
signals to synchronize the S/H circuits, the analog 
multiplexer, the A/D converter, and the input/output latches 
with the operation of the TMS32010. The designer may 
decide to generate the timing signals from the TMS32010 
clock by subdividing its frequency or using a timing and 
control circuit based on its own clock. 

An alternative way to build a multi-channel data 
acquisition system is to designate a separate A/D for each 
channel. In this case, the timing-signal generation is simpler 
and the A/D converters used may be slower and less costly, 
although more of them are necessary. The designer should 
perform a tradeoff analysis based on board space, overall 
system cost, and power consumption. 


Communication with Host Computer 

In addition to having a fast signal-processing 
microprocessor, a need may exist for an executive processor 
to monitor the system’s operation. Such an executive 
processor would be used for system startup/initialization 
(coefficient and initial-condition loading), responding to 
emergency conditions such as overflow and underflow, 
system reprogramming/reconfiguration (loading a new 
program or a new set of coefficients), and a thorough system 
test and calibration. The system should be constructed so that 
the executive processor can interrupt, halt, or alter the 
execution of the signal processor at any time in response to 
contingency situations. 


DESIGN EXAMPLE: RATE-INTEGRATING 
GYRO STABILIZATION LOOP 


An example of using the TMS32010 processor to 
implement a digital control system is presented in this section. 
Sampling rate selection, the system’s hardware and software, 
and system performance are discussed. . 


System Description 

The system used as an example of the application of 
the TMS32010 is a servo-control system for stabilizing a 
large, two-axis gimbaled platform with a DC-motor drive. 
Inertial rate-integrating gyroscopes mounted directly on the 
platform serve as angular motion sensors. Such systems are 
required for the precise control of line-of-sight (LOS) and 
line-of-sight rate for use in pointing and tracking applications 
for laser, video, inertial navigation, and radar systems. 

At present, digital control is not normally used in 
systems of this type because of the fast throughput rates and 
computational accuracy required to perform the control 
computations and notch filtering. Current line-of-sight 
stabilization systems continue to use analog electronics to 
implement servo-compensation functions and error-signal 
conditioning. Thus, the system is representative in 


complexity and performance of typical systems currently in 
use by the aerospace industry and are candidates for 
microprocessor-based digital control. 

The digital control system was designed as part of a 
research contract carried out by the University of Arkansas 
under the sponsorship of Texas Instruments from February 
1982 to February 1984. 18,19 


System Model and Control Compensation | 

A single axis of the stabilization system has two primary 
control loops: the rate loop and the position loop. In addition, 
a tachometer loop exists within the position loop. The rate 
and position loops are identified in Figure 9, a diagram of 
the elevation axis of the system. In its analog version, the 
system employs analog electronics to implement all control 
compensation and signal conditioning functions. 

Figure 10 identifies those filters and compensators in 
the rate loop that are to be incorporated into the digital control 
system. 

This study’s approach provides a digital implementation 
of the designated analog elements of the rate loop without 
sacrificing closed-loop performance. In keeping with the 
recommendations of the DIGITAL COMPENSATOR 
DESIGN section, the technique for the conversions of the 
analog compensators and notch filters to their digital 
counterparts is the bilinear transformation with frequency 
prewarping. Within the rate loop, the transfer functions to 
be implemented digitally consist of a first-order and a second- 
order compensator, along with six notch filters. Within the 
position loop, there is one first-order compensator and one 
notch filter. The transfer functions, shown in Table 2, list 
both the analog prototypes and their digital equivalents. 

The sampling rate chosen is 4020 samples per second 
(sampling period is 249 ys). This rate is more than twice 
the highest frequency of consequence (1800 Hz, the highest 
rate-loop notch frequency) to prevent aliasing. The rate is 
fast enough to prevent excessive phase lag in the rate loop 
and is more than ten times the closed rate-loop bandwidth 
(approximately 80 Hz).5 The rate was also chosen to be an 
integer multiple of 30 Hz, which is a commonly used update 
rate of the video and infrared imaging/tracker devices that 
provide the line-of-sight rate command to the stabilization 
system’s rate loop. The update rate of the imaging device 
and the sampling rate within the rate loop are thus 
synchronized. 

After simulating the closed rate loop, the phase margin 
was found to be five degrees less than it was for the all-analog 
system, due to the computational and other delays associated 
with sampling. To overcome this deterioration in phase 
margin, the second-order rate-loop compensator was 
redesigned to provide additional phase lead. The compensator 
was modified to provide enough additional phase lead so that 
the phase margin of the digital system matched that of the 
analog system. The modified compensator is listed in the 
table. 
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Figure 9. Line-of-Sight Stabilization/Pointing System Elevation Axis 
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Figure 10. Line-of-Sight Stabilization/Pointing System Rate-Loop [19] 


Table 2. Analog and Digital Compensators and Notch Filters 


Digital Transfer Function 


Analog Transfer Function (f, = 4020 Hz) 


Filter Element 
Rate-Loop 
1st-Order 


Compensator 


0.1244 (1.0+1.0 2-1) 
1.0-0.99975 Z-1 


Rate-Loop 
2nd-Order 
Compensator 


1000 (S2+62.8S + 3943) 
S2 + 2512S +6.31x106 


754.7101 (1.0- 1.98426 Z-1+0.9845 Z~ 2) 
1.0-1.255 2-14+0.5474 2-2 


Rate-Loop S2+(2x 1800)2 


: 7+1.83411 Z2-14+0.96877 2-2 
1800-Hz 2x 1800 0.96877 + 1.83411 27 1 +0.96877 Z~* 


1.04+1.83411 Z-1+0.93754 2-2 


; $24 ——— S+(2x 1800)2 
Notch Filter 5 
Rate-Loop 
900-Hz 
Notch Filter 


Rate-Loop 
560-Hz 
Notch Filter 


Rate-Loop 
140-Hz 
Notch Filter 


Rate-Loop 
120-Hz 
Notch Filter 


Rate-Loop 
100-Hz 
Notch Filter 


Position-Loop 
1st-Order 
Compensator 


Position-Loop 
400-Hz 
Notch Filter 


S2 + (2x 900)2 
2x 900 


S$ +(2x 900)2 
2.5 


S2+(2x 560)2 


2x 560 
= S +(2x 560)2 


S2 +(2x% 140)2 


140 
== S+(2x 140)2 


S2+(2x 120)2 


2x 120 
a S+(2x 120)2 


$2 +4(2x 100)2 


2x 100 
sf S+(2x 100)2 


66S + 45.54 
S 


S2+(2x 400)2 


2x 400 
= S+(2" 400)2 


0.97875 —- 1.91083 Z-1+0.97875 Z-2 
1.0-1.91083 Z~14+0.95751 2-2 


0.98467 — 1.94534 Z—- 1+0.98467 z-2 
1.0-1.94534 2-14+0.96935 2-2 


0.94471 — 1.53204 Z-14+0.94471 Z-2 
1.0-1.53204 Z2-1+0.88942 Z-2 


24. Control System Compensation and Implementation with the TMS32010 


0.8352-0.27291 Z— 1+0.8352 2-2 
1.0-0.27291 Z—-1+0.67041 2-2 


0.9287 - 1.19021 Z-1+0.9287 Z-2 
1.0-1.19021 Z-1+0.8574 2-2 


0.9817 —- 1.92896 Z-1+0.9817 2-2 
1.0-1.92896 Z-1+0.96339 Z-2 


6.60566 — 6.59434 Z-1 


1.0-Z-1 
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Hardware 

In the digital control system, the analog compensators 
and the notch filters are replaced by a digital signal processor, 
the TMS32010, along with the additional interface hardware 
needed to provide the digital input signals to the controller 
and the analog signals to the plant. Figure 11 shows the 
system hardware block diagram. 

The hardware was packaged onto five wirewrap 
boards. It was fabricated as a prototype test bed, and was 
constructed from commercially available components that 
have military-specification counterparts. Twelve-bit A/D and 
D/A converters were used, based on the studies of time- 
domain performance characteristics of the system. 


Software 

The TMS32010 software is composed of four modules: 
Initialization Routine, Main Program, Rate-Loop 
Subprogram, and Subroutines. Figure 12 shows the system 
software block diagram. 

The Initialization software disables and enables 
interrupts, loads data memory with filter coefficients, 
program constants, and gain terms, and initializes the 
TMS32010 registers. 

The Main Program software calls the Delay Subroutine 
at the beginning of each sample period to wait for the A/D 
to complete conversion of all input variables. It then does 
on-line compensation for the error signal sensor variation 
by executing the A/D Drift Subroutine. The Main Program 
then reads «* value of the input variable, calls the Rate-Loop 
Subprogram to compute the control output, and, when that 
subprogram returns the output variable, loads it into the 
appropriate output register. 

The Rate Loop Subprogram calls subroutines that 
perform each compensator and notch filter computation and 
checks the computed output for overflow. 

The Subroutines consist of a single routine for 
performing any of the compensator or notch-filter 


GUARD 
FILTER 


SAMPLE 


8-CHANNEL 


computations (first- and second-order .''ter routine), along 
with routines for checking overflow, providing delay, and 
performing multiplication of low-precision numbers by a 
constant. 

The A/D Drift Subroutine compensates on-line for the 
variations in the rate-loop error signal sensor (as a function 
of time and temperature). The subroutine uses an external 
calibration input and follows the model of the sensor 
variations to estimate the true value of the A/D input. 

The digital control system is interrupt-driven. An 
interrupt occurs every 1/4020 seconds (approximately 250 
ps). This starts the A/D conversion of a new set of sample 
inputs and restarts the TMS32010 on a new pass through its 
software. 

A TMS32010 Evaluation Module (EVM) and Emulator 
(XDS) were used in the software development to permit 
single-step execution of the software for comparison with 
the corresponding computations produced by simulations 
written with the aid of the Continuous System modeling 
Program (CSMP). These simulations take into account the 
input/output signal quantization levels, microprocessor 
architecture, memory and internal register lengths. 

Other software functions associated with a complete, 
self-contained control module include: 

1. System calibration, testing, and startup 

2. Error checking and contingency responses 

3. Setting of gains, time constants, and other 
programmable or adjustable parameters 

4. System shutdown. 

These functions are implemented by a general-purpose 
executive processor (SBP9989), thus allowing the TMS32010 
to handle computation-intensive tasks. 


System Performance | 
The system performance was evaluated in the following 


two-step procedure: 
1. A hybrid computer system was constructed 
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Figure 11. Digital Controller Hardware Block Diagram [19] 
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Figure 12. Digital Controller Software Block Diagram [19] 


consisting of an analog-computer implemen- 
tation of part of the nondigital portion of the 
rate loop coupled with the TMS32010-based 
digital controller. 

2. A full-scale CSMP simulation of the entire rate 
loop was conducted. 

The closed-loop performance of the rate loop was 
characterized by the following responses: rate-command step 
response, torque-disturbance step response, and torque- 
disturbance frequency response. The results are shown in 
Figures 13 through 15. 

In the rate-command step response, the percent 
overshoot, peak time, and settling time are similar for both 
the discrete and continuous systems, but the continuous 
system is slightly smoother. The torque-disturbance step 
response shows that the discrete system is slightly slower 
in correcting for a torque disturbance input. In addition, the 
discrete system has a low-level oscillation (limit cycle). The 
frequency responses for a torque-disturbance input are also 
similar, with the continuous system having slightly better 
torque disturbance rejection in the low-frequency region. 
These results show that the analog and the digital systems 
are comparable even though no special efforts were made 
to take advantage of the capability that digital control offers. 

The flexibility of the digital control system was 
demonstrated by programming the digital system with the 
capability to correct for a variation in the sensor input to the 
A/D converter. The system was able to correct on-line (by 
using a known standard, calculating the gain, and dividing 
it out) for a 50 percent sinusoidal variation in the sensor gain. 

The conversion between two different stabilization 
systems serves as another flexibility example. The software 


of a small, two-axis stabilization system was converted to 
the software of the higher-precision, large, two-axis 
gimbaled-platform stabilization loop described earlier. The 
only modification required was in the Main Program and the 
Rate-Loop Subprogram for the latter system. The modular 
software design procedure made possible the use of most of 
the building blocks (subroutines) in the implementation of 
the new controller. 

In general, the study demonstrated the technical 
feasibility of digital control for a wide-bandwidth, high- 
precision type of system. Due to the limited scope of the 
study, the full power of digital control was not utilized, in 
that the control algorithms were constrained by the design 
to emulate their analog prototypes. It is likely that significant 
performance improvements could be achieved by advanced 
control techniques. 

Additional capacity in the TMS32010 remains to 
accommodate improved, more sophisticated compensators. 
Table 3 shows the TMS32010 utilization. 


Table 3. TMS32010 Utilization 
(LOS Stabilization System Rate-Loop) 


Program 
Memory — | 


Data 76 144 
53% 
Memory words words 
Execution 
Time 


* Based on a 16-MHz (i.e., less than maximum) clock rate. 
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Figure 13. Rate-Loop Rate Command Step Response [19] 
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Figure 14. Rate-Loop Normalized Torque-Disturbance Step Response [19] 
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Figure 15. Rate-Loop Normalized Torque-Disturbance Frequency Response [19] 


Other microprocessors that were considered for 
implementing this digital controller system were: Intel 8086, 
Zilog Z-8000, Motorola 68000, and the Fairchild 9445. 
These microprocessors were unable to meet the criterion that 
the maximum allowable time between samples for processing 
be 250 ws. Among the signal-processing microprocessors, 
the AMI 2811, while apparently fast enough, has only a 
12 x 12 multiplier; and the Intel 2920 has only four inputs 
and no branching instructions. 

The principal limitation of the TMS32010 was that of 
having eight inputs and eight outputs. Except for this 
restrictior , che processor would have been able to carry out 
the processing for both axes of the two-axis gimbaled 
platform. This limitation could be removed by the addition 
of logic circuitry. 
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APPENDIX A 
Development of a Digital Compensator Transfer Function 


The development of a digital equivalent of an analog compensator transfer function using the bilinear transformation with 
frequency prewarping is shown in this appendix. The technique is described in the section, DIGITAL COMPENSATOR 
DESIGN. 


Beginning with an analog prototype transfer function, 


S2 + 68.2 S + 3943 


G(s) = 1000 337379512 § + 6.31 x 100 


The sampling frequency to be used in converting to a digital equivalent is f = 4020 Hz (i.e., the sampling period 
T, = 1/4020 s = 248.76 x 10-65). 


The characteristic equation of this analog transfer function is: 
S2 + 2512S + 6.31 x 106 = 0 


which fits the standard, second-order form: 


S2+2fonS + w2 = 0 


The natural frequency wy, = \/6.31x106 = 2511.9713 rad/s 


To compensate for nonlinear mapping of analog-to-digital frequencies by the bilinear transformation method, the natural 
frequency is prewarped according to the formula: 


ee ee tan 2511.9713 x 248.76 x 106 
Op <7 lan 4” ~ 948-76 x 10-6 2 


= 2597.03 rad/s 
This prewarping scheme matches exactly the natural frequency in the analog and digital domains for the compensator. 


To obtain the prewarped version of the analog transfer function, the complex variable s in the original transfer function is 
replaced with (Wo/wp)s. It is therefore convenient to compute the ratio: 


Wo _ _2511.9713 


Ge 25e7 03 


The prewarped G(s), i.e., G(s) is then computed as: 
1000 (0.9672 s)* + 68.2 (0.9672 s) + 3943 
(0.9672 s)2 + 2512 (0.9672 s) + 6.31 x 106 


s2_+ 70.51 s + 4214.87 
s2 + 2597.16 s+ 6.75 x 106 


Gp(s) 


= 1000 


Bilinear transformation is next applied to Gp (s) whereby the continuous variable s is replaced by the expression that involves 
the discrete variable z: 


- 2 
Re ie T zt+i 
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This produces the discrete transfer function D(z). 
For the compensator, 


D(z) = Gps) ae es tees 
saa ae Te 


= 1000 s2 + 70.51 s+ 4214.87 
~ s2 + 2597.16 s+ 6.75 x 106 
2 z—1 


248.76x10-6 z+1 


Ss = 


After further computations, 


1.0 — 1.9824 z—! + 0.9826 z—2 
D@) = 706.76 19 — 1.2548 2-1 + 0.5474 2-2 

The final step is the gain adjustment in the digital transfer function. This can be accomplished by matching the analog and 
digital gains at some predetermined frequency, for example, DC. 


For the DC case, s = jw = O and from the bilinear transformation: 


_ 2st -. 1 
ey Ory ee 


Therefore, at DC, G(O) = D(1). 
For this transfer function, GO) = 0.6249, D(1) = 0.5072. If G(O) = K x D(\), then the constant K becomes 


0.6249 
0.5072 


= 1.2321 


The final form of the digital equivalent transfer function is: 


1.9824 z—1 + 0.9826 z-2 
1 


7 10 - 
D(z) = 870.77 +9 49548 2=1 4 0.5474 2=2 


where the gain of 870.77 is the product of K x (the unadjusted digital gain), i.e., 870.77 = 1.2321 x 706.76. 


~ 
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APPENDIX B 
TMS32010 Example Program 


An example TMS32010 program that uses the first- and second-order filter routine to compute several elements of a digital 
control system is provided in this appendix. The program illustrates the concepts of modular software design, data memory 
layout, and cascade implementation of high-order transfer functions. The program was executed on a combination of the 
TMS32010 Emulator (XDS) and Analog Interface Board (AIB) using random noise as input. The input was sampled at a 
4000-Hz rate. 


The following transfer functions are implemented with Q12 scaling: 


0.8352 — 0.2729 z—! + 0.8352 z—2 


900-Hz Notch. Filter D(z) 1.0 — 0.2729 z-1 + 0.6704 z-2 
7 0.9688 + 1.8341 z-1 + 0.9688 z~2 
1800-Hz Notch Filter DZ) = “Toq¢ 1.8341 z-1 + 0.9375 z-2 


Other transfer functions (compensators, notch filters) can be implemented in identical fashion by expanding the data structure 
(filter coefficients and states) and making additional filter routine calls to compute these elements. 


The output, as observed on a spectrum analyzer, is shown in Figure B-1. 


Figure B-1. Spectrum Analyzer output (900-Hz and 1800-Hz Notch Filters) 


The first notch from the left is at 900 Hz, the second is at 1800 Hz. The attenuation of the notch frequencies is about 23 
dB in reference to the passband region. 


The program that produced this output is as follows: 
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RIG SYS 32010 FAMILY MACRO ASSEMBLER PCZ.1 24.107 11:14:45 03-06-55 
PAGE 0001 


OoO01 IDT “RIG SY5* 

O002 OPTION DUNLST, TUNLST 

OOOs + 

0004 # The program computes a few sections of the LOS 
OOOS + Stabilization System Rate Loop. 

OO04 # 

0007 + Constants 

0008 % The constant SCALE relates to the scaling factor 
O00? % through a relation? 

0010 # scaling factor = 2 ## SCALE (ex. 4094 = 2 ## 12). 
Oot! # Scale constants 12 and 15 are available. 

0012 Oo0oc SCALE ERU {Zz SCALING FACTOR (012) 
OO13 + 

0014 + Data memory map 

001s 0060 DORG 9) DATA MEM START ADRS 

0014 0000 0000 MODE DATA $ AIB MODE 

0017 0001 O001 RATE DATA $ AIB RATE 

0015 # Coefficient table 

0019 0002 0002 NOS DATA $ FILTER #3 COEFF “S 


00Z0 O003 0003 N13 DATA 
0021 0004 0004 NZS DATA 
OO2ZZ 0005 0005 DIS DATA 


0023 0006 0006 D123 BATA 
0024 0007 0007 NOZ BATA FILTER #2 COEFF’S 
0025 0008 0008 N12 DATA 
0024 0009 OOOF NZ DATA 


0027 O0OOA OOOAR Diz DATA 
0025 OOOB OOOB [ze DATA 
0029 O00L OOOC NOI DATA 
0030 O00D OOOD N11 DATA 
0031 OOOE OOOE NZ1 DATA 
OO32 OOOF OOOF O11 DATA 
0033 0010 0010 Del DATA 


FILTER #1 COEFF’S 


eee RR KH HH HTH RH HH HF HH HM HM H HK 


0034 QOO10 COEFFS EQU =] 

0035 + State var table 

0036 0011 0011 X03 DATA FILTER #3 STATES 
0037 OO12 OO1Z X13 DATA 

0038 0013 0013 X23 DATA 

0039 0014 0014 X02 DATA FILTER #2 STATES 
0040 0015 0015 X12 DATA 

0041 0014 0014 X22 DATA 

0042 0017 0017 X01 DATA FILTER #1 STATES 
0043 0018 0015 Xiil DATA 

0044 0019 OO19 XzZi DATA 

004s OO1?7 STATES ERU =f 

0044 + 

0047 OO1A 001A COMAND DATA $ COMMAND INPUT 
0048 OO1B OO1B QUTPLUT DATA $ SYSTEM OUTPUT 
0049 # 

0050 O0O1C O0O1C MAX1& DATA $ MAX 2-COMPLEMENT NUM IN 14 BITS 
0051 O01D OO1D MINI& BATA $ MIN 2-COMPLEMENT NUM IN 14 BITS 
00352 OO1C MASKI EU MAX14 MASK 

OO33 O0O1D MASK2 EU MIN14 MASK 

0054 OO1E OO1E UNE LATA $ ONE 

1018 bara) # 

0054 +t 

0037 0000 ARG 0 
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0058 0000 
0001 

0059 

0040 

0061 

00462 0002 

0063 0004 

0064 0009 

0065 OO0E 

0046 0013 

0067 0016 

0048 0019 

0069 O001C 

0070 OO1E 

0071 0020 

0072 

0073 

0074 

0075 

0076 

0077 0021 
0022 

0078 

0079 

0080 0023 
0024 

0031 0025 
0026 

0082 

0083 

0084 

0085 0027 

0086 0028 

0087 0029 

0088 002A 

0089 

0090 

0091 O02B 

0092 002C 

0093 O002D 

0094 OOZE 
002F 

0095 0030 
0031 

0096 

0097 

0093 0032 

0099 0033 

0100 0034 

0101 0035 

0102 0036 

0103 

0104 

0105 0037 
0033 

0106 

0107 
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F200 
0021 


0002 
000A 
0000 
ODSD 
OF 80 
0000 
0000 
0000 
0000 
7FFEF 
0001 
0020 


0021 
F800 
0039 


F&00 
0027 
F900 
0023 


0027 
421A 
661A 
781iC 
SOLA 


2C1A 
7010 
7119 
F800 
0046 
F800 
0046 


SC1B 
661B 
781D 
301B 
4A1B 


F900 
0023 


TABLE 


TBLEND 


GET 


PAGE 0002 


B START RESTART VECTOR 


Table in prog memory 


EQU $ 
DATA >A, >4DB AIB MODE AND RATE 
DATA 0,0,0,0,0 FILTER #3 COEFF’S 


DATA  3421,-1118,3421,1118,-2746 900 HZ NOTCH FILTER 
DATA 3968, 7513, 3968,-7513,-3840 1800 HZ NOTCH FILTER 


DATA 0,0,0 FILTER #3 INITIAL STATES 
DATA 0,0,0 900 HZ NOTCH INITIAL STATES 
DATA 0,9,0 1800 HZ NOTCH INITIAL STATES 
DATA 0,0 COMMAND INPUT, SYSTEM OUTPUT 
DATA 32747 , -32768 MAX AND MIN 16 BIT NUMBERS 
DATA 1 ONE 

EQU $-1 


Initialize the system 
EQU $ 
CALL INIT INITIALIZATION ROUTINE 


Wait on sample 
BIQZ GET 


B WAIT 


Input sample 


EQU $ 

IN COMAND , FA2 INPUT COMMAND 

ZALS COMAND GET COMMAND 

XOR MASK 1 CORRECT A/D FORMAT 

SACL COMAND UPDATE COMMAND 

Process sample 

LAC COMAND , SCALE LOAD SCALED COMMAND 

LARK ARO, COEFFS ARO = PTR TO COEFF TABLE 
LARK AR1, STATES ARi = PTR TP STATE VAR TABLE 
CALL FILTR2 1800 HZ NOTCH FILTER 
CALL FILTR2 900 HZ NOTCH FILTER 


Output sample 
SACH OUTPUT ;14-SCALE STORE OUTPUT 


ZALS OUTPUT GET OUTPUT 

XOR MASK2 CORRECT FOR D/A FORMAT 
SACL OUTPUT UPDATE OUTPUT 

OUT DUTPUT , PAZ AND SEND IT OUT 


Repeat the sequence 
B WAIT GO GET NEXT SAMPLE 
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0108 # System initialization routine. The routine initializes 
010% # the TMS32010 and other system components, 
0110 # The calling sequence is: 
O111 # CALL INIT 
O112 % 
O113 + Initialize the 32010 
0114 0039 INIT Ei $ 
O115 0039 7FE1 DINT DISABLE INTERRUPTS 
0116 OO3SA 7FRB SDVM SET OVERFLOW MODE 
0117 # 
0118 + Initialize Data Memory 
0119 OO3B 6E00 LDPE 0 WISE PAGE 0 
0120 O0O30 46580 LARF 6) USE ARO 
0121 O003D 7O1E LARK ARO, TBLEND-TABLE INIT PTR TO END OF DATA 
O12 OO3E 7EZO LACK TBLEND INIT FTR TO END OF TABLE 
0123 OO3F XFER EQU $ 
0124 OOSF 4755 TBLR # XFER FROM FROG TO DATA MEM 
0125 0040 1LO1E SUB NE BUMF PTR DOWN 
0124 0041 F400 BANZ XFER GO XFER MORE 
0042 003F 
0127 # 
0128 % Inititialize AIB 
O129 0043 42800 GLUT MODE, PAO AIB made 
0130 0044 4B01 OWT RATE, FAS AIB RATE 
0131 # 
0132 0045 7F8D RET RETURN 
0133 + 
0134 # 
O13 + First and second order filter routine. Computes an IIR 
0136 # filter using Direct Form II algorithm and adapts to a 
0137 + scaling scheme defined in the calling program. 
0138 * 
0137 # The routine incorporates overflow handling code upon 
0140 * computing the intermediate value and the output. 
0141 + 
0142 # The calling sequence is: 
0143 # ACC = scaled filter input 
0144 + ARG = ptr to coeff table 
0145 + ARI = ptr to state var table 
0144 # CALL FILTR2 
0147 # ACC = scaled filter output 
0148 + ARO = ptr ta next set of coeff’s 
O149 + AR1 = ptr to next set of state vars 
0150 # 
0151 0044 FILTRe EQU $ 
O152 0046 6361 LARP AR1 USE ARI 
0153 # 
0154 # Compute intermediate value 
0155 0047 6A?0 LT #-_ ARO T=X2 
01546 0045 4D?1 MPY #- ARI MPY X2#D2 
0157 0047 GCAO LTA #+, ARO T=X1, ACC=KU+X2#D2 
0158 004A 4071 MPY #-, AR1 MFY X1i#D1 
0159 OO4B8 6C92 LTA *- T=X2, ACCHKU+X2#O2+X1#D1 
0140 O04C 4898 MAR #- ARI=FTR TO XO 
O141 
01462 # Round, store and check for intermediate overflow 


0163 004D FAOO BLZ LBL10 CHECK FOR +/- RESULT 
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0144 
0165 
01464 
0167 


O148 
O169 
0170 


0171 
0172 
0173 
0174 
0175 


0174 
0177 
0178 
0179 
0180 
0151 
0182 
0183 
0184 
0135 
0136 
0157 
0188 
0189 
0170 
O191 


0192 
0193 
0124 
O195 


O1%6 
0197 


O19s 
O199 
0200 
0201 
0202 


0203 


0204 


0205 
0206 
0207 
0208 
0209 
0210 
0211 
0212 


O004E 
OO4F 
0050 
0051 
0052 
0053 
0054 
0055 
0054 
0057 


00568 
oos? 
OOSA 
O0O5B 
ooSC 
OoSD 
OOSE 


OOSF 
0060 
0061 
0062 
0063 
0064 
0065 
0066 


00467 
0068 
O06? 
006A 
004B 
O04C 
006D 
OO6E 
OO6F 
0070 


0071 
0072 
0073 
0074 
0075 
0076 
0077 
0078 


007? 


007A 


NO ERRORS, 
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0033 
OBIE 
aCe 
1Cic 
FROO 
QOOSF 
6610 
53088 
F900 
OOSF 
0058 
1B1E 
5C83 
1CiD 
FDROO 
OOSF 
441D 
S088 


OO3F 
43A0 
6D91 
7F389 
6B90 
6D91 
&B90 
4D? 1 
TF SF 


FAQO 
0071 
OBIE 
3C1B 
1C1iC 
FBOO 
0079 
2C1ic 
FYOO 
007A 
0071 
1B1E 
SC1iB 
1ciD 
FDOO 
0079 
2Cil 
F200 
007A 


0079 
2C1B 
007A 
7F8D 


NO WARNINGS 


LBL10 


LBL20 


LBL3O 


+ 
LBL40 


LBL50 


# 


ADD 
SACH 
SUB 
BLEZ 


EGU 
SUB 
SACH 
SUB 
BIGEZ 


ZALS 
SACL 


Compute 
EQU 

MAR 

MPY 

ZAC 

LTO 

MPY 

LTD 
MFY 
APAC 


PC2. 


ONE, SCALE-1 
#, 14-SCALE 

MAX14, SCALE 
LBL20 


MAX14 
ca 


LBL2ZO 


$ 

ONE, SCALE-1 
#, 14-SCALE 
MINIG, SCALE 
LBLZ0 


MINIG 
2 


filter output 
$ 

#+, ARO 

#- ARI 


#- ARO 
#-, AR1 
#-, ARO 
#-  AR1 
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ROUND 

WFDATE INTERMEDIATE VAL 
SUBTRACT SCALED MAX FOS NUMBER 
IF ACC<=0 THEN NO OVERFLOW 


OVERFLOW, LOAD MAX FOS NUMBER 
UPOATE INTERMEDIATE VALUE 
60, COMPUTE QUTFUT 


ROUND 

UPDATE INTERMEDIATE VAL 
SUBTRACT SCALED MIN NEG NUMBER 
IF ACC>=0 THEN NO OVERFLOW 


OVERFLOW, LOAD MIN NEG NUMBER 
UPDATE INTERMEDIATE VALUE 


USE ARO 

MPY XZ#N2 

CLR ACC 

T=X1, ACC=X2#N2, UPDATE x2 

MEY X1#Ni 

T=XO0, ACC=X2#N2+X1#N1, UPDATE X1 
MEY XO#NO 

ALC=X2#N2+X 1#N1+XO#NO 


Check for output overflow 


BLZ 


ADD 
SACH 
SUB 
BLEZ 


LAC 
B 


EQU 
SUB 
SACH 
SUB 
BGEZ 


LAC 
B 


EQu 
LAC 
EQU 
RET 


END 


LBL30 


CINE, SCALE-1 
OUTPUT, 14—SCALE 
MAX14, SCALE 
LBL40 


MAX14, SCALE 
LBLSO 


$ 

ONE, SCALE-1 
OUTPUT, 14-SCALE 
MIN14, SCALE 
LBL40 


MIN16, SCALE 
LBLS50 


$ 
OUTPUT, SCALE 
$ 


CHECK FOR +/- RESULT 


ROUND 

LIPDATE OUTFLT 

SUBTRACT SCALED MAX FOS NUMBER 
IF ACE<=0 THEN NO OVERFLOW 


OVERFLOW, LOAD MAX FOS NUMBER 
bO, RETURN 


ROUND 

LIPDATE OUTPUT 

SUBTRACT SCALED MIN NEG NUMBER 
IF ACC>=0 THEN NO OVERFLOW 


OVERFLOW, LOAD MIN NEG NUMBER 
60, RETURN 


RESTORE ACC 


RETURN 
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